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PREFACE 


The  chief  objective  of  the  fourth  edition  is  to  respond  to  the  tremendous  amount  of  tech¬ 
nological  progress  in  communication  systems  we  have  witnessed  over  the  decade  since 
the  third  edition  was  published.  At  the  same  time,  newer  software  and  teaching  tools  have 
also  become  available,  making  it  much  easier  to  provide  solid  and  illustrative  examples  as 
well  as  more  experimental  opportunities  for  students.  In  this  new  edition,  major  changes  are 
implemented  to  incorporate  recent  technological  advances  of  telecommunications.  To  captivate 
students’  attention  and  make  it  easier  for  students  to  relate  the  course  materials  to  their  daily 
experience  with  communication  tools,  we  will  provide  relevant  information  on  the  operation 
and  features  of  cellular  systems,  wireless  local  area  networks  (LANs),  and  wire-line  (digital 
subscriber  loop  or  DSL)  internet  services,  among  others. 

Major  Revision 

A  number  of  critical  changes  are  motivated  by  the  need  to  emphasize  the  fundamentals  of 
digital  communication  systems  that  have  permeated  our  daily  lives.  Specifically,  in  light  of  the 
widespread  applications  of  new  technologies  such  as  spread  spectrum  and  orthogonal  frequency 
division  multiplexing  (OFDM),  we  present  two  new  chapters:  Chapter  12  on  spread  spectrum 
communications  and  Chapter  13  on  frequency-selective  channels  and  OFDM  systems.  As 
practical  examples  of  such  systems,  we  provide  a  basic  introduction  of  current  wireless  commu¬ 
nication  standards  including  cellular  systems  and  IEEE  802.1  la/b/g/n  wireless  LAN  systems. 
In  addition,  we  summarize  the  latest  in  DSL  modem  technology  and  services.  At  the  funda¬ 
mental  level,  information  theory  and  coding  have  also  been  transformed  by  several  important 
new  progresses.  In  this  edition,  we  include  the  basic  principles  of  multiple-input-multiple- 
output  (MIMO)  technology,  which  has  just  begun  to  see  broad  commercial  applications.  We 
also  cover  several  notable  breakthroughs  in  error  correction  coding,  including  soft  decoding, 
turbo  codes,  and  low-density  parity  check  (LDPC)  codes. 

To  enhance  the  learning  experience  and  to  give  students  opportunities  for  computer-based 
experimental  practices,  relevant  MATLAB  examples  and  exercises  have  been  provided  in 
chapters  that  can  be  enhanced  by  these  hands-on  experiments. 

Organization 

With  respect  to  organization,  we  begin  the  fourth  edition  with  a  traditional  review  of  signal  and 
system  fundamentals  before  proceeding  to  the  core  communication  topics  of  analog  modulation 
and  digital  pulse-coded  modulation.  We  then  present  the  fundamental  tools  of  probability  theory 
and  random  processes  to  be  used  in  the  design  and  analysis  of  digital  communications  in  the  rest 
of  this  text.  After  the  fundamentals  of  digital  communication  systems  have  been  covered,  the 
last  two  chapters  provide  an  overview  of  information  theory  and  the  fundamentals  of  forward 
error  correction  codes. 
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Ideally  the  communications  subjects  germane  to  this  text  should  be  covered  in  two 
courses  one  on  the  basic  operations  of  communication  systems  and  one  on  the  analysis  of 
modern  communication  systems  under  noise  and  other  distortions.  The  former  rel.es  heav¬ 
ily  (,n  deterministic  analytical  tools  such  as  Fourier  series,  the  Fourier  transform,  and  the 
sampling  theorem,  while  the  latter  relies  on  tools  from  probability  and  random  processes  to 
tackle  the  unpredictability  of  message  signals  and  noises.  In  today’s  academic  environment, 
however,  with  so  many  competing  courses,  it  may  be  difficult  to  squeeze  two  basic  courses 
on  communications  into  a  typical  electrical  engineering  curriculum.  Some  universities  do 
require  a  course  in  probability  and  random  processes  as  a  prerequisite.  In  that  case,  it  is  pos¬ 
sible  to  cover  both  areas  reasonably  well  in  a  one-semester  course.  This  book  is  designed  tor 
adoption  in  both  cases.  It  can  be  used  as  a  one-semester  course  in  which  the  deterministic 
aspects  of  communication  systems  are  emphasized  with  little  consideration  ot  the  effects  ot 
noise  and  interference.  It  can  also  be  used  for  a  course  that  deals  with  both  the  deterministic 
and  the  probabilistic  aspects  of  communication  systems.  The  book  is  self-contained,  since 
it  provides  all  the  necessary  background  in  probabilities  and  random  processes.  As  stated 
earlier,  however,  if  both  deterministic  and  probabilistic  aspects  of  communications  are  to  be 
covered  in  one  semester,  it  is  highly  desirable  for  students  to  have  a  good  background  in 
probabilities. 

Chapter  1  introduces  a  panoramic  view  of  communication  systems.  Important  concepts  of 
communication  theory  are  explained  qualitatively  in  a  heuristic  way.  This  attracts  the  students 
to  the  topics  of  communications.  With  this  momentum,  students  are  motivated  to  study  the 
tools  of  signal  analysis  in  Chapters  2  and  3,  which  encourage  them  to  see  a  signal  as  a  vector, 
and  to  think  of  the  Fourier  spectrum  as  a  way  of  representing  a  signal  in  terms  of  its  vector 
components.  Chapters  4  and  5  discuss  amplitude  (linear)  and  angle  (nonlinear)  modulations, 
respectively.  Many  instructors  feel  that  in  this  digital  age,  modulation  should  be  deemphasized 
with  a  minimal  presence.  We  hold  the  view  that  modulation  is  not  so  much  a  method  of  com¬ 
munication  as  a  basic  tool  of  signal  processing;  it  will  always  be  needed,  not  only  in  the  area 
ot  communication  (digital  or  analog),  but  also  in  many  other  areas  of  electrical  engineering. 
Hence,  neglecting  modulation  may  prove  to  be  rather  shortsighted.  Chapter  6  serves  as  the 
fundamental  link  between  analog-and-digital  communications  by  describing  the  process  of 
analog-to-digital  conversion  (ADC).  It  provides  the  details  of  sampling,  pulse  code  modula¬ 
tion  (including  DPCM),  delta  modulation,  speech  coding  (vocoder),  image/video  coding,  and 
compression.  Chapter  7  discusses  the  principles  and  techniques  used  in  digital  modulations.  It 
introduces  the  concept  of  channel  distortion  and  presents  equalization  as  an  effective  means 
ot  compensating  for  distortion. 

(  haptcrs  S  and  )  provide  the  essential  background  on  theories  of  probability  and  random 
Fv  »r\  itt  i  ,1C  LOIJ1Ilr'SL  tllc  seconc*  t0°l  squired  for  the  study  of  communication  systems. 

“  7*1' 10  mo,lvale  *“*»  <"Kl  lo  elevate  their  interest  through  these  chapters 

r  *  TTiCa',0nS  praHemS  whcrev"  P<)ss'hle  Chapters  10  and 

(Xum  r  ,  8"al  commun'cadon  systems  in  the  presence  of  noise. 

cEXus^  h  8"al  OOIT,ntunication  is  thoroughly  presented  in  Chapter  1 1 . 
techniques  ta  ca  u  ’"‘T  “""T ^  Chap,.,  13  presents  various  practical 

both  channel  equaliralion  and  mlTiroadT annl?rr  ‘Jis,ortions' This  cM»er  wi"  capture 
an  overview  of  information  theory  Finajmfn hn°lo8y  “f  0FDM Chapter  14  provides 
control  coding  are  given  in  Chapter  15  ^  P  Clp  e  and  key  practical  aspects  of  error 

intimidating  experience  fol’smdm^  3  P,eaSan‘  °r  *  ,eaS‘ 3 

logically  organized  manner.  Every  efforth  h  *  th  subject. in  a  clear'  understandable,  and 
an  understanding— as  well  -c  k.,'  .:  dS.  een  made  t0  del>ver  an  insight — rather  than  just 


as  heiirictie  i  ”T  “““*v  dedver  an  insight — rather  than  just 

explanations  of  theoretical  results  wherever  possible. 
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Many  examples  are  provided  for  further  clarification  of  abstract  results.  Even  a  partial  success 
in  achieving  this  stated  goal  would  make  all  our  efforts  worthwhile. 

A  Whole  New  World 

There  have  been  a  number  of  major  technology  developments  since  the  publication  of  the  third 
edition  in  1998.  First  of  all,  the  cellular  telephone  has  deeply  penetrated  the  daily  lives  of  urban 
and  suburban  households  in  most  developed  and  even  developing  nations.  In  1998  there  were 
very  few  students  with  beepers  and  cellphones  in  the  classrooms.  Now,  nearly  every  college 
student  has  a  cellphone.  Second,  in  1998  most  of  the  household  internet  connections  were 
linked  via  low-speed  (of  approximately  28.8  kbit/s)  voice  modems.  Today,  a  majority  of  our 
students  are  connected  to  cyberspace  through  high  speed  DSL  or  cable  services.  In  addition, 
wireless  LAN  has  turned  esoteric  terms  such  as  IEEE  802. 1 1  into  household  names.  Most 
students  in  the  classroom  have  explored  these  technologies  to  some  extent. 

Vast  technological  advances  have  produced  a  new  generation  of  students  extremely  inter¬ 
ested  in  learning  about  the  new  technologies  and  their  implementations.  These  students  are 
eager  to  understand  how  and  where  they  may  be  able  to  make  contributions  in  the  future.  Such 
strong  motivation  must  be  encouraged  and  taken  advantage  of.  This  new  edition  will  enable 
instructors  either  to  cover  the  topics  themselves  or  to  assign  reading  materials  that  will  allow 
the  students  to  acquire  relevant  information.  The  new  edition  achieves  these  goals  by  stressing 
the  digital  aspects  of  the  text  and  by  incorporating  the  most  commonly  known  wireless  and 
wire-line  digital  technologies. 

Course  Adoption 

With  a  combined  teaching  experience  of  over  50  years,  we  have  taught  communication  classes 
under  both  quarter  and  semester  systems  in  several  major  universities.  On  the  other  hand,  the 
students'  personal  experiences  with  communication  systems  have  continuously  been  multiply¬ 
ing,  from  a  simple  radio  set  in  the  1960s,  to  the  turn  of  the  twenty-first  century,  with  its  easy 
access  to  wireless  LAN.  cellular  devices,  satellite  radio,  and  home  internet  services.  Hence, 
more  and  more  students  are  interested  in  learning  how  familiar  electronic  gadgets  work.  With 
this  important  need  and  our  past  experiences  in  mind,  we  revised  the  fourth  edition  of  this 
text  to  fit  well  within  several  different  curriculum  configurations.  In  all  cases,  basic  coverage 
should  teach  the  fundamentals  of  analog  and  digital  communications  (Chapters  1-7). 

One-Semester  Course  (without  strong  probability  background) 

In  many  existing  curricula,  undergraduate  students  are  not  exposed  to  simple  probability  tools 
until  they  begin  to  take  communications.  This  occurs  often  because  the  students  were  required 
to  take  an  introductory  statistical  course  disconnected  from  engineering  science.  This  text  is 
well  suited  to  students  of  such  a  background.  The  first  seven  chapters  form  a  comprehensive 
coverage  of  modern  digital  and  analog  communication  systems  for  average  ECE  undergraduate 
students.  Such  a  course  can  be  taught  within  one  semester  (40-45  instructional  hours).  Under 
the  premise  that  each  student  has  built  a  solid  background  in  Fourier  analysis  via  a  prerequisite 
class  on  signals  and  systems,  most  of  the  first  three  chapters  can  be  treated  as  a  review  in  a 
single  week.  The  rest  of  the  semester  can  be  fully  devoted  to  teaching  Chapters  4  to  7,  with 
partial  coverage  on  the  practical  systems  of  Chapters  12  and  13  to  enhance  students  interest. 

One-Semester  Course  (with  a  strong  probability  background) 

For  curricula  that  have  strengthened  the  background  coverage  of  probability  theory,  a  much 
more  extensive  coverage  of  digital  communications  can  be  achieved  within  one  semester. 
A  rigorous  probability  class  can  be  taught  within  the  context  of  signal  and  system  analysis 
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(cf.  Cooper  and  McGillem,  Probabilistic  Methods  of  Signal  and  System  Analysis ,  Oxford 
University  Press,  1999,  ISBN:  0195123549).  For  this  scenario,  in  addition  to  Chapters  1  to  7, 
Chapter  1 1  and  part  of  Chapter  13  on  equalization  can  also  be  taught  in  one  semester,  provided 
the  students  have  a  solid  probability  background  that  permits  the  coverage  of  Chapter  8  and 
Chapter  9  in  a  few  hours.  Students  completing  this  course  would  be  well  prepared  to  enter  the 
telecommunications  industry  or  to  continue  in  a  program  of  graduate  studies. 

Two-Semester  Series  (without  a  separate  probability  course) 

The  entire  text  can  be  thoroughly  covered  in  two  semesters  for  a  curriculum  that  does  not 
have  any  prior  probability  course.  In  other  words,  for  a  two-course  series,  the  goal  is  to  teach 
both  communication  systems  and  fundamentals  of  probabilities.  In  an  era  of  many  competing 
courses  in  the  ECE  curriculum,  it  is  hard  to  set  aside  two  semester  courses  for  communications 
alone.  On  the  other  hand,  most  universities  do  have  a  probability  course  that  is  separately 
taught  by  nonengineering  professors.  In  this  scenario  it  would  be  desirable  to  fold  probability 
theory  into  the  two  communication  courses.  Thus,  for  two  semester  courses,  the  coverage  can 
be  as  follows: 

•  1st  semester:  Chapters  1-7  (Signals  and  Communication  Systems) 

2nd  semester:  Chapters  8-13  (Modem  Digital  Communication  Systems) 

One-Quarter  Course  (with  a  strong  probability  background) 

In  a  quarter  system,  students  must  have  prior  exposure  to  probability  and  statistics  at  a  rigorous 
le\el  (cf.  C  ooperand  McGillem,  Probabilistic  Methods  of  Signal  and  System  Analysis,  Oxford 
University  Press.  1999,  ISBN:  0195123549).  They  must  also  have  solid  knowledge  of  Fourier 
analysis.  Within  a  quarter,  the  class  can  teach  the  basics  of  analog  and  digital  communication 
systems  (Chapters  3-7).  analysis  of  digital  communication  systems  (Chapter  1 1 ),  and  spread 
spectrum  communications  (Chapter  12). 


'  1st  quarter:  Chapters  1-10  (Communication  Systems 


are  communication  laboratories,  MATLAB  based 
e  available  to  enhance  the  learning  experience. 
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Students  will  be  able  to  design  systems  and  modify  their  parameters  to  evaluate  the  overall 
effects  on  the  performance  of  communication  systems  through  computer  displays  and  bit  error 
rate  measurement.  The  students  will  acquire  first-hand  knowledge  of  how  to  design  and  perform 
simulations  of  communication  systems. 
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INTRODUCTION 


Over  the  past  decade,  the  rapid  expansion  of  digital  communication  technologies  has 
been  simply  astounding.  Internet,  a  word  and  concept  once  familiar  only  to  technolo¬ 
gists  and  the  scientific  community,  has  permeated  every  aspect  of  people’s  daily  lives. 
It  is  quite  difficult  to  find  any  individual  in  a  modern  society  that  has  not  been  touched  by  new 
communication  technologies  ranging  from  cellular  phones  to  Bluetooth.  This  book  examines 
the  basic  principles  of  communication  by  electric  signals.  Before  modern  times,  messages 
were  carried  by  runners,  carrier  pigeons,  lights,  and  fires.  These  schemes  were  adequate  for  the 
distances  and  “data  rates”  of  the  age.  In  most  parts  of  the  world,  these  modes  of  communication 
have  been  superseded  by  electrical  communication  systems,*  which  can  transmit  signals  over 
much  longer  distances  (even  to  distant  planets  and  galaxies)  and  at  the  speed  of  light. 

Electrical  communication  is  dependable  and  economical;  communication  technologies 
improve  productivity  and  energy  conservation.  Increasingly,  business  meetings  are  conducted 
through  teleconferences,  saving  the  time  and  energy  formerly  expended  on  travel.  Ubiqui¬ 
tous  communication  allows  real-time  management  and  coordination  of  project  participants 
from  around  the  globe.  E-mail  is  rapidly  replacing  the  more  costly  and  slower  “snail  mails.” 
E-commerce  has  also  drastically  reduced  some  costs  and  delays  associated  with  marketing, 
while  customers  are  also  much  better  informed  about  new  products  and  product  information. 
Traditional  media  outlets  such  as  television,  radio,  and  newspapers  have  been  rapidly  evolving 
in  the  past  few  years  to  cope  with,  and  better  utilize,  the  new  communication  and  networking 
technologies.  The  goal  of  this  textbook  is  to  provide  the  fundamental  technical  knowledge 
needed  by  next-generation  communication  engineers  and  technologists  for  designing  even 
better  communication  systems  of  the  future. 


1 . 1  COMMUNICATION  SYSTEMS 

Figure  1 . 1  presents  three  typical  communication  systems:  a  wire-line  telephone-cellular  phone 
connection,  a  TV  broadcasting  system,  and  a  wireless  computer  network.  Because  of  the 
numerous  examples  of  communication  systems  in  existence,  it  would  be  unwise  to  attempt 
to  study  the  details  of  all  kinds  of  communication  systems  in  this  book.  Instead,  the  most 
efficient  and  effective  way  to  learn  about  communication  is  by  studying  the  major  func¬ 
tional  blocks  common  to  practically  all  communication  systems.  This  way,  students  are  not 


*  With  the  exception  of  the  postal  service. 
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Figure  1.1 

Some  examples 
of  communi¬ 
cations  sys¬ 
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Figure  1.2 
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The  channel  is  a  medium  of  choice  that  can  convey  the  electric  signals  at  the  transmitter 
output  over  a  distance.  A  typical  channel  can  be  a  pair  of  twisted  copper  wires  (telephone  and 
DSL),  coaxial  cable  (television  and  internet),  an  optical  fiber,  or  a  radio  link.  Additionally,  a 
channel  can  also  be  a  point-to-point  connection  in  a  mesh  of  interconnected  channels  that  form 
a  communication  network. 

The  receiver  reprocesses  the  signal  received  from  the  channel  by  reversing  the  signal 
modifications  made  at  the  transmitter  and  removing  the  distortions  made  by  the  channel.  The 
receiver  output  is  fed  to  the  output  transducer,  which  converts  the  electric  signal  to  its  original 
form — the  message. 

The  destination  is  the  unit  to  which  the  message  is  communicated. 

A  channel  is  a  physical  medium  that  behaves  partly  like  a  filter  that  generally  attenuates 
the  signal  and  distorts  the  transmitted  waveforms.  The  signal  attenuation  increases  with  the 
length  of  the  channel,  varying  from  a  few  percent  for  short  distances  to  orders  of  magni¬ 
tude  in  interplanetary  communications.  Signal  waveforms  are  distorted  because  of  physical 
phenomena  such  as  frequency-dependent  gains,  multipath  effects,  and  Doppler  shift.  For 
example,  a  frequency-selective  channel  causes  different  amounts  of  attenuation  and  phase 
shift  to  different  frequency  components  of  the  signal.  A  square  pulse  is  rounded  or  “spread 
out”  during  transmission  over  a  low-pass  channel.  These  types  of  distortion,  called  linear 
distortion,  can  be  partly  corrected  at  the  receiver  by  an  equalizer  with  gain  and  phase 
characteristics  complementary  to  those  of  the  channel.  Channels  may  also  cause  nonlin¬ 
ear  distortion  through  attenuation  that  varies  with  the  signal  amplitude.  Such  distortions 
can  also  be  partly  corrected  by  a  complementary  equalizer  at  the  receiver.  Channel  distor¬ 
tions,  if  known,  can  also  be  precompensated  by  transmitters  by  applying  channel-dependent 
predistortions. 

In  a  practical  environment,  signals  passing  through  communication  channels  not  only 
experience  channel  distortions  but  also  are  corrupted  along  the  path  by  undesirable  inter¬ 
ferences  and  disturbances  lumped  under  the  broad  term  noise.  These  interfering  signals  are 
random  and  are  unpredictable  from  sources  both  external  and  internal.  External  noise  includes 
interference  signals  transmitted  on  nearby  channels,  human-made  noise  generated  by  faulty 
contact  switches  of  electrical  equipment,  automobile  ignition  radiation,  fluorescent  lights  or 
natural  noise  from  lightning,  microwave  ovens,  and  cellphone  emissions,  as  well  as  elec¬ 
tric  storms  and  solar  and  intergalactic  radiation.  With  proper  care  in  system  design,  external 
noise  can  be  minimized  or  even  eliminated  in  some  cases.  Internal  noise  results  from  thermal 
motion  of  charged  particles  in  conductors,  random  emission,  and  diffusion  or  recombina¬ 
tion  of  charged  carriers  in  electronic  devices.  Proper  care  can  reduce  the  effect  of  internal 
noise  but  can  never  eliminate  it.  Noise  is  one  of  the  underlying  factors  that  limit  the  rate  of 
telecommunications. 

Thus  in  practical  communication  systems,  the  channel  distorts  the  signal,  and  noise  accu¬ 
mulates  along  the  path.  Worse  yet,  the  signal  strength  decreases  while  the  noise  level  remains 
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designing  modem  communication  systems. 


1 .2  ANALOG  AND  DIGITAL  MESSAGES 


Messages  are  digital  or  analog.  Digital  messages  are  ordered  combinations  of  finite  symbols  or 
codewords.  For  example,  printed  English  consists  of  26  letters,  1 0  numbers,  a  space,  and  several 
punctuation  marks.  Thus,  a  text  document  written  in  English  is  a  digital  message  constructed 
from  the  ASCII  keyboard  of  1 28  symbols.  Human  speech  is  also  a  digital  message,  because  it  is 
made  up  from  a  finite  vocabulary  in  a  language.1"  Music  notes  are  also  digital,  even  though  the 
music  sound  itself  is  analog.  Similarly,  a  Morse-coded  telegraph  message  is  a  digital  message 
constructed  from  a  set  of  only  two  symbols — dash  and  dot.  It  is  therefore  a  binary  message, 
implying  only  two  symbols.  A  digital  message  constructed  with  M  symbols  is  called  an  M  -ary 
message. 

Analog  messages,  on  the  other  hand,  are  characterized  by  data  whose  values  vary  over  a 
continuous  range  and  are  defined  for  a  continuous  range  of  time.  For  example,  the  temperature 
or  the  atmospheric  pressure  of  a  certain  location  over  time  can  vary  over  a  continuous  range  and 
can  assume  an  (uncountable)  infinite  number  of  possible  values.  A  piece  of  music  recorded  by 
a  pianist  is  also  an  analog  signal.  Similarly,  a  particular  speech  waveform  has  amplitudes  that 
vary  over  a  continuous  range.  Over  a  given  time  interval,  an  infinite  number  of  possible  different 
speech  waveforms  exist,  in  contrast  to  only  a  finite  number  of  possible  digital  messages. 


1 .2.1  Noise  Immunity  of  Digital  Signals 

It  is  no  secret  to  even  a  casual  observer  that  every  time  one  looks  at  the  latest  electronic 
communication  products,  newer  and  better  “digital  technology”  is  replacing  the  old  analog 
technology.  Y\  ithin  the  past  decade,  cellular  phones  have  completed  their  transformation  from 
the  first-generation  analog  AMPS  to  the  current  second-generation  fen  r.SM  rDMAl  and 


orse  code,  a  dash  can  be  transmitted  by  an  electri¬ 
cal!  be  transmitted  by  a  pulse  of  negative  amplitude 


•  Actually,  amplification  may  further  deteriorate 
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Figure  1.3 
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-A/2  (Fig  1.3a).  In  an  M-ary  case,  M  distinct  electrical  pulses  (or  waveforms)  are  used; 
each  of  the  M  pulses  represents  one  of  the  M  possible  symbols.  Once  transmitted,  the 
receiver  must  extract  the  message  from  a  distorted  and  noisy  signal  at  the  channel  output. 
Message  extraction  is  often  easier  from  digital  signals  than  from  analog  signals  because 
the  digital  decision  must  belong  to  the  finite-sized  alphabet.  Consider  a  binary  case:  two 
symbols  are  encoded  as  rectangular  pulses  of  amplitudes  A/2  and  -A/2.  The  only  deci¬ 
sion  at  the  receiver  is  to  select  between  two  possible  pulses  received;  the  fine  details  of 
the  pulse  shape  are  not  an  issue.  A  finite  alphabet  leads  to  noise  and  interference  immu¬ 
nity.  The  receiver’s  decision  can  be  made  with  reasonable  certainty  even  if  the  pulses 
have  suffered  modest  distortion  and  noise  (Fig.  1 .3).  The  digital  message  in  Fig.  1 ,3a  is  dis¬ 
torted  by  the  channel,  as  shown  in  Fig.  1.3b.  Yet,  if  the  distortion  is  not  too  large,  we  can 
recover  the  data  without  error  because  we  need  make  only  a  simple  binary  decision:  Is  the 
received  pulse  positive  or  negative?  Figure  1 .3c  shows  the  same  data  with  channel  distortion 
and  noise.  Here  again,  the  data  can  be  recovered  correctly  as  long  as  the  distortion  and  the 
noise  are  within  limits.  In  contrast,  the  waveform  shape  itself  in  an  analog  message  carries  the 
needed  information,  and  even  a  slight  distortion  or  interference  in  the  waveform  will  show  up 
in  the  received  signal.  Clearly,  a  digital  communication  system  is  more  rugged  than  an  analog 
communication  system  in  the  sense  that  it  can  better  withstand  noise  and  distortion  (as  long 
as  they  are  within  a  limit). 


1 .2.2  Viability  of  Distortionless  Regenerative  Repeaters 

One  main  reason  for  the  superior  quality  of  digital  systems  over  analog  ones  is  the  viability 
of  regenerative  repeaters  and  network  nodes  in  the  former.  Repeater  stations  are  placed  along 
the  communication  path  of  a  digital  system  at  distances  short  enough  to  ensure  that  noise 
and  distortion  remain  within  a  limit.  This  allows  pulse  detection  with  high  accuracy.  At  each 
repeater  station,  or  network  node,  the  incoming  pulses  are  detected  such  that  new,  “clean”  pulses 
are  retransmitted  to  the  next  repeater  station  or  node.  This  process  prevents  the  accumulation 
of  noise  and  distortion  along  the  path  by  cleaning  the  pulses  at  regular  repeater  intervals. 
We  can  thus  transmit  messages  over  longer  distances  with  greater  accuracy.  There  has  been 
widespread  application  of  distortionless  regeneration  by  repeaters  in  long-haul  communication 
systems  and  by  nodes  in  a  large  (possibly  heterogeneous)  network. 

For  analog  systems,  signals  and  noise  within  the  same  bandwidth  cannot  be  separated. 
Repeaters  in  analog  systems  are  basically  filters  plus  amplifiers  and  are  not  “regenerative.” 
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power.  Despite  these  limitations,  analog  communication  was  used  widely  and  successtully  in 
the  past  for  short-  to  medium-range  communications.  Nowadays,  because  ot  the  advent  ot 
optical  fiber  communications  and  the  dramatic  cost  reduction  achieved  in  the  fabrication  ot 
high-speed  digital  circuitry  and  digital  storage  devices,  almost  all  new  communication  sys¬ 
tems  being  installed  are  digital.  But  some  old  analog  communication  facilities  are  still  in  use, 
including  those  for  AM  and  FM  radio  broadcasting. 


1.2.3  Analog-to-Digital  (A/D)  Conversion 

Despite  the  differences  between  analog  and  digital  signals,  a  meeting  ground  exists  between 
them:  conversion  of  analog  signals  to  digital  signals  (A/D  conversion).  A  key  device  in 
electronics,  the  analog-to-digital  (A/D)  converter,  enables  digital  communication  systems  to 
convey  analog  source  signals  such  as  audio  and  video.  Generally,  analog  signals  are  continuous 
in  time  and  in  range;  that  is,  they  have  values  at  every  time  instant,  and  their  values  can  be  any¬ 
thing  within  the  range.  On  the  other  hand,  digital  signals  exist  only  at  discrete  points  of  time, 
and  they  can  take  on  only  finite  values.  A/D  conversion  can  never  be  100%  accurate.  Since, 
however,  human  perception  does  not  require  infinite  accuracy,  A/D  conversion  can  effectively 
capture  necessary  information  from  the  analog  source  for  digital  signal  transmission. 

Two  steps  take  place  in  A/D  conversion:  a  continuous  time  signal  is  first  sampled  into  a 
discrete  time  signal,  whose  continuous  amplitude  is  then  quantized  into  a  discrete  level  signal. 
1  irst,  the  Irequency  spectrum  of  a  signal  indicates  relative  magnitudes  of  various  frequency 
components.  I  he  sampling  theorem  (Chapter  6)  states  that  if  the  highest  frequency  in  the 
signal  spectrum  is  B  (in  hertz),  the  signal  can  be  reconstructed  from  its  discrete  samples, 
taken  unilormly  at  a  rate  not  less  than  2 B  samples  per  second.  This  means  that  to  preserve 
the  information  from  a  continuous-time  signal,  we  need  transmit  only  its  samples  (Fig.  1.4). 


Figure  1.4 

Anolog- 
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However,  the  sample  values  are  still  not  digital  because  they  lie  in  a  continuous  dynamic 
range.  Here,  the  second  step  of  quantization  comes  to  rescue.  Through  quantization,  each 
sample  is  approximated,  or  “rounded  off,”  to  the  nearest  quantized  level,  as  shown  in  Fig.  1 .4. 
As  human  perception  has  only  limited  accuracy,  quantization  with  sufficient  granularity  does 
not  compromise  the  signal  quality.  If  amplitudes  of  the  message  signal  m(t)  lie  in  the  range 
(—nip,  mp),  the  quantizer  partitions  the  signal  range  into  L  intervals.  Each  sample  amplitude 
is  approximated  by  the  midpoint  of  the  interval  in  which  the  sample  value  falls.  Each  sam¬ 
ple  is  now  represented  by  one  of  the  L  numbers.  The  information  is  thus  digitized.  Hence, 
after  the  two  steps  of  sampling  and  quantizing,  the  analog-to-digital  (A/D)  conversion  is 
completed. 

The  quantized  signal  is  an  approximation  of  the  original  one.  We  can  improve  the  accu¬ 
racy  of  the  quantized  signal  to  any  desired  level  by  increasing  the  number  of  levels  L. 
For  intelligibility  of  voice  signals,  for  example,  L  =  8  or  16  is  sufficient.  For  commercial 
use,  L  =  32  is  a  minimum,  and  for  telephone  communication,  L=  128  or  256  is  commonly 
used. 

Atypical  distorted  binary  signal  with  noise  acquired  over  the  channel  is  shown  in  Fig.  1 .3.  If 
A  is  sufficiently  large  in  comparison  to  typical  noise  amplitudes,  the  receiver  can  still  correctly 
distinguish  between  the  two  pulses.  The  pulse  amplitude  is  typically  5  to  10  times  the  rms  noise 
amplitude.  For  such  a  high  signal-to-noise  ratio  (SNR)  the  probability  of  error  at  the  receiver 
is  less  than  10~6;  that  is,  on  the  average,  the  receiver  will  make  fewer  than  one  error  per 
million  pulses.  The  effect  of  random  channel  noise  and  distortion  is  thus  practically  eliminated. 
Hence,  when  analog  signals  are  transmitted  by  digital  means,  some  error,  or  uncertainty,  in  the 
received  signal  can  be  caused  by  quantization,  in  addition  to  channel  noise  and  interferences. 
By  increasing  L,  we  can  reduce  to  any  desired  amount  the  uncertainty,  or  error,  caused  by 
quantization.  At  the  same  time,  because  of  the  use  of  regenerative  repeaters,  we  can  transmit 
signals  over  a  much  longer  distance  than  would  have  been  possible  for  the  analog  signal.  As 
will  be  seen  later  in  this  text,  the  price  for  all  these  benefits  of  digital  communication  is  paid 
in  terms  of  increased  processing  complexity  and  bandwidth  of  transmission. 


1.2.4  Pulse-Coded  Modulation— A  Digital  Representation 

Once  the  A/D  conversion  is  over,  the  original  analog  message  is  represented  by  a  sequence 
of  samples,  each  of  which  takes  on  one  of  the  L  preset  quantization  levels.  The  transmission 
of  this  quantized  sequence  is  the  task  of  digital  communication  systems.  For  this  reason, 
signal  waveforms  must  be  used  to  represent  the  quantized  sample  sequence  in  the  transmission 
process.  Similarly,  a  digital  storage  device  also  would  need  to  represent  the  samples  as  signal 
waveforms.  Pulse-coded  modulation  (PCM)  is  a  very  simple  and  yet  common  mechanism  for 
this  purpose. 

First,  one  information  bit  refers  to  one  binary  digit  of  1  or  0.  The  idea  of  PCM  is  to  represent 
each  quantized  sample  by  an  ordered  combination  of  two  basic  pulses:  p\  ( t )  representing  1  and 
po(t)  representing  0.  Because  each  of  the  L  possible  sample  values  can  be  written  as  a  bit  string 
of  length  log2  L,  each  sample  can  therefore  also  be  mapped  into  a  short  pulse  sequence  that 
represents  the  binary  sequence  of  bits.  For  example,  if  L  =  16,  then,  each  quantized  level  can 
be  described  uniquely  by  4  bits.  If  we  use  two  basic  pulses,  p\ (/)  =  A/2  and  po(t)  =  —A/2.  A 
sequence  of  four  such  pulses  gives  2x2x2x2  =  16  distinct  patterns,  as  shown  in  Fig.  1 .5. 
We  can  assign  one  pattern  to  each  of  the  16  quantized  values  to  be  transmitted.  Each  quantized 
sample  is  now  coded  into  a  sequence  of  four  binary  pulses.  This  is  the  principle  of  PCM 
transmission,  where  signaling  is  carried  out  by  means  of  only  two  basic  pulses  (or  symbols). 
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Figure  1.5 
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The  binary  case  is  of  great  practical  importance  because  of  its  simplicity  and  ease  of  detection. 
Much  of  today’s  digital  communication  is  binary.* 


/viinougn  i  im  was  invented  by  r.  M.  Rainey  in  1926  and  rediscovered  by  A.  H.  Reeves  in 
1 030.  it  was  not  until  the  early  1960s  that  the  Bell  System  installed  the  first  communication  link 
using  PC  M  lor  digital  voice  transmission.  The  cost  and  size  of  vacuum  tube  circuits  were  the 
chiet  impediments  to  the  use  ot  PCM  in  the  early  days  before  the  discovery  of  semiconductor 
devices.  It  was  the  transistor  that  made  PCM  practicable. 

Prom  all  these  discussions  on  PCM,  we  arrive  at  a  rather  interesting  (and  to  certain  extent 
not  obvious)  conclusion  that  every  possible  communication  can  be  carried  on  with  a  mini¬ 
mum  of  two  symbols.  Thus,  merely  by  using  a  proper  sequence  of  a  wink  of  the  eye,  one  can 
convey  any  message,  be  it  a  conversation,  a  book,  a  movie,  or  an  opera.  Every  possible  detail 
( such  as  various  shades  of  colors  of  the  objects  and  tones  of  the  voice,  etc.)  that  is  reproducible 

on  a  movie  screen  or  on  the  high-definition  color  television  can  be  conveyed  with  no  less 
accuracy,  merely  by  winks  of  an  eye.1  y 


‘  An  intermediate  case  exists  where  we  use  four  tv..;,.  .  / 

sequence  of  two  quaternary  pulses  can  form  4x4  ir,  qufternaT'  Pulses)  of  amplitudes  ±A/2  and  ±3A/2.  A 
Of  course,  to  convey  the  information  inTmov-  =  ?  *  'S  °f  values- 

at  an  inhumanly  high  speed.  Forexamnle  ih,>  um?  *C  eV. .  on  Pr°gram  in  real  time,  the  winking  would  have  to  be 
xample,  the  HDTV  stgnal  ,s  represented  by  19  million  bits  (winks)  per  second. 
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1 .3  CHANNEL  EFFECT,  SIGNAL-TO-NOISE  RATIO 
AND  CAPACITY 

In  designing  communication  systems,  it  is  important  to  understand  and  analyze  important  fac¬ 
tors  such  as  the  channel  and  signal  characteristics,  the  relative  noise  strength,  the  maximum 
number  ot  bits  that  can  be  sent  over  a  channel  per  second,  and,  ultimately,  the  signal 
quality. 


1.3.1  Signal  Bandwidth  and  Power 

In  a  given  (digital)  communication  system,  the  fundamental  parameters  and  physical  limita¬ 
tions  that  control  the  rate  and  quality  are  the  channel  bandwidth  B  and  the  signal  power  Ps. 
Their  precise  and  quantitative  relationships  will  be  discussed  in  later  chapters.  Here  we  shall 
demonstrate  these  relationships  qualitatively. 

The  bandwidth  of  a  channel  is  the  range  of  frequencies  that  it  can  transmit  with  reasonable 
fidelity.  For  example,  if  a  channel  can  transmit  with  reasonable  fidelity  a  signal  whose  frequency 
components  vary  from  0  Hz  (dc)  up  to  a  maximum  of  5000  Hz  (5  kHz),  the  channel  bandwidth 
B  is  5  kHz.  Likewise,  each  signal  also  has  a  bandwidth  that  measures  the  maximum  range  of 
its  frequency  components. 

The  faster  a  signal  changes,  the  higher  its  maximum  frequency  is,  and  the  larger  its 
bandwidth  is.  Signals  rich  in  content  that  changes  quickly  (such  as  those  for  battle  scenes  in 
a  video)  have  larger  bandwidth  than  signals  that  are  dull  and  vary  slowly  (such  as  those  for  a 
daytime  soap  opera  or  a  video  of  sleeping  animals).  A  signal  can  be  successfully  sent  over  a 
channel  if  the  channel  bandwidth  exceeds  the  signal  bandwidth. 

To  understand  the  role  of  B.  consider  the  possibility  of  increasing  the  speed  of  information 
transmission  by  compressing  the  signal  in  time.  Compressing  a  signal  in  time  by  a  factor 
of  2  allows  it  to  be  transmitted  in  half  the  time,  and  the  transmission  speed  (rate)  doubles. 
Time  compression  by  a  factor  of  2,  however,  causes  the  signal  to  “wiggle”  twice  as  fast, 
implying  that  the  frequencies  of  its  components  are  doubled.  Many  people  have  had  firsthand 
experience  of  this  effect  when  playing  a  piece  of  audiotape  twice  as  fast,  making  the  voices  of 
normal  people  sound  like  the  high-pitched  speech  of  cartoon  characters.  Now,  to  transmit  this 
compressed  signal  without  distortion,  the  channel  bandwidth  must  also  be  doubled.  Thus,  the 
rate  of  information  transmission  that  a  channel  can  successfully  carry  is  directly  proportional 
to  B.  More  generally,  if  a  channel  of  bandwidth  B  can  transmit  N  pulses  per  second,  then 
to  transmit  KN  pulses  per  second  by  means  of  the  same  technology,  we  need  a  channel  of 
bandwidth  KB.  To  reiterate,  the  number  of  pulses  per  second  that  can  be  transmitted  over  a 
channel  is  directly  proportional  to  its  bandwidth  B. 

The  signal  power  Ps  plays  a  dual  role  in  information  transmission.  First,  Ps  is  related  to 
the  quality  of  transmission.  Increasing  Ps  strengthens  the  signal  pulse  and  diminishes  the  effect 
of  channel  noise  and  interference.  In  fact,  the  quality  of  either  analog  or  digital  communication 
systems  varies  with  the  signal-to-noise  ratio  (SNR).  In  any  event,  a  certain  minimum  SNR  at 
the  receiver  is  necessary  for  successful  communication.  Thus,  a  larger  signal  power  Ps  allows 
the  system  to  maintain  a  minimum  SNR  over  a  longer  distance,  thereby  enabling  successful 
communication  over  a  longer  span. 

The  second  role  of  the  signal  power  is  less  obvious,  although  equally  important.  From 
the  information  theory  point  of  view,  the  channel  bandwidth  B  and  the  signal  power  Ps  are, 
to  some  extent,  exchangeable;  that  is,  to  maintain  a  given  rate  and  accuracy  of  information 
transmission,  we  can  trade  Ps  for  B,  and  vice  versa.  Thus,  one  may  use  less  B  if  one  is  willing 
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increase  P„  or  one  may  redu«  P,  if  one  is  given  bigger  «.  The  ngorons  prof  of  this  wii, 

be  provided  in  Chapler  14.  ,he  bandwidth  and  ihe  transmitted 

In  short,  the  two  penary  contntunteahon  re»met « «  ^  tha„  ,he  olher. 

hand,  in  space  vehicles,  huge  bandwidth  is  available  but  the  power  is  severely  limited.  Hence, 
^mm.miratinn  solutions  in  the  two  cases  are  radically  difterent. 


1 .3.2  Channel  Capacity  and  Data  Rate 

Channel  bandwidth  limits  the  bandwidth  of  signals  that  can  successfully  pass  through,  whereas 
signal  SNR  at  the  receiver  determines  the  recoverability  of  the  transmitted  signals.  H  ig  ki 
means  that  the  transmitted  signal  pulse  can  use  more  signal  levels,  thereby  carrying  more  bits 
with  each  pulse  transmission.  Higher  bandwidth  B  also  means  that  one  can  transmit  more 
pulses  (faster  variation)  over  the  channel.  Hence,  SNR  and  bandwidth  B  can  both  altect  the 
underlying  channel  "throughput.”  The  peak  throughput  that  can  be  reliably  carried  by  a  channel 
is  defined  as  the  channel  capacity. 

One  of  the  most  commonly  encountered  channels  is  known  as  the  additive  white  Gaussian 
noise  (AWGN)  channel.  The  AWGN  channel  model  assumes  no  channel  distortions  except 
for  the  additive  white  Gaussian  noise  and  its  finite  bandwidth  B.  This  ideal  model  captures 
application  cases  with  distortionless  channels  and  provides  a  performance  upper  bound  tor 
more  general  distortive  channels.  The  band-limited  AWGN  channel  capacity  was  dramatically 
highlighted  by  Shannon’s  equation, 

C  =  Slog2(l  +  SNR)  bit/s  0D 


Here  the  channel  capacity  C  is  the  upper  bound  on  the  rate  of  information  transmission  per 
second.  In  other  words,  C  is  the  maximum  number  of  bits  that  can  be  transmitted  per  second 
with  a  probability  of  error  arbitrarily  close  to  zero;  that  is,  the  transmission  is  as  accurate  as  one 
desires.  The  capacity  only  points  out  this  possibility,  however;  it  does  not  specify  how  it  is  to  be 
realized.  Moreover,  it  is  impossible  to  transmit  at  a  rate  higher  than  this  without  incurring  errors. 
Shannon  s  equation  clearly  brings  out  the  limitation  on  the  rate  of  communication  imposed  by  B 
and  SNR.  It  there  is  no  noise  on  the  channel  (assuming  SNR  =  oo),  then  the  capacity  C  would 
be  "c,  and  communication  rate  could  be  arbitrarily  high.  We  could  then  transmit  any  amount  of 
information  in  the  world  over  one  noiseless  channel.  This  can  be  readily  verified.  If  noise  were 
/ero.  there  would  be  no  uncertainty  in  the  received  pulse  amplitude,  and  the  receiver  would 
be  able  to  detect  any  pulse  amplitude  without  error.  The  minimum  pulse  amplitude  separation 
can  be  arbitrarily  small,  and  for  any  given  pulse,  we  have  an  infinite  number  of  fine  levels 
ava.  able.  We  can  assign  one  level  to  every  possible  message.  Because  an  infinite  number  of 
evels  are  available,  it  is  possible  to  assign  one  level  to  any  conceivable  message.  Cataloging 
such  a  code  may  not  be  practical,  but  that  is  beside  the  point.  Rather,  the  point  is  that  if  the 
i  C  i  Ll>miminuu,lon  ceases  to  be  a  problem,  at  least  theoretically.  Implementation 
would  be  difficult  because  of  the  requirement  of  generation  and  detection 
clmmunicLmn  l,  praCtical  difficulties  would  then  set  a  limit  on  the  rate  of 

limit  on  the  rate  of  comir  remembered  that  Shannon’s  result,  which  represents  the  upper 
monstrous  and  3  ctlanne1,  would  be  achievable  only  with  a  system  of 

Practical  „*m,  opt*  aZSwtltaoT  rc“P'i0n  appr°aChin8 
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In  conclusion.  Shannon  s  capacity  equation  demonstrates  qualitatively  the  basic  role 
played  by  B  and  SNR  in  limiting  the  performance  of  a  communication  system.  These  two 
parameters  then  represent  the  ultimate  limitation  on  the  rate  of  communication.  The  possi¬ 
bility  of  resource  exchange  between  these  two  basic  parameters  is  also  demonstrated  by  the 
Shannon  equation. 

As  a  practical  example  ot  trading  SNR  for  bandwidth  B ,  consider  the  scenario  in  which  we 
meet  a  soft-spoken  man  who  speaks  a  little  bit  too  fast  for  us  to  fully  understand.  This  means 
that  as  listeners,  our  bandwidth  B  is  too  low  and  therefore,  the  capacity  C  is  not  high  enough  to 
accommodate  the  rapidly  spoken  sentences.  However,  if  the  man  can  speak  louder  (increasing 
power  and  hence  the  SNR),  we  are  likely  to  understand  him  much  better  without  changing 
anything  else.  This  example  illustrates  the  concept  of  resource  exchange  between  SNR  and 
B.  Note,  however,  that  this  is  not  a  one-to-one  trade.  Doubling  the  speaker  volume  allows  the 
speaker  to  talk  a  little  faster,  but  not  twice  as  fast.  This  unequal  trade  effect  is  fully  captured  by 
Shannon  s  equation  [Eq.  (1.1)],  where  doubling  the  SNR  cannot  always  compensate  the  loss 
of  B  by  50%. 


1 .4  MODULATION  AND  DETECTION 

Analog  signals  generated  by  the  message  sources  or  digital  signals  generated  through  A/D 
conversion  of  analog  signals  are  often  referred  to  as  baseband  signals  because  they  typically 
are  low  pass  in  nature.  Baseband  signals  may  be  directly  transmitted  over  a  suitable  channel 
(e.g.,  telephone,  fax).  However,  depending  on  the  channel  and  signal  frequency  domain  char¬ 
acteristics,  baseband  signals  produced  by  various  information  sources  are  not  always  suitable 
for  direct  transmission  over  a  given  channel.  When  signal  and  channel  frequency  bands  do 
not  match  exactly,  channels  cannot  be  moved.  Hence,  messages  must  be  moved  to  the  right 
channel  frequency  bandwidth.  Message  signals  must  therefore  be  further  modified  to  facilitate 
transmission.  In  this  conversion  process,  known  as  modulation,  the  baseband  signal  is  used 
to  modify  (i.e.,  modulate),  some  parameter  of  a  radio-frequency  (RF)  carrier  signal. 

A  carrier  is  a  sinusoid  of  high  frequency.  Through  modulation,  one  of  the  carrier  sinusoidal 
parameters — such  as  amplitude,  frequency,  or  phase — is  varied  in  proportion  to  the  baseband 
signal  m(r).  Accordingly,  we  have  amplitude  modulation  (AM),  frequency  modulation  (FM), 
or  phase  modulation  (PM).  Figure  1.6  shows  a  baseband  signal  m(f)  and  the  corresponding 
AM  and  FM  waveforms.  In  AM,  the  carrier  amplitude  varies  in  proportion  to  m(t),  and  in 
FM,  the  carrier  frequency  varies  in  proportion  m(t).  To  reconstruct  the  baseband  signal  at  the 
receiver,  the  modulated  signal  must  pass  through  a  reversal  process  called  demodulation. 

As  mentioned  earlier,  modulation  is  used  to  facilitate  transmission.  Some  of  the  important 
reasons  for  modulation  are  given  next. 

1 .4. 1  Ease  of  Radiation/Transmission 

For  efficient  radiation  of  electromagnetic  energy,  the  radiating  antenna  should  be  on  the  order 
of  a  fraction  or  more  of  the  wavelength  of  the  driving  signal.  For  many  baseband  signals,  the 
wavelengths  are  too  large  for  reasonable  antenna  dimensions.  For  example,  the  power  in  a 
speech  signal  is  concentrated  at  frequencies  in  the  range  of  100  to  3000  Hz.  The  corresponding 
wavelength  is  100  to  3000  km.  This  long  wavelength  would  necessitate  an  impractically  large 
antenna.  Instead,  by  modulating  a  high-frequency  carrier,  we  effectively  translate  the  signal 
spectrum  to  the  neighborhood  of  the  carrier  frequency  that  corresponds  to  a  much  smaller 
wavelength.  For  example,  a  1 0  MHz  carrier  has  a  wavelength  of  only  30  m,  and  its  transmission 
can  be  achieved  with  an  antenna  size  on  the  order  of  3  m.  In  this  respect,  modulation  is  like 
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Figure  1.6 

Modulation: 

(a)  carrier; 

(b)  modulating 
(baseband) 
signal; 

(c)  amplitude- 
modulated  wave; 

(d)  frequency- 
modulated 
wave. 
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letting  the  baseband  signal  hitch  a  ride  on  a  high-frequency  sinusoid  (carrier).  The  carrier  and 
the  baseband  signal  may  also  be  compared  to  a  stone  and  a  piece  of  paper  If  we  wish  to  throw 

a  piece  of  paper,  it  cannot  go  too  far  by  itself.  But  if  it  is  wrapped  around  a  stone  (a  carrier),  it 
can  be  thrown  over  a  longer  distance. 


1 .4.2  Simultaneous  Transmission  of  Multiple 
Signals— Multiplexing 


"raphical  area  without'  I  "P  Mgna  s  10  be  transmitted  at  the  same  time  in  the  same  gc 
grupnical  area  without  direct  mutual  interferenrp  tw  •  .  .  .  .  ,  tr,lt 

by  considering  the  output  of  multiple  televkion  J  P°‘nt  ‘S  S‘mply  dem 

the  air)  to  people's  television  receivers  With  V  77  Camed  by  the  SamC  CablC  ^  u 

be  interfering  with  one  another  because  ill  hi  m°d“1.at,on’  mult,P|e  video  signals  will 

bandwidth.  Thus,  cable  TV  or  broadcast  TV  w  ^  V'df°  Slgnals  effective'y  have  the  sa' 
tion  at  a  time  in  a  given  location— .  h  hi  Uh°.Ut  modulatlon  w°uld  be  limited  to  one  s 
is  many  times  larger  than  that  of  the  signal  WUStdul  Protoco1  because  the  channel  bandwn 

modulate  differem  eerier  treque^ie  m°du,ation-  We  can  use  various  TV  stations 
reqUenC,es- thus  translating  each  signal  to  a  different  freque. 
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range.  If  the  various  carriers  are  chosen  sufficiently  far  apart  in  frequency,  the  spectra  of  the 
modulated  signals  (known  as  TV  channels)  will  not  overlap  and  thus  will  not  interfere  with 
each  other.  At  the  receiver  (TV  set),  a  tunable  bandpass  filter  can  select  the  desired  station 
or  TV  channel  for  viewing.  This  method  of  transmitting  several  signals  simultaneously,  over 
nonoverlapping  frequency  bands,  is  known  as  frequency  division  multiplexing  (FDM).  A 
similar  approach  is  also  used  in  AM  and  FM  radio  broadcasting.  Here  the  bandwidth  of  the 
channel  is  shared  by  various  signals  without  any  overlapping. 

Another  method  of  multiplexing  several  signals  is  known  as  time  division  multiplexing 
(TDM).  This  method  is  suitable  when  a  signal  is  in  the  form  of  a  pulse  train  (as  in  PCM). 
When  the  pulses  are  made  narrower,  the  spaces  left  between  pulses  of  one  user  signal  are  used 
for  pulses  from  other  signals.  Thus,  in  effect,  the  transmission  time  is  shared  by  a  number  of 
signals  by  interleaving  the  pulse  trains  of  various  signals  in  a  specified  order.  At  the  receiver, 
the  pulse  trains  corresponding  to  various  signals  are  separated. 

1 .4.3  Demodulation 

Once  multiple  modulated  signals  have  arrived  at  the  receiver,  the  desired  signal  must  be 
detected  and  recovered  into  its  original  baseband  form.  Note  that  because  of  FDM,  the  first 
stage  of  a  demodulator  typically  requires  a  tunable  bandpass  filter  so  that  the  receiver  can  select 
the  modulated  signal  at  a  predetermined  frequency  band  specified  by  the  transmission  station 
or  channel.  Once  a  particular  modulated  signal  has  been  isolated,  the  demodulator  will  then 
need  to  convert  the  carrier  variation  of  amplitude,  frequency,  or  phase,  back  into  the  baseband 
signal  voltage. 

For  the  three  basic  modulation  schemes  of  AM,  FM,  and  PM,  the  corresponding  demod¬ 
ulators  must  be  designed  such  that  the  detector  output  voltage  varies  in  proportion  to  the  input 
modulated  signal’s  amplitude,  frequency,  and  phase,  respectively.  Once  circuits  with  such 
response  characteristics  have  been  implemented,  the  demodulators  can  downconvert  the  mod¬ 
ulated  (RF)  signals  back  into  the  baseband  signals  that  represent  the  original  source  message, 
be  it  audio,  video,  or  data. 


1 .5  DIGITAL  SOURCE  CODING  AND  ERROR 
CORRECTION  CODING 

As  stated  earlier,  SNR  and  bandwidth  are  two  factors  that  determine  the  performance  of  a  given 
communication.  Unlike  analog  communication  systems,  digital  systems  often  adopt  aggressive 
measures  to  lower  the  source  data  rate  and  to  fight  against  channel  noise.  In  particular,  source 
coding  is  applied  to  generate  the  fewest  bits  possible  for  a  given  message  without  sacrificing  its 
detection  accuracy.  On  the  other  hand,  to  combat  errors  that  arise  from  noise  and  interferences, 
redundancy  needs  to  be  introduced  systematically  at  the  transmitter,  such  that  the  receivers  can 
rely  on  the  redundancy  to  correct  errors  caused  by  channel  distortion  and  noise.  This  process 
is  known  as  error  correction  coding  by  the  transmitter  and  decoding  by  the  receiver. 

Source  coding  and  error  correction  coding  are  two  successive  stages  in  a  digital  com¬ 
munication  system  that  work  in  a  see-saw  battle.  On  one  hand,  the  job  of  source  coding  is 
to  remove  as  much  redundancy  from  the  message  as  possible  to  shorten  the  digital  message 
sequence  that  requires  transmission.  Source  coding  aims  to  use  as  little  bandwidth  as  possible 
without  considering  channel  noise  and  interference.  On  the  other  hand,  error  correction  coding 
intentionally  introduces  redundancy  intelligently,  such  that  if  errors  occur  upon  detection,  the 
redundancy  can  help  correct  the  most  likely  errors. 
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Randomness,  Redundancy,  and  Source  Coding 

To  understand  source  coding,  it  is  important  to  first  discuss  the  role  of  randomness  in  communi- 
cations.  As  noted  earlier,  channel  noise  is  a  major  factor  limiting  communication  performance 
because  it  is  random  and  cannot  be  removed  by  prediction.  On  other  other  hand,  randomness  is 
also  closely  associated  with  the  desired  signals  in  communications.  Indeed,  randomness  is  the 
essence  of  communication.  Randomness  means  unpredictability,  or  uncertainty,  of  a  source 
message.  If  a  source  had  no  unpredictability,  like  a  friend  who  always  wants  to  repeat  the  same 
story  on  “how  I  was  abducted  by  an  alien,”  then  the  information  would  be  known  beforehand 
and  would  contain  no  information.  Similarly,  if  a  person  winks,  it  conveys  some  information 
in  a  given  context.  But  if  a  person  winks  continuously  with  the  regularity  of  a  clock,  the  winks 
convey  no  information.  In  short,  a  predictable  signal  is  not  random  and  is  fully  redundant. 
Thus,  a  message  contains  information  only  if  it  is  unpredictable.  Higher  predictability  means 
higher  redundancy  and,  consequently,  less  information.  Conversely,  more  unpredictable  or  less 
likely  random  signals  contain  more  information. 


Source  coding  reduces  redundancy  based  on  the  predictability  of  the  message  source.  The 
objective  of  source  coding  is  to  use  codes  that  are  as  short  as  possible  to  represent  the  source 
signal.  Shorter  codes  are  more  efficient  because  they  require  less  time  to  transmit  at  a  given 
data  rate.  Hence,  source  coding  should  remove  signal  redundancy  while  encoding  and  trans- 
mmmg  the  unpred.ctable,  random  part  of  the  signal.  The  more  predictable  messages  contain 
more  redundancy  and  require  shorter  codes,  while  messages  that  are  less  likely  contain  more 
information  and  should  be  encoded  with  longer  codes.  By  assigning  more  likely  messages  with 

H  r  T  !'kely  messages  With  longer  *>“"*  codes,  one  obtains  more  effi- 
or dLheJ T 1 *  6  COde’  f°r  examP'e- In  this  code’  various  combinations 

shorter  c  >de  words  a  1  aSSigned  t0  each  letter'  To  mi"imize  transmission  time, 

as  e  r.  a^  Ja^d  lo^eT'T  *  7*  ***«*>  probable)  letters  (such 

(such  as  x.  a.  and  z)  Thus  °  6  S  ^  assigned  t0  rare*y  occurring  (less  probable)  letters 

letter  distribution,  Thereby  leadingTo^horteTco?5  W°Ul<1  ^  l°  ft>ll°W  3  kn°Wn 

explains  why  *«  -  *  ***»  — 

minimized  if  a  message  (ortvmMiof'  sl8nals- the  overall  transmission  time  is 

proportional  to  log  ( \/P)  Hence  from  PF°  3  ^  's  signed  a  code  word  with  a  length 

message  with  pmLLy  P  **  * 

coding.  Tms  ,s  known  as  entropy  (source) 


- v,v..vv«uii  aiming 

Error  correction  coding  also  nlavs  an  imnon  *  i  . 

removes  redundancy,  error  correction  codes  add™  H  m  ^ommunicati°n.  While  source  co 
redundancy  supports  reliable  communication  4  redundancy-  The  systematic  introductic 
error  due  to  noise  or  interference,  °f  ^Unda"<*  if  certain  bits  a 

decode  a  message  accurately  despite  errors  in  thP  \may  helP  lhem  recover,  allowing  i 

or  t.  xample,  English  is  about  50%  redundant-  thT^'^  S'^na*'  languages  are  redurn 
the  letters  or  words  without  losing  the  meaning  °n  the  3Verage’  we  may  throw  out 
any  -.nghsh  message,  the  speaker  or  the  write^has  f  message-  This  also  means  th 
on  the  average.  The  remaining  half  is  determined  h  ^  ^  °Ver  half  the  letters  or  w< 

II  a  I  the  redundancy  of  English  were  removed  it  J  u  StrUcture  of  the  langl 

a  c  legram  or  telephone  conversation  If  an  err  “  d  l3ke  about  half  the  time  to  tran 
he  mlher  dillicull  „  meke  „„  «  **  reiver,  however,  I,  w 

‘  p  >s  a  useful  ro" in — *—  SZ  2E2? ■ — 
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It  may  appear  paradoxical  that  in  source  coding  we  would  remove  redundancy,  only  to  add 
more  redundancy  at  the  subsequent  error  correction  coding.  To  explain  why  this  is  sensible, 
consider  the  removal  of  all  redundancy  in  English  through  source  coding.  This  would  shorten 
the  message  by  50%  (for  bandwidth  saving).  However,  for  error  correction,  we  may  restore 
some  systematic  redundancy,  except  that  this  well-designed  redundancy  is  only  half  as  long  as 
what  was  removed  by  source  coding  while  still  providing  the  same  amount  of  error  protection. 
It  is  therefore  clear  that  a  good  combination  of  source  coding  and  error  correction  coding 
can  remove  inefficient  redundancy  without  sacrificing  error  correction.  In  fact,  a  very  popular 
problem  in  this  field  is  the  persistent  pursuit  of  joint  source-channel  coding  that  can  maximally 
remove  signal  redundancy  without  losing  error  correction. 

How  redundancy  can  enable  error  correction  can  be  seen  with  an  example:  to  transmit 
samples  with  L  =  16  quantizing  levels,  we  may  use  a  group  of  four  binary  pulses,  as  shown 
in  Fig.  1.5.  In  this  coding  scheme,  no  redundancy  exists.  If  an  error  occurs  in  the  reception  of 
even  one  of  the  pulses,  the  receiver  will  produce  a  wrong  value.  Here  we  may  use  redundancy 
to  eliminate  the  effect  of  possible  errors  caused  by  channel  noise  or  imperfections.  Thus,  if  we 
add  to  each  code  word  one  more  pulse  of  such  polarity  as  to  make  the  number  of  positive  pulses 
even,  we  have  a  code  that  can  detect  a  single  error  in  any  place.  Thus,  to  the  code  word  0001 
we  add  a  fifth  pulse,  of  positive  polarity,  to  make  a  new  code  word.  00011.  Now  the  number  of 
positive  pulses  is  2  (even).  If  a  single  error  occurs  in  any  position,  this  parity  will  be  violated. 
The  receiver  knows  that  an  error  has  been  made  and  can  request  retransmission  of  the  message. 
This  is  a  very  simple  coding  scheme.  It  can  only  detect  an  error;  it  cannot  locate  or  correct 
it.  Moreover,  it  cannot  detect  an  even  number  of  errors.  By  introducing  more  redundancy,  it 
is  possible  not  only  to  detect  but  also  to  correct  errors.  For  example,  for  L  =  16,  it  can  be 
shown  that  properly  adding  three  pulses  will  not  only  detect  but  also  correct  a  single  error 
occurring  at  any  location.  Details  on  the  subject  of  error  correcting  codes  will  be  discussed  in 
Chapter  15. 
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Telecommunications  (literally:  communications  at  a  distance)  are  always  critical  to  human 
society.  Even  in  ancient  times,  governments  and  military  units  relied  heavily  on  telecommu¬ 
nications  to  gather  information  and  to  issue  orders.  The  first  type  was  with  messengers  on  foot 
or  on  horseback;  but  the  need  to  convey  a  short  message  over  a  large  distance  (such  as  one 
warning  a  city  of  approaching  raiders)  led  to  the  use  of  fire  and  smoke  signals.  Using  signal 
mirrors  to  reflect  sunlight  (heliography),  was  another  effective  way  of  telecommunication.  Its 
first  recorded  use  was  in  ancient  Greece.  Signal  mirrors  were  also  mentioned  in  Marco  Polo’s 
account  of  his  trip  to  the  Far  East.1  These  ancient  visual  communication  technologies  are, 
amazingly  enough,  digital.  Fires  and  smoke  in  different  configurations  would  form  different 
codewords.  On  hills  or  mountains  near  Greek  cities  there  were  also  special  personnel  for  such 
communications,  forming  a  chain  of  regenerative  repeaters.  In  fact,  fire  and  smoke  signal 
platforms  still  dot  the  Great  Wall  of  China.  More  interestingly,  reflectors  or  lenses,  equivalent 
to  the  amplifiers  and  antennas  we  use  today,  were  used  to  directionally  guide  the  light  farther. 

Naturally,  these  early  visual  communication  systems  were  very  tedious  to  set  up  and  could 
transmit  only  several  bits  of  information  per  hour.  A  much  faster  visual  communication  system 
was  developed  just  over  two  centuries  ago.  In  1793  Claude  Chappe  of  France  invented  and 
performed  a  series  of  experiments  on  the  concept  of  “semaphore  telegraph.”  His  system  was  a 
series  of  signaling  devices  called  semaphores,  which  were  mounted  on  towers,  typically  spaced 
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,0  km  apan.  (A  .emaph^  Xx^ed  ji.e  a 

receiving  semaphore  operator  wou  tans  telegraph  became  the 

then  relay  the  message  from  hts  »  Mre  nexh  and  so  »  ^  ^  ^  inc,ud|ng  ,he 

government  telecommunication  sys  ..  eclipsed  by  electric  telegraphy.  Today, 

United  States.  The  semaphore  telegrap  was  "Telegraph  Hill"  remind  us  of  the 

„„,y  a  few  remaining lamps,  shtp  tings,  and 

hldh^raphsl  remahned  anSimponant  P^t1  of  maritime  communications  well  into  the  twentieth 

“"'mse  early  telecommunication  systems  are  optical  systems  based  on  visual  receivers. 
Thus  thev  can  cover  only  line-of-sight  distance,  and  human  operators  are  required  to  deco 
the  signals  An  important  event  that  changed  the  history  of  telecommunication  occurred  in 
1 820.  when  Hans  Christian  Oersted  of  Denmark  discovered  the  interaction  "<-'J 

and  magnetism.2  Michael  Faraday  made  the  next  crucial  discovery,  which  changed  the 
history  of  both  electricity  and  telecommunications,  when  he  found  that  electric  curren 
can  be  induced  on  a  conductor  by  a  changing  magnetic  field.  Thus,  electricity  generation 
became  possible  by  magnetic  field  motion.  Moreover,  the  transmission  of  electric  signa  s 
became  possible  by  varying  an  electromagnetic  field  to  induce  current  change  in  a  distant 
circuit.  The  amazing  aspect  of  Faraday’s  discovery  on  current  induction  is  that  it  provides 
the  foundation  for  wireless  telecommunication  over  distances  without  line-ot-sight.  and  more 
importantly,  it  shows  how  to  generate  electricity  as  an  energy  source  to  power  such  systems. 
The  invention  of  the  electric  telegraph  soon  followed,  and  the  world  entered  the  modern  electric 
telecommunication  era. 

Modem  communication  systems  have  come  a  long  way  from  their  infancy.  Since  it 
would  be  difficult  to  detail  all  the  historical  events  that  mark  the  recent  development  ot 
telecommunication,  we  shall  instead  use  Table  1 . 1  to  chronicle  some  of  the  most  notable 
events  in  the  development  of  modem  communication  systems.  Since  our  focus  is  on  electrical 
telecommunication,  we  shall  refrain  from  reviewing  the  equally  long  history  of  optical  (fiber) 
communications. 

It  is  remarkable  that  all  the  early  telecommunication  systems  are  symbol -based  digital 
systems.  It  was  not  until  Alexander  Graham  Bell’s  invention  of  the  telephone  system  that 
analog  live  signals  were  transmitted.  Live  signals  can  be  instantly  heard  or  seen  by  the  receiving 
users.  The  Bell  invention  that  marks  the  beginning  of  a  new  (analog  communication)  era  is 
therefore  a  major  milestone  in  the  history  of  telecommunications.  Figure  1 .7  shows  a  copy  of 
an  illustration  from  Bell  s  groundbreaking  1876  telephone  patent.  Scientific  historians  often 
hail  this  invention  as  the  most  valuable  patent  ever  issued  in  history. 

I  lie  invention  ot  telephone  systems  also  marks  the  beginning  of  the  analog  com¬ 
munication  era  and  live  signal  transmission.  On  an  exciting  but  separate  path,  wireless 
communication  began  in  1887,  when  Heinrich  Hertz  first  demonstrated  a  way  to  detect 
the  presence  ot  electromagnetic  waves.  French  scientist  Edouard  Branly,  English  physi¬ 
cist  Oliver  Lodge,  and  Russian  inventor  Alexander  Popov  all  made  important  contributions 
to  the  development  of  radio  receivers.  Another  important  contributor  to  this  area  was 
t  ic  ro.it un  orn  genius  Nikola  Tesla.  Building  upon  earlier  experiments  and  inventions, 

LtTn  IT  a"w  ITT  Guglielm°  Marc0ni  devel°Ped  “  wireless  telegraphy  sys- 
\  .  '!r  *  l'\’  he  shared  the  Nobel  Prize  in  Physics  in  1909  Marconi’s  wireless 

naueTf  the'mvemt  ^Tr ,7^  °f  C°mmercial  tireless  communications.  Soon,  the  mar- 

h fnU  ,  !  ,  and  MarC0ni  allowed  a^log  audio  sionals  to  go  wireless, 

thanks  to  amplitude  modulation  (AM)  technology.  Quality  m§usic  l  FM  radio 
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TABLE  1.1 

Important  Events  of  the  Past  Two  Centuries  of  Telecommunications 

Year  Major  Events 

1 820  First  experiment  of  electric  current  causing  magnetism  (by  Hans  C.  Oersted) 

1831  Discovery  of  induced  current  from  electromagnetic  radiation  (by  Michael  Faraday) 

1 830-32  Birth  of  telegraph  (credited  to  Joseph  Henry  and  Pavel  Schilling) 

1837  Invention  of  Morse  code  by  Samuel  F.  B.  Morse 

1864  Theory  of  electromagnetic  waves  developed  by  James  C.  Maxwell 

1 866  First  transatlantic  telegraph  cable  in  operation 

1 876  Invention  of  telephone  by  Alexander  G.  Bell 

1878  First  telephone  exchange  in  New  Haven,  Connecticut 

1 887  Detection  of  electromagnetic  waves  by  Heinrich  Hertz 

1896  Wireless  telegraphy  (radio  telegraphy)  patented  by  Guglielmo  Marconi 

1901  First  transatlantic  radio  telegraph  transmission  by  Marconi 

1906  First  amplitude  modulation  radio  broadcasting  (by  Reginald  A.  Fessenden) 

1 907  Regular  transatlantic  radio  telegraph  service 

1915  First  transcontinental  telephone  service 

1920  First  commercial  AM  radio  stations 

1 92 1  Mobile  radio  adopted  by  Detroit  Police  Department 

1925  First  television  system  demonstration  (by  Charles  F.  Jenkins) 

1 928  First  television  station  W3XK  in  the  United  States 

1935  First  FM  radio  demonstration  (by  Edwin  H.  Armstrong) 

1941  NTSC  black  and  white  television  standard 
First  commercial  FM  radio  service 

1947  Cellular  concept  first  proposed  at  Bell  Labs 

1948  First  major  information  theory  paper  published  by  Claude  E.  Shannon 
Invention  of  transistor  by  William  Shockley,  Walter  Brattain,  and  John  Bardeen 

1949  The  construction  of  Golay  code  for  3  (or  fewer)  bit  error  correction 

1950  Hamming  codes  constructed  for  simple  error  corrections 

1 953  NTSC  color  television  standard 

1958  Integrated  circuit  proposed  by  Jack  Kilby  (Texas  Instruments) 

1 960  Construction  of  the  pow  erful  Reed-Solomon  error  correcting  codes 

1962  First  computer  telephone  modem  developed:  Bell  Dataphone  103 A  (300  bit/s) 

1962  Low-density  parity  check  error  correcting  codes  proposed  by  Robert  G  Gallager 
1968-9  First  error  correction  encoders  on  board  NASA  space  missions  (Pioneer  IX  and  Mariner  VI) 

197 1  First  wireless  computer  network:  AlohaNet 

1973  First  portable  cellular  telephone  demonstration  to  the  U.S.  Federal  Communications 
Commission,  by  Motorola 
1978  First  mobile  cellular  trial  by  AT&T 

1984  First  handheld  (analog)  AMPS  cellular  phone  service  by  Motorola 

1 989  Development  of  DSL  modems  for  high-speed  computer  connections 

1991  First  (digital)  GSM  cellular  service  launched  (Finland) 

First  wireless  local  area  network  (LAN)  developed  (AT&T-NCR) 

1 993  Digital  ATSC  standard  established 

1993  Turbo  codes  proposed  by  Berrou,  Glavieux,  and  Thitimajshima 

1996  First  commercial  CDMA  (IS-95)  cellular  service  launched 
First  HDTV  broadcasting 

1 997  IEEE  802. 1 1  (b)  wireless  LAN  standard 

1 998  Large-scope  commercial  ADSL  deployment 

1 999  IEEE  802. 1 1  a  wireless  LAN  standard 

2000  First  3G  cellular  service  launched 
2003  IEEE  802. 1 1  g  wireless  LAN  standard 
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Figure  1.7 

Illustration  from 
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broadcast  was  hrst  demonstrated  by  American  inventor  Major  Edwin  H.  Armstrong.  Arm- 

strongs  FM  demonstration  in  1935  took  place  at  an  IEEE  meeting  in  New  York’s  Empire 
State  Building. 

A  historic  year  for  both  communications  and  electronics  was  1948.  the  year  that  wit¬ 
nessed  the  rebirth  of  digital  communications  and  the  invention  of  semiconductor  transistors. 
The  rebirth  of  digital  communications  is  owing  to  the  originality  and  brilliance  of  Claude 
..  annon,  width  known  as  the  father  of  modem  digital  communication  and  information 
.heory,  In  two  semmal  articles  published  in  1948.  he  firs,  esiablished  ,he  fundamental  concept 
of  channel  capacity  and  its  relation  ,o  informalion  transmission  rate.  Derivtng  the  channel 
capacity  of  several  tntportan,  models.  Shannon’  proved  tha,  a,  long  as  the  information  is 

mak^mbahih  I  .  r  y  Shan"0”  sh‘>wed  lhc  «*«*  *  good  codes  that  can 

ra“"',“,one™'art’i«»i1>™al|.  This  noisy  channel  coding  theorem 
L  nrntsil^L*;^^*^  Co^cidentally.  the  inventton  of  .he 

the  way  to  the  design  and  implementation  of  nm  Brat,a"'’  and  ,ohn  Bardeen)  Paved 
circuits  to  put  Shannon’s  theorems  into  practical  u^ThT'  7", powerlul’ and  less  nolsy 
in  March  of  1 97 1  was  the  first  NASA  mission  ,.fs  I,™'  aUnCh  °*  Manner  lx  Mars  orblter 
which  reliably  transmitted  photos  taken  from  Mam  *  e,l"PPed  W"h  t0rr“‘mg  codeS’ 

spread  applications  of  corp^Lr'nei!!o!k!„Ta|l7d|'|a  com'"u"ica"0,>s.  marked  by  the  wide- 
for  home  computer  connection  to  a  tminf  ^  C£  Uar  P^ones-  The  first  telephone  modem 
uses  an  acoustic  coupler  to  interface  with^6  was  developed  by  AT&T  Bell  Labs  in  1962.  It 

convms  the  local  computer  data  into  audible  tonellid^^T  hand^  The  acoustic  c0UP,er 

uses  the  regular  telephone  microphone 
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to  transmit  the  tones  over  telephone  lines.  The  coupler  receives  the  mainframe  computer  data 
via  the  telephone  headphone  and  converts  them  into  bits  for  the  local  computer  terminal, 
typically  at  rates  below  300  bit/s.  Rapid  advances  in  integrated  circuits  (first  credited  to  Jack 
Kilby  in  1958)  and  digital  communication  technology  dramatically  increased  the  link  rate  to 
56  kbit/s  by  the  1990s.  By  2000,  wireless  local  area  network  (WLAN)  modems  were  developed 
to  connect  computers  at  speed  up  to  11  Mbit/s.  These  commercial  WLAN  modems,  the  size 
of  a  credit  card,  were  first  standardized  as  IEEE  802.  lib. 

Technological  advances  also  dramatically  reshaped  the  cellular  systems.  While  the  cellular 
concept  was  developed  in  1 947  at  Bell  Labs,  commercial  systems  were  not  available  until  1 983. 
The  “mobile”  phones  of  the  1980s  were  bulky  and  expensive,  mainly  used  for  business.  The 
world’s  first  cellular  phone,  developed  by  Motorola  in  1983  and  known  as  DynaTAC  8000X, 
weighed  28  ounces,  earning  the  nickname  of  “brick”  and  costing  $3995.  These  analog  phones 
are  basically  two-way  FM  radios  for  voice  only.  Today,  a  cellphone  is  truly  a  multimedia, 
multifunctional  device  that  is  useful  not  only  for  voice  communication  but  also  can  send 
and  receive  e-mail,  access  websites,  and  display  videos.  Cellular  devices  are  now  very  small, 
weighing  no  more  than  a  few  ounces.  Unlike  in  the  past,  cellular  phones  are  now  for  the  masses. 
In  fact,  Europe  now  has  more  cellphones  than  people.  In  Africa,  13%  of  the  adult  population 
now  owns  a  cellular  phone. 

Throughout  history,  the  progress  of  human  civilization  has  been  inseparable  from  tech¬ 
nological  advances  in  telecommunications.  Telecommunications  played  a  key  role  in  almost 
every  major  historical  event.  It  is  not  an  exaggeration  to  state  that  telecommunications  helped 
shape  the  very  world  we  live  in  today  and  will  continue  to  define  our  future.  It  is  therefore 
the  authors’  hope  that  this  text  can  help  stimulate  the  interest  of  many  students  in  telecommu¬ 
nication  technologies.  By  providing  the  fundamental  principles  of  modem  digital  and  analog 
communication  systems,  the  authors  hope  to  provide  a  solid  foundation  for  the  training  ot 
future  generations  of  communication  scientists  and  engineers. 
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In  this  chapter  we  discuss  certain  basic  signal  concepts.  Signals  are  processed  by  systems. 
We  shall  start  by  explaining  the  terms  signals  and  systems. 


Signals 

A  signal,  as  the  term  implies,  is  a  set  of  information  or  data.  Examples  include  a  telephone  or 
a  television  signal,  the  monthly  sales  figure  of  a  corporation,  and  the  daily  closing  prices  of  a 
stock  market  (e.g.,  the  Dow  Jones  averages).  In  all  these  examples,  the  signals  are  functions  of 
the  independent  variable  time.  When  an  electric  charge  is  distributed  over  a  surface,  however, 
the  signal  is  the  charge  density,  a  function  of  space  rather  than  time.  In  this  book  we  deal 
almost  exclusively  with  signals  that  are  functions  of  time.  The  discussion,  however,  applies 
equally  well  to  other  independent  variables. 


systems 


Signals  may  be  processed  further  by  systems,  which  may  modify  them  or  extract  an  additional 
information  from  them.  For  example,  an  antiaircraft  missile  launcher  may  want  to  know  the 
uture  location  of  a  hostile  moving  target,  which  is  being  tracked  by  radar.  From  the  radar 

S1(1Z:  hC  PaSt  l0Ca,l0n  and  velocity  of  the  target.  By  properly  processing  the  radar 
gnal  ( the  input),  he  can  approximately  estimate  the  future  location  of  the  target  Thus,  a  system 

A  rUvtTr:’  afTf T*  “•'■W “  «h=r  se,  /signals  (outputs,. 

systems  (hardware  realist ^tt^kTlSSI "**"**■ "  ■’’l*’"1* 
input  signal  (software  realization).  *  ?  h  ,hal  con,P““s  an  output  Irom  an 


2.1  SIZE  OF  A  SIGNAL 


I  he  size  of  any  entity  is  a  quantity  that  indicates  its  lam 

signal  amplitude  varies  with  time.  How  can  ■  ^eness  or  strength.  Generally  speaking, 

with  varying  amplitude  be  measured  by  one  numbertlv!"'  ?^.°Ver  3  Certain  time 
strength?  Such  a  measure  must  consider  not  I  W‘  ,ndlcate  the  signal  size  or  signal 
For  instance,  if  we  are  to  devise  a  single  numb  v  S'8nal  amPlitude-  but  also  its  duration, 
we  must  consider  not  only  his  or  herAiHth  i  ^  u\3S  3  measure  °f  the  size  of  a  human  being. 
20  rW’dth(emh)-b”'a'»>  *e  heigh,.  Thu  prrxluc,  of  g» 
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and  height  is  a  reasonable  measure  of  the  size  of  a  person.  If  we  wish  to  be  little  more  precise, 
we  can  average  this  product  over  the  entire  length  of  the  person.  It  we  make  a  simplifying 
assumption  that  the  shape  of  a  person  is  a  cylinder  of  radius  r,  which  varies  with  the  height  h 
of  the  person,  then  a  reasonable  measure  of  the  size  of  a  person  of  height  H  is  his  volume  V' 
given  by 


V  = 


r2(h)dh 


Signal  Energy 

Arguing  in  this  manner,  we  may  consider  the  area  under  a  signal  g(t)  as  a  possible  measure  of 
its  size,  since  it  takes  into  account  ot  not  only  the  amplitude,  but  also  the  duration.  However, 
this  will  be  a  defective  measure  because  if  g(t)  was  a  large  signal,  its  positive  and  negative 
areas  might  cancel  each  other,  wrongly  indicating  a  signal  of  small  size.  This  difficulty  can  be 
corrected  by  defining  the  signal  size  as  the  area  under  g2(t),  which  is  always  positive.  We  call 
this  measure  the  signal  energy  Eg,  defined  (for  a  real  signal)  as 


This  definition  can  be  generalized  to  a  complex  valued  signal  g(t)  as 


(2.1) 


(2.2) 


There  are  also  other  possible  measures  of  signal  size,  such  as  the  area  under  |g(r)|.  The  energy 
measure,  however,  is  not  only  more  tractable  mathematically,  it  is  also  more  meaningful  (as 
shown  later)  in  the  sense  that  it  is  indicative  of  the  energy  that  can  be  extracted  from  the  signal. 


Signal  Power 

The  signal  energy  must  be  finite  for  it  to  be  a  meaningful  measure  of  signal  size.  A  necessary 
condition  for  the  energy  to  be  finite  is  that  the  signal  amplitude  must  -*  0  as  |r|  -►  oo 
(Fig.  2.1a).  Otherwise  the  integral  in  Eq.  (2.1)  will  not  converge. 

if  the  amplitude  of  g(t)  does  not  0  as  |f  |  —  oo  (Fig.  2.1b),  the  signal  energy  is  infinite. 
A  more  meaningful  measure  of  the  signal  size  in  such  a  case  would  be  the  time  average  of  the 


Figure  2.1 

Examples  of 
signals: 

(a)  signal  with 
finite  energy; 

(b)  signal  with 
finite  power. 
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energy  (if  it  exists),  which  is  the  average  power  Pg  defined  by  (for  a  real  signal) 


energy 


(2.3) 


We  can  generalize  this  definition  for  a  complex  signal  g(t)  as 


(2.4) 


Observe  that  the  signal  power  Pg  is  the  time  average  (mean)  of  the  signal  amplitude  square, 
that  is,  the  mean  squared  value  of  g(t).  Indeed,  the  square  root  of  Pg  is  the  tamiliar  rms 
(root-mean-square)  value  of  g(t). 

The  mean  of  an  entity  averaged  over  a  large  time  interval  approaching  infinity  exists  if 
the  entity  is  either  periodic  or  has  a  statistical  regularity.  If  such  a  condition  is  not  satisfied, 
the  average  may  not  exist.  For  instance,  a  ramp  signal  g(t)  =  t  increases  indefinitely  as  |/| 
approaches  infinity,  and  neither  the  energy,  nor  the  power  exists  for  this  signal. 

Comments 

It  should  be  stressed  that  “signal  energy”  and  “signal  power”  are  inherent  characteristic 
values  of  a  signal.  They  have  nothing  to  do  with  the  consumption  of  the  signal  by  any 
load.  Signal  energy  and  signal  power  are  used  to  measure  the  signal  strength  or  size.  For 
instance,  if  we  approximate  a  signal  g(t)  by  another  signal  z(f),  the  approximation  error  is 
thus  e(l)  =g(t)  —  z(t).  The  energy  (or  power)  of  e(t)  is  a  convenient  indicator  of  the  approx¬ 
imation  accuracy.  It  provides  us  with  a  quantitative  measure  of  determining  the  closeness  of 
the  approximation.  It  also  allows  us  to  determine  whether  one  approximation  is  better  than 
another.  During  transmission  over  a  channel  in  a  communication  system,  message  signals  are 
corrupted  by  unwanted  signals  (noise).  The  quality  of  the  received  signal  is  judged  by  the 
relative  sizes  ot  the  desired  signal  and  the  unwanted  signal  (noise).  In  this  case  the  ratio  of  the 
message  signal  and  noise  signal  powers  (the  ratio  of  signal  to  noise  power)  is  a  good  indication 
of  the  received  signal  quality. 

Units  of  Signal  Energy  and  Power 


very  large  or  small.  As  a  convention,  a  signal  with 


have  power  of 


average  power  of  P  watts  can  be  said  to 


|IO-log„,P|dBw  or  |30+l01og,0/>|dBm 

Fo.  e»«mp,e.  -30  dBn,  represents  signal  ^  of  ,0-4  w  . 


in  linear  scale. 


Example  2. 1  Determine  the  suitable  measures  of  the  signals  in  Fig.  2.2. 


The  signal  in  Fig.  2.2a  approaches  0  as  I; I 
signal  is  its  energy  Ex  given  by 


00 •  Therefore,  the  suitable  measure  for  this 


Figure  2.2 

Signal  for 
Example  2.1 . 
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The  signal  in  Fig.  2.2b  does  not  approach  0  as  |?|  -»•  oo.  However,  it  is  periodic,  and 
therefore  its  power  exists.  We  can  use  Eq.  (2.3)  to  determine  its  power.  For  periodic  signals, 
we  can  simplify  the  procedure  by  observing  that  a  periodic  signal  repeats  regularly  each 
period  (2  seconds  in  this  case).  Therefore,  averaging  g2(/)  over  an  infinitely  large  interval 
is  equivalent  to  averaging  it  over  one  period  (2  seconds  in  this  case).  Thus 


Recall  that  the  signal  power  is  the  square  of  its  rms  value.  Therefore,  the  rms  value  of  this 


signal  is  l/\/3. 


:rmine  the  power  and  rms  value  of 

(a)  g(t)  =  C  cos  (coot  +  0) 

(b)  g(t)  =  Ci  cos(<uif  +  0])  +  C2  C0S((02t  +  O2)  (C0ly^C02) 

(c)  git)  =  Dei0*' 

(a)  This  is  a  periodic  signal  with  period  To  =  2n/coo-  The  suitable  measure  of  its  size  is 
power.  Because  it  is  a  periodic  signal,  we  may  compute  its  power  by  averaging  its  energy 
over  one  period  In/coo.  However,  for  the  sake  of  generality,  we  shall  solve  this  problem 
by  averaging  over  infinitely  large  time  interval  using  Eq.  (2.3). 

1  rT/ 2  1  rT/2  C2 

P  =  Jim  I  /  C2  cos2  (cuof  +  d)  dt  =  lim  -  /  —  [1  +  cos  (Icoot  +  20)]  dt 

8  T-+00  T  J-T/2  T^°°  1  J~T/2  2 

r2  pT/2  q2-  rT/2 

-  lim  —  /  dt+  lim  —  /  cos  (2coot  +  20)  dt 
T-y 00  2 T  J-T/2  T-+0O  2 T  J-T/2 

The  first  term  on  the  right-hand  side  equals  C2/ 2,  while  the  second  term  is  zero  because 
the  integral  appearing  in  this  term  represents  the  area  under  a  sinusoid  over  a  very  large 
time  interval  T  with  T  -*■  00.  This  area  is  at  most  equal  to  the  area  of  half  the  cycle 
because  of  cancellations  of  the  positive  and  negative  areas  of  a  sinusoid.  The  second  term 
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is  this  area  multiplied  by  C'ftT  with  T  -  oo.  CleaHy  this  torn  is  zero.  and 


P'  = 


■  OO. 

C2 


(2.5a) 


i  sinusoid  of  amplitude  C  has  a  power  C2/ 2  regardless 
onH  itc  nhncp  The  nm  value  is  C /  \/2.  It  the  sisnal 


frequency 

is  zero  (dc  or  a 

constant 

power  is  C 

(b) 

In  this  case 

1 

rT/2 

Pv  =  lim  - 

/  [Ci 

*  T-*oo  T 

J-T/2 

1 

r T/2 

=  *tm  — 

/  c' 

7->oo  T 

J-T/2 

i  r T /2 

+  lim 

T-*  oo 

T  J-T/2 

+  lim 

2 C,C2  / 

T->oo 

T  J. 

■7/  2 


-T/2 


Observe 


re  that  the  first  and  the  second  integrals  on  the  right-hand  side  are  the  powers  of 
the  two  sinusoids,  which  are  C\2/2  and  C22/2  as  found  in  part  (a).  We  now  show  that  the 
third  term  on  the  right-hand  side  is  zero  if  oq  £  a^. 

CT/2 

-77  2 
rT/2 

cos  [(a>i  +(02)t  +  G\  +d2]  dt 


lim^ 

T->oc  T 


rT/2 

/  cos(aqr  -I- ) cos  (a>27  +  02)dt 
J-T/2 


=  lim 


CiC2 


T-*  oo  T 
rT/2 
-T/2 

=  0 


-f 


J-T/2 

cos  [(a>i  -ft>2)r  +  ei  -02]df 


Consequently, 


*  2  2 


(2.51 


I - 

and  the  rms  value  is  ^(Ci2  +  C22)/2. 

luenciesWnK^0)eino?ht?2!lr.t?/ny  SUm  °f  sinusoids  with  distinct  angul 


frequencies  con  (wn  ±  0).  In  other  words,  if" 

OO 

g(7)  =  ^C„cos  (cont  +  en) 


n=  1 
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where  none  of  the  sinusoids  have  identical  frequencies,  then 


(2.5c) 


n=\ 


(c)  In  this  case  the  signal  is  complex  valued,  and  we  use  Eq.  (2.4)  to  compute  the 
power.  Thus, 


1  fr/2 

-77  2 

Recall  that  l^'l  =  1  so  that  IDe^'l2  =  |D|2,  and 


\  rTf 2  . 

P,  =  I™  =  /  IDc'-'l2* 

T-+oo  I  J-T/2 


i  rT/2 

1  —2  _  ,m2 


i  rl 

=  lim  -  /  |D|Z<*  =  |D| 

r-^oo  T  J-t/2 


(2.5d) 


Its  rms  value  is  |D|. 


Comment:  In  part  (b),  we  have  shown  that  the  power  of  the  sum  of  two  sinusoids  is  equal 
to  the  sum  of  the  powers  of  the  sinusoids.  It  may  appear  that  the  power  of  g  \  (t) + #2(0  is  simply 
p  _|_  p ^  ge  cautioned  against  such  a  generalization!  All  we  have  proved  here  is  that  this 
is  true  if  the  two  signals  gi(r)  and  g2(t)  happen  to  be  sinusoids.  It  is  not  true  in  general!  In 
fact,  it  is  not  true  even  for  the  sinusoids  if  the  two  sinusoids  are  of  the  same  frequency.  We  shall 
show  later  (Sec.  2.5.4)  that  only  under  a  certain  condition,  called  the  orthogonality  condition, 
is  the  power  (or  energy)  of  gi  (r)  +  gi(t)  equal  to  the  sum  of  the  powers  (or  energies)  of  gi  (t) 
and  g2(t)- 


2.2  CLASSIFICATION  OF  SIGNALS 

There  are  various  classes  of  signals.  Here  we  shall  consider  only  the  following  pairs  of  classes 
that  are  suitable  for  the  scope  of  this  book. 

1.  Continuous  time  and  discrete  time  signals 

2.  Analog  and  digital  signals 

3.  Periodic  and  aperiodic  signals 

4.  Energy  and  power  signals 

5.  Deterministic  and  probabilistic  signals 

2.2.1  Continuous  Time  and  Discrete  Time  Signals 

A  signal  that  is  specified  for  every  value  of  time  t  (Fig.  2.3a)  is  a  continuous  time  signal, 
and  a  signal  that  is  specified  only  at  discrete  points  of  t  =  nT  (Fig.  2.3b)  is  a  discrete  time 
signal.  Audio  and  video  recordings  are  continuous  time  signals,  whereas  the  quarterly  gross 
domestic  product  (GDP),  the  monthly  sales  of  a  corporation,  and  stock  market  daily  averages 
are  discrete  time  signals. 
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Figure  2.3 

(a)  Continuous 
time  and 

(b)  discrete  time 
signals. 


2.2.2  Analog  and  Digital  Signals 

One  should  not  confuse  analog  sienak  with 

not  the  same.  This  is  also  true  of  the  concepts  ofT°US  tlme  S‘gnalS'  The  tW°  concePts  are 

whose  amplitude  can  have  any  value  in  a  coniLm  ^  time  and  d,gital  s'gnals-  A  s'gnal 

an  analog  signal  amplitude  can  take  on  an  <  usrfn8e  isan  analog  signal.  This  means  that 
signal,  on  the  other  hand,  is  one  whose  amri^d11118* y number  of  values.  A  digital 
Signals  associated  with  a  digital  comnntpr  c  C3n  ta^e  °n  0n^  a  bn'te  number  values, 
(binary  signals).  For  a  signal  to  qualify  as  digital  T'  beCaUSe  they  take  on  only  two  values 
to  two.  It  can  be  any  finite  number.  A  digital  sionai  V 1umber  of  values  need  not  be  restricted 
an  Af-ary  signal  of  which  binary  (M  ~2\  \  8  J  Whose  amplitudes  can  take  on  M  values  is 
“discrete  time"  qualify  the  nature  of  signal  alnlcST^  ^ The  tCrms  “continuous  time”  and 
and  "digital,”  on  the  other  hand,  describe  "8  tbe  11  me  (horizontal)  axis.  The  terms  “analog” 
I  luure  2.4  shows  signals  of  various  types  It  k  T  ^  signal  amP,itude  (vertical)  axis, 
time,  nor  does  digital  need  to  be  discrete  ^ 1  31  analo8  is  n°t  necessarily  continuous 

i  tscrete  time  signal.  An  analog  signal  can  be  cn  '8Ure,  ~  *C  shows  an  example  of  an  analog, 

* "D  *"*»  4-ST(XS!S"t"  *  «*»  «■*  ^alog-.o-digi.al 

t  ).  as  explained  in  Section  6.2. 
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Figure  2.4 

Examples  of 
signals: 

(a)  analog  and 
continuous  time; 

(b)  digital  and 
continuous  time; 

(c)  analog  and 
discrete  time; 

(d)  digital  and 
discrete  time. 


(a) 


(d) 


2.2.3  Periodic  and  Aperiodic  Signals 

A  signal  g(t)  is  said  to  be  periodic  if  there  exists  a  positive  constant  To  such  that 


g(0  =  g(t  +  T0)  for  all/ 


(2.6) 


The  smallest  value  of  7o  that  satisfies  the  periodicity  condition  (2.6)  is  the  period  of  g(t ).  The 
signal  in  Fig.  2.2b  is  a  periodic  signal  with  period  of  2.  Naturally,  a  signal  is  aperiodic  if  it  is 
not  periodic.  The  signal  in  Fig.  2.2a  is  aperiodic. 

By  definition,  a  periodic  signal  g(t)  remains  unchanged  when  time-shifted  by  one  period. 
This  means  that  a  periodic  signal  must  start  at  t  =  -oo  because  if  it  starts  at  some  finite  instant, 
say,  /  =  0,  the  time-shifted  signal  g(t  +  To)  will  start  at  /  =  -T0  and  g(t  +  7o)  is  no  longer  the 
same  as  g(t).  Therefore,  a  periodic  signal,  by  definition,  must  start  from  -oo  and  continue 
forever,  as  shown  in  Fig.  2.5.  Observe  that  a  periodic  signal  shifted  by  an  integral  multiple 
of  To  remains  unchanged.  Therefore,  g(t)  may  also  be  considered  to  be  a  periodic  signal  with 
period  mTo ,  where  m  is  any  integer.  However,  by  definition,  the  period  is  the  smallest  interval 
that  satisfies  periodicity  condition  (2.6).  Therefore,  7b  is  the  period. 
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2.2.4  Energy  and  Power  Signals 

A  signal  with  finite  energy  is  an  energy  signal,  and  a  signal  with  finite  power  is  a  pow  er  signal. 
In  other  words,  a  signal  g(t)  is  an  energy  signal  if 

/oo 

\g(t)\2dt  <  oo  (2.7) 

-00 

Similarly,  a  signal  with  a  finite  and  nonzero  power  (mean  square  value)  is  a  power  signal.  In 
other  words,  a  signal  is  a  power  signal  if 

i  r T /2  - 

0  <  lim  —  /  \g(t)\2dt  <  oo  (2.8) 

T-> oo  T  J_T/ 2 

The  signals  in  Fig.  2.2a  and  b  are  examples  of  energy  and  power  signals,  respectively.  Observe 
that  power  is  the  time  average  of  the  energy.  Since  the  averaging  is  over  an  infinitely  large 
interval,  a  signal  with  finite  energy  has  zero  power,  and  a  signal  with  finite  power  has  infinite 
energy.  Therefore,  a  signal  cannot  both  be  an  energy  signal  and  a  power  signal.  If  it  is  one,  it 
cannot  be  the  other.  On  the  other  hand,  certain  signals  with  infinite  power  are  neither  energy 
nor  power  signals.  The  ramp  signal  is  one  example. 


Comments 


Every  signal  observed  in  real  life  is  an  energy  signal.  A  power  signal,  on  the  other  hand,  must 
have  an  infinite  duration.  Otherwise  its  power,  which  is  its  average  energy  (averaged  over 
an  infinitely  large  interval)  will  not  approach  a  (nonzero)  limit.  Obviously  it  is  impossible  to 

generate  a  true  power  signal  in  practice  because  such  a  signal  would  have  infinite  duration  and 
infinite  energy. 


on 6  n.H?vrCar!  °f  repetition'  Periodic  signals  for  which  the  area  under  |#(r)|2  over 

period  is  finite  are  power  signals;  however,  not  all  power  signals  are  periodic. 


2.2.5  Deterministic  and  Random  Signals 

form  is  a  determSclt^Ta^g^kk  C°mplet®,y: in  either  mathematical  or  graphical 
such  as  mean  value.  “  T  °f  pr°babiUstic  deScripti°"’ 

graphical  description,  it  is  a  random  signal  Most  of  thTno  '  ^  mathematicaV °r 

are  random  signals  All  message  d<m  .i  ,  °  the  no  se  Slgnals  encountered  in  practice 

signal,  to  convey  ^  as  will  be  shown  later,  a 

of  random  signals  will  be  discussed  in  lateTchlpters113111^  (randomness)  about  il-  Treatment 

2.3  SOME  USEFUL  SIGNAL  OPERATIONS 

We  discuss  here  three  useful  and  • 

Since  the  signal  variable  in  our  signal  desrrfnr'  °peratlons:  shifting,  scaling,  and  inversion. 

time  shifting,  time  scaling,  and  time  inversion^,  !°f  1,®.t,me’ these  operations  are  discussed  as 

VU 'd  lor  functions  hav'ng  independent  variableToth  "If’  H°Wever- this  discussion  is  equally 

an  doles  other  than  time  (e.g.,  frequency  or  distance). 


Figure  2.6 

Time  shifting  a 
signal. 
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j-r-i 


(a) 


(b) 


(c) 


2.3.1  Time  Shifting 

Consider  a  signal  git)  (Fig.  2.6a)  and  the  same  signal  delayed  by  T  seconds  (Fig.  2.6b),  which 
we  shall  denote  as  (pit).  Whatever  happens  in  git)  (Fig.  2.6a)  at  some  instant  t  also  happens 
in  (pit)  (Fig.  2.6b)  T  seconds  later  at  the  instant  t  +  T.  Therefore 

(pit  +  T)  =  git)  (2.9) 


or 


(Pit)  =  git -T)  (2.10) 

Therefore,  to  time-shift  a  signal  by  T ,  we  replace  t  with  t  —  T.  Thus,  git  -  T)  represents  g(f) 
time-shifted  by  T  seconds.  If  T  is  positive,  the  shift  is  to  the  right  (delay).  If  T  is  negative,  the 
shift  is  to  the  left  (advance).  Thus,  g(t  -  2)  is  g(t)  delayed  (right-shifted)  by  2  seconds,  and 
g(t  +  2)  is  git)  advanced  (left-shifted)  by  2  seconds. 

2.3.2  Time  Scaling 

The  compression  or  expansion  of  a  signal  in  time  is  known  as  time  scaling.  Consider  the 
signal  git)  of  Fig.  2.7a.  The  signal  </>(t)  in  Fig.  2.7b  is  g(t)  compressed  in  time  by  a  factor  of  2. 
Therefore,  whatever  happens  in  g(t)  at  some  instant  t  will  be  happening  to  (pit)  at  the  instant 
r/2  so  that 

*(0=*(O  <2.10 


and 


(Pit)  =  gilt) 


(2.12) 


30  SIGNALS  AND  SIGNAL  SPACE 


Figure  2.7 

Time  scaling  a 
signal. 


Example  2.3 


(a) 


(b) 


(c) 


Observe  that  because  g(t)  =  0  at  t  —  T\  and  T2,  the  same  thing  must  happen  in  (pit)  at  half 
these  values.  Therefore,  <j>(t)  =  0  at  t  =  T\/2  and  72/2,  as  shown  in  Fig.  2.7b.  If  git)  were 
recorded  on  a  tape  and  played  back  at  twice  the  normal  recording  speed,  we  would  obtain 
g(2t).  In  general,  if  git)  is  compressed  in  time  by  a  factor  a  (a  >  1 ),  the  resulting  signal  (pit) 
is  given  by 


(pit)  =  g(at) 


(2.13) 


We  can  use  a  similar  argument  to  show  that  g(t)  expanded  (slowed  down)  in  time  by  a 
factor  a  (a  >  I )  is  given  by 


m 


O 


(2.14) 


Figure  _.7c  shows  g(f/2),  which  is  g(t)  expanded  in  time  by  a  factor  of  2.  Note  that  the  signal 
imams  anc  ored  at  /  _  0  during  scaling  operation  (expanding  or  compressing).  In  other 
words,  the  signal  at  /  =  0  remains  unchanged.  This  is  because  g(t)  =  g(at)  =  g(0)  at  /  =  0. 

is  corn^sT*  Signa'  ^  3  f3Ct0r ' We  "P'«*  '  with  at.  \la  >1 ,  the  scaling 

is  compression,  and  if  a  <  1,  the  scaling  is  expansion. 


Crider  .h.  sigrcus  »(,)  and  W  to  Fig.  2.8a  and  b,  respec,ively.  Skeich  (a)  „(30  '*> 
(a)  g  (3/)  is  g(r)  compressed  by  a  factor  of  3  Thic  m  ..  ,  , 

Jtsr 24  ~  -  *<*>  - * fn 

-  -  *?£  b,2a  t?  ™-  »f  - 

Fig.  2.8d.  k  2,  2,  and  —6  respectively,  as  shown  in 
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Figure  2.8 

Examples  of  time 
compression  and 
time  expansion 
of  signals. 


2.3.3  Time  Inversion  (or  Folding) 

Time  inversion  may  be  considered  to  be  a  special  case  of  time  scaling  with  a  =  -1  in 
Eq.  (2.13).  Consider  the  signal  g(t)  in  Fig.  2.9a.  We  can  view  g(f)  as  a  rigid  wire  frame  hinged 
at  the  vertical  axis.  To  invert  g(t),  we  rotate  this  frame  180°  about  the  vertical  axis.  This 
time  inversion  or  folding  [the  mirror  image  of  g(t)  about  the  vertical  axis]  gives  us  the  signal 
4>(t)  (Fig.  2.9b).  Observe  that  whatever  happens  in  Fig.  2.9a  at  some  instant  t  also  happens  in 
Fig.  2.9b  at  the  instant  -t.  Therefore 


<P(-t)  =  g(t) 


and 


<P(t)  =  g(~0  (2-15) 

Therefore,  to  time-invert  a  signal  we  replace  1  with  —t  such  that  the  time  inversion  of  g(t) 
yields  g(-t).  Consequently,  the  mirror  image  of  g(t)  about  the  vertical  axis  is  g(-t).  Recall 
also  that  the  mirror  image  of  g(t)  about  the  horizontal  axis  is  -g(f). 


Example  2.4  For  the  signal  g(t)  shown  in  Fig.  2.10a,  sketch  g(— t). 

I  The  instants  -1  and  -5  in  g(r)  are  mapped  into  instants  1  and  5  in  g(-t).  If  g(t)  =  e'/2, 
then  g(-t )  =  e~,/2.  The  signal  g(-/)  is  shown  in  Fig.  2.10b. 
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Figure  2.10 

Example  of  time 
inversion. 


Figure  2.1 1 

(a)  Unit  impulse 
and  (b)  its 
approx¬ 
imation. 


(a) 

2.4  UNIT  IMPULSE  SIGNAL 


l 

€ 


2  2 
(b) 


e  — >0 


z  zrtzs:  z is  r of  ,he  mos'  ^ 

in  pZSelt"  aPPlU:a"0n  Provid'  »“*  convenience  ,ha,  is  no,  permissible 

the  ■SESSC  fUnC'i0n  ‘W  ~  defined  b^'  A-  M.  Dirac  (hence  often  known  as 


S(t)  =  0 


t*  0 


(2.16) 

(2.17) 


we  can  visualize  an  impulse  as  a  tall 

Hg.  2.11.  The  width  of  this  rectangular  nnT™"^  rectan§u'ar  pulse  of  unit  area,  as  shown  in 
'.ilue  l/f  in  the  limit  as  e  o.  The  unit  ^i3  VCr^ sma"  valne  e;  its  height  is  a  very  large 
puKe  with  a  width  that  has  become  infinit^^11  if  l^eretore  can  be  regarded  as  a  rectangular 

nfinueslmally  small,  a  heigh,  ,hai  has  become  infinitely 
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large,  and  an  overall  area  that  remains  constant  at  unity.*  Thus,  <5(f)  =  0  everywhere  except 
at  t  =  0,  where  it  is,  strictly  speaking,  undefined.  For  this  reason  a  unit  impulse  is  graphically 
represented  by  the  spearlike  symbol  in  Fig.  2.1  la. 

Multiplication  of  a  Function  by  an  Impulse 

Let  us  now  consider  what  happens  when  we  multiply  the  unit  impulse  8(t )  by  a  function  fit) 
that  is  known  to  be  continuous  at  t  =  0.  Since  the  impulse  exists  only  at  t  =  0,  and  the  value 
of  fit)  at  t  =  0  is  </>(0),  we  obtain 


(2.18a) 


<P(t)8(t)  =  <p(0)m 


Similarly,  if  fit)  is  multiplied  by  an  impulse  8{t  —  T )  (an  impulse  located  at  t  =  T),  then 


(2.18b) 


fit)8it  -T)  =  <p(T)8U  -  T) 


provided  fit)  is  defined  at  t  =  T. 


The  Sampling  Property  of  the  Unit  Impulse  Function 

From  Eq.  (2. 1 8)  it  follows  that 


(P(t)8«  —  T)dt  =  <p(T)  /  Sit  —  T)dt  —  <p(T) 


(2.19a) 


provided  fit)  is  continuous  at  t  =  T.  This  result  means  that  the  area  under  the  product  of  a 
function  with  an  impulse  8(t)  is  equal  to  the  value  of  that  function  at  the  instant  where  the  unit 
impulse  is  located.  This  property  is  very  important  and  useful,  and  is  known  as  the  sampling 
(or  sifting)  property  of  the  unit  impulse. 

Depending  on  the  value  of  T  and  the  integration  limit,  the  impulse  function  may  or  may 
not  be  within  the  integration  limit.  Thus,  it  follows  that 


b 


<P(t)8(t  —  T)dt  =  fiT)  /  8(t  —  T)dt  = 


a 


The  Unit  Impulse  as  a  Generalized  Function 

The  unit  impulse  function  definition  (2.17)  leads  to  a  nonunique  function.1  Moreover,  8(t)  is 
not  even  a  true  function  in  the  ordinary  sense.  An  ordinary  function  is  specified  by  its  values 
for  all  time  t.  The  impulse  function  is  zero  everywhere  except  at  t  =  0,  and  at  this,  the  only 
interesting  point  of  its  range,  it  is  undefined.  In  a  more  rigorous  approach,  the  impulse  function 
is  defined  not  as  an  ordinary  function  but  as  a  generalized  function,  where  8(t)  is  defined  by 
Eq.  (2.19a).  We  say  nothing  about  what  the  impulse  function  is  or  what  it  looks  like.  Instead, 
it  is  defined  only  in  terms  of  the  effect  it  has  on  a  test  function  fit).  We  define  a  unit  impulse 
as  a  function  for  which  the  area  under  its  product  with  a  function  fit)  is  equal  to  the  value  of 
the  function  fit)  at  the  instant  where  the  impulse  is  located.  Recall  that  the  sampling  property 
[Eq.  (2.19a)]  is  the  consequence  of  the  classical  (Dirac)  definition  of  impulse  in  Eq.  (2.17).  In 
contrast,  the  sampling  property  [Eq.  (2.19 a)]  defines  the  impulse  function  in  the  generalized 
function  approach. 


*  The  impulse  function  can  also  be  approximated  by  other  pulses,  such  as  a  positive  triangle,  an  exponential  pulse, 
or  a  Gaussian  pulse. 
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Figure  2.12 

(a)  Unit  step 
(unction  uit). 

(b)  Causal 
exponential 
e'a,u(t). 


The  Unit  Step  Function  u(t) 

Another  familiar  and  useful  function,  the  unit  step  function  u(t),  is  often  encountered  in  circuit 
analysis  and  is  defined  by  Fig.  2.12a: 


u(t)  = 


t>  0 
t  <0 


(2.20) 


If  we  want  a  signal  to  start  at  t  —  0  (so  that  it  has  a  value  of  zero  for  t  <  0),  we  need  only 
multiply  the  signal  by  u(f).  A  signal  that  starts  after  t  =  0  is  called  a  causal  signal.  In  other 
words,  g(l)  is  a  causal  signal  if 


8(0  =0  t<  0 

The  signal  e~al  represents  an  exponential  that  starts  at  t  =  -oo.  If  we  want  this  signal  to  start 
at  l  =  0  (the  causal  form),  it  can  be  described  as  e~a'u(t)  (Fig.  2.12b).  From  Fig.  2.1  lb,  we 
observe  that  the  area  from  — oo  to  t  under  the  limiting  form  of  <5(?)  is  zero  if  t  <  0  and  unity  if 
t  >  0.  Consequently, 


6(r)dr  = 


0 

1 


=  u(t) 


t<  0 
t  >  0 


From  this  result  it  follows  that 


(2.21a) 


(2.21b) 


2.5  SIGNALS  VERSUS  VECTORS 


I^mJVumZofTnTin0,"  'TT'””*"*  ™d  ™ore'  SiS"als  are  defined  for  only 

ITS (Say,"'  h  Wri,Kn  “  vecors  (of  dimension  N).  Thus, 
consiuer  a  signal  gU)  defined  over  a  closed  time  intervil  \„  in#  ■,»,  -r  „mh/ 

on  the  time  interval  \a ,  b ]  such  that  b]' If  WC  P‘ck  N  P°,ntS  unitormly 


t,=a.  r2  =  a  +  f,  t3  =  a  +  2e,  tn  =  a  + (N  -  \)€  =  b 
We  then  can  write  a  signal  vector  g  as  a  AT -dimensional  vector: 


S -[«('!)  g(t2)  ...  g(,w)] 


€  = 


b  -  a 
N  -T 
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Figure  2.13 

Component 
(projection)  of  a 
vector  along 
another  vector. 


x 


As  the  number  of  time  instants  N  increases,  the  sampled  signal  vector  g  will  grow.  Eventually, 
as  N  — >  oo,  the  signal  values  would  form  a  vector  g  with  infinitely  long  dimension.  Because 
6  — >  0,  the  signal  vector  g  would  transform  into  the  continuous  time  signal  g(t)  defined  over 
the  interval  [a,  b\.  In  other  words. 


lim  g  =  g(t)  t  e  [a,  b] 

N-*  oo 

This  relationship  clearly  shows  that  continuous  time  signals  are  straightforward  generalizations 
of  finite  dimension  vectors.  Thus,  basic  definitions  and  operations  in  a  vector  space  can  be 
applied  to  continuous  time  signals  as  well.  We  now  highlight  this  connection  between  the  finite 
dimension  vector  space  and  the  continuous  time  signal  space. 

We  shall  denote  all  vectors  by  boldface  type.  For  example,  x  is  a  certain  vector  with 
magnitude  or  length  ||x||.  A  vector  has  magnitude  and  direction.  In  a  vector  space,  we  can 
define  the  inner  (dot  or  scalar)  product  of  two  real-valued  vectors  g  and  x  as 

<g,  x>  =  llgll  •  ||x|| cos  9  (2.22) 

where  9  is  the  angle  between  vectors  g  and  x.  Using  this  definition  we  can  express  ||x||,  the 
length  (norm)  of  a  vector  x  as 


||x||2  =  <x,  x> 


(2.23) 


This  defines  a  normed  vector  space. 


2.5.1  Component  of  a  Vector  along  Another  Vector 

Consider  two  vectors  g  and  x,  as  shown  in  Fig.  2.13.  Let  the  component  of  g  along  x  be  cx. 
Geometrically,  the  component  of  g  along  x  is  the  projection  of  g  on  x  and  is  obtained  by 
drawing  a  perpendicular  from  the  tip  of  g  on  the  vector  x,  as  shown  in  Fig.  2.13.  What  is 
the  mathematical  significance  of  a  component  of  a  vector  along  another  vector?  As  seen  from 
Fig.  2. 1 3,  the  vector  g  can  be  expressed  in  terms  of  vector  x  as 

g  =  cx  +  e  (2.24) 

However,  this  does  not  describe  a  unique  way  to  decompose  g  in  terms  of  x  and  e.  Figure  2. 14 
shows  two  of  the  infinite  other  possibilities.  From  Fig.  2.14a  and  b,  we  have 


g  =  C|X  +  ei  =  C2X  +  e2 


(2.25) 
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Figure  2.14 

Approximations 
of  a  vector  in 
terms  of  another 
vector. 


c,x 


x 


X 


(a) 


(b) 


The  question  is:  Which  is  the  “best”  decomposition?  The  concept  of  optimality  depends  on 
what  we  wish  to  accomplish  by  decomposing  g  into  two  components. 

In  each  of  these  three  representations,  g  is  given  in  terms  of  x  plus  another  vector  called 
the  error  vector.  If  our  goal  is  to  approximate  g  by  cx  (Fig.  2.13) 


(2.26) 


g  -  g  =  c\ 


then  the  error  in  this  approximation  is  the  (difference)  vector  e=  g  —  cx.  Similarly,  the  errors 
in  approximations  in  Fig.  2.14a  and  b  are  e\  and  e2.  What  is  unique  about  the  approximation 
in  Fig.  2.13  is  that  its  error  vector  is  the  shortest  (with  smallest  magnitude  or  norm).  We  can 
now  define  mathematically  the  component  (or  projection)  of  a  vector  g  along  vector  x  to  be 
cx.  where  c  is  chosen  to  minimize  the  magnitude  of  the  error  vector  e  =  g  —  cx. 

Geometrically,  the  magnitude  of  the  component  of  g  along  x  is  ||g||  cos  0,  which  is  also 
equal  to  c||x||.  Therefore 


c||x||  =  llgll cos  6 


Based  on  the  definition  of  inner  product  between  two  vectors,  multiplying  both  sides  by  ||x|| 
yields 


c||x||2  =  llgll  ||x|| cos  0  =  <g,  x> 


and 


(2.27) 


<g.  x>  =  0 


(2.28) 


2.5.2  Decomposition  of  a  Signal  and  Signal  Components 


The  concepts  of  vector  comnonent  -.nj  „_i _ 


g(t)  z  cx{t)  t\  <t<t2 


(2.29) 
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The  error  e(t)  in  this  approximation  is 


For  “best  approximation,”  we  need  to  minimize  the  error  signal,  that  is,  minimize  its  norm. 
Minimum  signal  norm  corresponds  to  minimum  energy  Ee  over  the  interval  [fj,  ti\  given  by 


Note  that  the  right-hand  side  is  a  definite  integral  with  t  as  the  dummy  variable.  Hence  Ee  is  a 
function  of  the  parameter  c  (not  t)  and  Ee  is  minimum  for  some  choice  of  c.  To  minimize  Ee, 
a  necessary  condition  is 


(2.31) 


dc 


or 


Expanding  the  squared  term  inside  the  integral,  we  obtain 


From  which  we  obtain 


and 


(2.32) 


To  summarize  our  discussion,  if  a  signal  g(t)  is  approximated  by  another  signal  x(t)  as 


g(t)  ~  cx(t) 


then  the  optimum  value  of  c  that  minimizes  the  energy  of  the  error  signal  in  this  approximation 
is  given  by  Eq.  (2.32). 

Taking  our  clue  from  vectors,  we  say  that  a  signal  g(t)  contains  a  component  cjc(/),  where 
c  is  given  by  Eq.  (2.32).  As  in  vector  space,  cx(t)  is  the  projection  of  g(t)  on  *(f).  Consistent 
with  the  vector  space  terminologies,  we  say  that  if  the  component  of  a  signal  g(t)  of  the  form 
x(t)  is  zero  (that  is,  c  =  0),  the  signals  g(t)  and  x(t)  are  orthogonal  over  the  interval  [/| ,  til- 
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In  other  words,  with  respect  to  real-valued  signals,  two  signals  x(l)  and  g(t)  are  orthogonal 
when  there  is  zero  contribution  from  one  signal  to  the  other  (i.e.,  when  c  =  0).  Thus,  x(t)  and 


g(t)  are  orthogonal  if  and  only  if 


(2.33) 


g{t)x(t)dt  =  0 


Based  on  the  illustration  between  vectors  in  Fig.  2.14,  two  signals  are  orthogonal  if  and  only 
their  inner  product  is  zero.  This  relationship  indicates  that  the  integral  of  Eq.  (2.33)  is  closely 
related  to  the  concept  of  inner  product  between  vectors. 

Indeed,  compared  to  the  standard  definition  of  inner  product  of  two  A/ -dimensional  vectors 
g  and  x 


N 

<  g,  X  >=  J^giXi 


1=1 


the  integration  of  Eq.  (2.33)  has  an  almost  identical  form.  We  therefore  define  the  inner  product 
of  two  (real-valued)  signals  g(t)  and  x(t)  over  a  time  interval  [t\ ,  t2]  as 


(2.34) 


1 1 


Recall  from  algebraic  geometry  that  the  square  of  a  vector  length  ||x|p  is  equal  to  <x,  x>. 
Keeping  this  concept  in  mind  and  continuing  to  draw  the  analogy  to  vector  analysis,  we  define 
the  norm  of  a  signal  g(t)  as 


ll*(0ll  =  J<g(t),  g(tj> 


(2.35) 


which  is  the  square  root  of  the  signal  energy  in  the  time  interval.  It  is  therefore  clear  that  the 
norm  of  a  signal  is  analogous  to  the  length  of  a  finite-dimensional  vector.  More  generally,  the 
signals  may  not  be  merely  defined  over  a  continuous  segment  [/j,  t2].* 


Example  2.5  For  the  square  signal  g(t)  shown  in  Fig.  2.15  find  the  compon 
other  words,  approximate  g(t)  in  terms  of  sin  t: 

g(r)~csinr  0  <  /  <  2jt 

such  that  the  energy  of  the  error  signal  is  minimum. 


component  in  g(t)  of  the  form  sin  t.  In 


In  this  case 


■*(0  =  sin  t  and 


ation  may  be  over  a  set  of  time  s< 
inner  product  is  define  oc  ™ 


nner  product  is  defined  as  an  integral  over  the 
<  g(0,  x(t)  >=  f  g(t)x(t)dt 


segments  represented  simply  by  0.  For 
ntegral  over  the  time  domain  0 


Given  the  mner  product  definition,  the  signal 
for  anv  tir™  domain  . ; _ .  M£nai 


(2.36) 


for  any  time  domain  signals. 


^  and  the  signal  space  can  be  defined 
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Figure  2.1 5 

Approximation 
of  square  signal 
in  terms  of  a 
single  sinusoid. 


From  Eq.  (2.32),  we  find 

i*2n 


-'-f 

*  Jo 


g(t )  sin  tdt  — 


1 


71 


pTC  /»2  71 

I  sin  rdf  +  /  (—  sin  r)  Jr 

JO  J  71 


4 

71 


(2.37) 


Therefore 


4 

g(t)  ~  —  sin  t  (2.38) 

71 

represents  the  best  approximation  of  g(t)  by  the  function  sin  r,  which  will  minimize  the 
error  signal  energy.  This  sinusoidal  component  of  g(t)  is  shown  shaded  in  Fig.  2.15.  As 
in  vector  space,  we  say  that  the  square  function  g(t)  shown  in  Fig.  2.15  has  a  component 
of  signal  sin  t  with  magnitude  of  4/7T. 


2.5.3  Complex  Signal  Space  and  Orthogonality 

So  far  we  have  restricted  our  discussions  to  real  functions  of  t.  To  generalize  the  results  to 
complex  functions  of  /,  consider  again  the  problem  of  approximating  a  function  g(t)  by  a 
function  jc(f)  over  an  interval  (t\  <  t  <  t2): 

g(t)  ~  cx(t)  (2.39) 

where  g(t)  and  x(t)  are  complex  functions  of  t.  In  general,  both  the  coefficient  c  and  the  error 

e(t)  =  g(t)  -  cx(t)  (2.40) 

are  complex.  Recall  that  the  energy  Ex  of  the  complex  signal  x(t)  over  an  interval  [t\,  ti\  is 

Ex=  f  \x(t)\2dt 
Jt\ 

For  the  best  approximation,  we  need  to  choose  c  that  minimizes  Ee ,  the  energy  of  the  error 
signal  e(t)  given  by 
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Recall  also  that 


|M  +  v|2  =  (u  +  v)(m*  +  V*)  =  |w|2  +  |v|2  +  W*v  +  MV*  (2.42) 


After  some  manipulation,  we  can  use  this  result  to  express  the  integral  Ee  in  Eq.  ( 2.4 1 )  as 
Ee  =  T  \g(t)\2  dt  -  g(t)x*(t)dt  +  cjEx  -  -j=  jf  g(t)x*{t)  dt 


Since  the  first  two  terms  on  the  right-hand  side  are  independent  of  c,  it  is  clear  that  Ee  is 
minimized  by  choosing  c  such  that  the  third  term  is  zero.  This  yields  the  optimum  coefficient 


c  = 


(2.43) 


In  light  of  the  foregoing  result,  we  need  to  redefine  orthogonality  for  the  complex  case  as 
follows:  complex  functions  (signals)  jti(f)  and  *2(0  are  orthogonal  over  an  interval  (t  <  t\  <tj) 
as  long  as 


[  x\  (0*2(0  dt  =  0  or  [  xf(/)jt 2(0  dt  =  0  (2.44) 

Jt\  Jt\ 

In  fact,  either  equality  suffices.  This  is  a  general  definition  of  orthogonality,  which  reduces  to 
Eq.  (2.33)  when  the  functions  are  real. 

Similarly,  the  definition  of  inner  product  for  complex  signals  over  a  time  domain  0  can 
be  modified  as 


-I, 


<g(t),  x(t)>  =  I  g(t)x*(t)dr 

'(f:f€0) 


(2.45) 


Consequently,  the  norm  of  a  signal  g(t )  is  simply 


■II 


\\sm  =  \l  ig(/)i2*l 
'{/:»€©)  J 


1/2 


(2.46) 


2.5.4  Energy  of  the  Sum  of  Orthogonal  Signals 

equal  to  th^sVmTZ  mgmmS  squa^oHhe  1  ^  ^  ^  °f  tW°  0rth°g0nal  veCt°rS  iS 
orthogonal,  and  if  z  =  x  +  v.  then  ^  H  tW°  Vectors‘  Thus’  ,f  vectors  x  and  y  are 


i|z||2  =  ||x||2  + 


Thus,  if  signals  x(t)  and  y(t)  are  ortho^  l°  ^  SUm  °f  the  ener&ies  of  the  two  signals, 
then  anciM,)  are  orthogonal  ove.an  imerv.1 1„.  ,2l.  a'd  if  =  v(„ 


=  Ex  +  Ey 


(2.47) 
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We  now  prove  this  result  for  complex  signals,  of  which  real  signals  are  a  special  case. 
From  Eq.  (2.42)  it  follows  that 


\x(t)  +  y(t)\2  dt  =  f2\x(t)\2dt+  f2  \y(t)\2 dt  +  f2 x(t)y*(t)dt+  [  x*(t)y(t)dt 


(2.48) 


The  last  equality  follows  from  the  fact  that  because  of  orthogonality,  the  two  integrals  of  the 
cross  products  x(t)y*(t)  and  x*(t)y(t)  are  zero.  This  result  can  be  extended  to  sum  of  any 
number  of  mutually  orthogonal  signals. 


2.6  CORRELATION  OF  SIGNALS 


By  defining  the  inner  product  and  the  norm  of  signals,  we  have  set  the  foundation  for  signal 
comparison.  Here  again,  we  can  benefit  by  drawing  parallels  to  the  familiar  vector  space.  Two 
vectors  g  and  x  are  similar  if  g  has  a  large  component  along  x.  If  c  in  Eq.  (2.27)  is  large,  the 
vectors  g  and  x  are  similar.  We  could  consider  c  to  be  a  quantitative  measure  of  similarity 
between  g  and  x.  Such  a  measure,  however,  would  be  defective  because  c  varies  with  the 
norms  (or  lengths)  of  g  and  x.  To  be  fair,  the  amount  of  similarity  between  g  and  x  should  be 
independent  of  the  lengths  of  g  and  x.  If  we  double  the  length  of  g,  for  example,  the  amount  of 
similarity  between  g  and  x  should  not  change.  From  Eq.  (2.27),  however,  we  see  that  doubling 
g  doubles  the  value  of  c  (whereas  doubling  x  halves  the  value  of  c).  The  similarity  measure 
based  on  signal  correlation  is  clearly  faulty.  Similarity  between  two  vectors  is  indicated  by  the 
angle  9  between  the  vectors.  The  smaller  the  6 ,  the  larger  is  the  similarity,  and  vice  versa.  The 
amount  of  similarity  can  therefore  be  conveniently  measured  by  cos  9.  The  larger  the  cos  9 ,  the 
larger  is  the  similarity  between  the  two  vectors.  Thus,  a  suitable  measure  would  be  p  =  cos  9 , 
which  is  given  by 


We  can  readily  verify  that  this  measure  is  independent  of  the  lengths  of  g  and  x.  This 
similarity  measure  p  is  known  as  the  correlation  coefficient.  Observe  that 


-1  <p<  1 


(2.50) 


Thus,  the  magnitude  of  p  is  never  greater  than  unity.  If  the  two  vectors  are  aligned,  the  sim¬ 
ilarity  is  maximum  (p  =  1).  Two  vectors  aligned  in  opposite  directions  have  the  maximum 
dissimilarity  (p  =  -1).  If  the  two  vectors  are  orthogonal,  the  similarity  is  zero. 

We  use  the  same  argument  in  defining  a  similarity  index  (the  correlation  coefficient)  for 
signals.  For  convenience,  we  shall  consider  the  signals  over  the  entire  time  interval  from  — oo 
to  oo.  To  establish  a  similarity  index  independent  of  energies  (sizes)  of  g(t)  and  *(/),  we 
must  normalize  c  by  normalizing  the  two  signals  to  have  unit  energies.  Thus,  the  appropriate 
similarity  index  p  analogous  to  Eq.  (2.49)  is  given  by 


(2.51) 
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-1  <  p  <  1 


(2.52) 


2.6.1  Best  Friends,  Opposite  Personalities, 
and  Complete  Strangers 


We  can  readily  verify  that  if  g(t)  =  Kx(t),  then  p  =  1  when  K  is  any  positive  constant,  and 
p  -  - 1  when  K  is  any  negative  constant.  Also  p  =  0  if  g{t)  and.r(f)  are  orthogonal.  Thus,  the 
maximum  similarity  [when  g(t)  =  Ar(f)]  is  indicated  by  p  =  1,  the  maximum  dissimilarity 
[when  g(f)  =  -K.r(Ol  is  indicated  by  p  =  -1.  When  the  two  signals  are  orthogonal,  the 
similarity  is  zero.  Qualitatively  speaking,  we  may  view  orthogonal  signals  as  unrelated  signals. 
Note  that  maximum  dissimilarity  is  different  from  unrelatedness  qualitatively.  For  example, 
we  have  the  best  friends  (p  =  I ),  the  opposite  personalities  (p  =  —  1 ).  and  complete  strangers, 
who  do  not  share  anything  in  common  (p  =  0).  Our  opposite  personalities  are  not  strangers, 
but  are  in  fact,  people  who  do  and  think  always  in  opposite  ways. 

We  can  readily  extend  this  discussion  to  complex  signal  comparison.  We  generalize  the 
deiinition  of  p  to  include  complex  signals  as 


(2.53) 


Example  2.6  Find  the  correlation  coefficient  p  between  the  pulse x(t)  and  the  pulses  g,(f),  i  =  1,  2,  3,  4,  5, 


and  6  shown  in  Fig.  2.16. 

Wo  shall  compute  p  using  Eq.  (2.51)  for  each  of  the  six  cases.  Let  us  first  compute  the 
energies  of  all  the  signals. 


Hie  Cauchy -Schwarz  inequality  states  that  for  tv 
W,.h  equality  if  and  only  if*,)  =  **(„.  where  K 


two  real  energy  signals  g(,)  and  *,),  (/^  g(t)x(t)  d,f  < 
K  is  an  arbitrary  constant.  There  is  also  similar  inequality  for 
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Figure  2.16 

Signals  for 
Example  2.6. 


For  gi(t ),  a  =  1/5  and  T  =  5.  Therefore,  Eg4  =  2.1617.  For  gs(t),  a  =  1  and  T  =  oo. 
Therefore,  Eg5  =  0.5.  The  energy  of  Eg6  is  given  by 


^8  6 


/■5  • 

=  /  sin 

Jo 


2  2ntdt  =  2.5 


From  Eq.  (2.51),  the  correlation  coefficients  for  six  cases  are  found  to  be 

1  '5 


(1) 

(2) 

(3) 

(4) 

(5) 

(6) 


7(5X5) 
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7(  1.25)(5) 

1  z*5 
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Jo 

f\ 0. 

Jo 


7(5X5) 
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L 


(0.5)  Jt  =  1 

\)dt  =  -1 
5 


e~l/5dt  =  0.961 


n/(2.1617)(5) 

1  r5 

/  dt  =  0.628 

7(03x5)  y0 

^=/s 

7(23X5)  Jo 


sin  2;r/  dt  =  0 


Comments  on  the  Results  from  Example  2.6 

Because  gi(t)  =  *(/),  the  two  signals  have  the  maximum  possible  similarity,  and  p  =  1. 
However,  the  signal  gz(t)  also  shows  maximum  possible  similarity  with  p  =  1.  This  is 
because  we  have  defined  p  to  measure  the  similarity  of  the  waveshapes,  and  is  independent 
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For  *4(r),  p  =  0.961,  implying  a  high  degree  of  similarity  with  x(t).  This  is  reasonable 
because  j?4(0  is  very  similar  to  x(t)  over  the  duration  of  x(t)  (for  0  <  t  <  5).  Just  by 

•  i  .  i  /  ..  \  J  ^  ^ r\f  r*  •  t-i-i  1 1  or  ro toe 


For  g4(f),  P  =  0*961 
because  £4 (0  is  very 
inspection,  we  notice  t 


(/)  is  very  similar  to  xyi)  over  me  uuiauun  y 

we  notice  that  the  variations  or  changes  in  both  x(t)  and  g4(0  are  at  similar  rates. 


Such  is  not  the  case  with  g5(t),  where  we  notice  that  variations  in  g5(0  are  generally  at  a 
higher  rate  than  those  in  x(t).  There  is  still  a  considerable  degree  of  similarity;  both  signals 
always  remain  positive  and  show  no  oscillations.  Both  signals  have  zero  or  negligible  strength 
beyondr  =  5.Thus,g5(f)  is  similar  to  x(t),  but  not  as  similar  as  g4(0- This  is  why  p  =  0.628 
for  gs(t ). 

The  signal  g6(0  is  orthogonal  to  jt(f),  so  that  p  =  0.  This  appears  to  indicate  that  the 
dissimilarity  in  this  case  is  not  as  strong  as  that  of  g3(f),  for  which  p  =  —  1 .  This  may  seem 
odd  because  £3 (/)  appears  more  similar  to  jc(f),  than  does  gb(t).  The  dissimilarity  between 
x(l)  and  g3(/)  is  of  the  nature  of  opposite  characteristics;  in  a  way  they  are  very  similar, 
but  in  opposite  directions.  On  the  other  hand,  x(f)  and  g^(t)  are  dissimilar  because  they 
go  their  own  ways  irrespective  of  each  other  signal’s  variation;  it  is  of  the  nature  of  their 
being  strangers  to  each  other.  Hence  the  dissimilarity  of  g^(t)  to  x(t)  rates  lower  than  that 
of  £3(0- 

Sec.  2.10  provides  a  MATLAB  exercise  that  numerically  computes  the  correlation  coeffi¬ 
cients  [sign_cor.m| 


2.6.2  Application  to  Signal  Detection 

C  orrelation  between  two  signals  is  an  extremely  important  concept  that  measures  the  degree  of 
similarity  (agreement  or  alignment)  between  the  two  signals.  This  concept  is  widely  used  for 
signal  processing  in  radar,  sonar,  digital  communication,  electronic  warfare,  and  many  other 
applications. 


We  explain  such  applications  by  an  example  in  which  a  radar  signal  pulse  is  transmitted  in 
order  to  detect  a  suspected  target.  If  a  target  is  present,  the  pulse  will  be  reflected  by  it.  If  a  target 
is  not  present,  there  will  be  no  reflected  pulse,  just  noise.  By  detecting  the  presence  or  absence 
ol  the  reflected  pulse,  we  confirm  the  presence  or  absence  of  a  target.  The  crucial  problem 
in  this  procedure  is  to  detect  the  heavily  attenuated,  reflected  pulse  (of  known  waveform) 
buried  among  the  unwanted  noise  and  interferences.  Correlation  of  the  received  pulse  with  the 
transmitted  pulse  can  be  of  great  help  in  this  situation. 

We  denote  the  transmitted  radar  pulse  signal  as  g(t).  The  received  radar  return  signal  is 


(2.54) 


where  or  represents  the  tarpet  rpfWn™  _ .•  . 


orthogonality  between  w(f)  and  g(t  -  t0),  that  is. 


'i  <  t0  <  t2 


(2.55) 


2.6  Correlation  of  Signals  45 


Thus,  to  detect  whether  a  target  is  present,  a  correlation  can  be  computed  between  ziO  and  a 
delayed  pulse  signal  git  —  to) 


a Ep  target  present 
0  target  absent 


(2.56) 


Here  En  is  the  pulse  energy.  Given  the  orthogonality  between  w(r)  and  git  —  to),  the  target 
detection  problem  can  be  reduced  to  a  thresholding  problem  in  Eq.  (2.56)  to  determine  whether 
the  correlation  is  aEp  or  0  by  applying  a  magnitude  threshold.  Note  that  when  to  is  unknown, 
a  bank  of  N  correlators,  each  using  a  different  delay  r /, 


(2.57) 


may  be  applied  to  allow  the  receiver  to  identify  the  peak  correlation  at  the  correct  delay  r7  =  to- 
Alternatively,  a  correlation  function  can  be  used  (as  in  the  next  section). 

A  similar  situation  exists  in  digital  communication  when  we  are  required  to  detect  the 
presence  of  one  of  the  two  known  waveforms  in  the  presence  of  noise.  Consider  the  case 
of  binary  communication,  where  two  known  waveforms  are  received  in  a  random  sequence. 
Each  time  we  receive  a  pulse,  our  task  is  to  determine  which  of  the  two  (known)  waveforms 
has  been  received.  To  make  the  detection  easier,  we  must  make  the  two  pulses  as  dissimi¬ 
lar  as  possible.  This  means  we  should  select  one  pulse  to  be  the  negative  of  the  other  pulse. 
This  gives  the  highest  dissimilarity  (p  =  -  1).  This  scheme  is  sometimes  called  the  antipo¬ 
dal  scheme.  We  can  also  use  orthogonal  pulses  that  result  in  p  =  0.  In  practice  both  these 
options  are  used,  although  antipodal  is  the  best  in  terms  of  distinguishability  between  the  two 
pulses. 

Let  us  consider  the  antipodal  scheme  in  which  the  two  pulses  are  pit)  and  -pit).  The 
correlation  coefficient  p  of  these  pulses  is  -1.  Assume  no  noise  or  any  other  imperfections 
in  the  transmission.  The  receiver  consists  of  a  correlator  that  computes  the  correlation  coef¬ 
ficient  between  pit)  and  the  received  pulse.  If  p  is  1,  we  decide  that  pit)  is  received,  and 
if  p  is  -1,  we  decide  that  -pit)  is  received.  Because  of  the  maximum  possible  dissim¬ 
ilarity  between  the  two  pulses,  detection  is  easier.  In  practice,  however,  there  are  several 
imperfections.  There  is  always  an  unwanted  signal  (noise)  superimposed  on  the  received 
pulses.  Moreover,  during  transmission,  pulses  are  distorted  and  dispersed  (spread  out)  in 
time.  Consequently,  the  correlation  coefficient  is  no  more  ±1.  but  has  a  smaller  magni¬ 
tude,  thus  reducing  their  distinguishability.  We  use  a  threshold  detector,  which  decides 
that  if  p  is  positive,  the  received  pulse  is  pit),  and  if  p  is  negative,  the  received  pulse 
is  -pit). 

Suppose,  for  example,  that  pit)  has  been  transmitted.  In  the  ideal  case,  correlation  of  this 
pulse  at  the  receiver  would  be  1,  the  maximum  possible.  Now  because  of  noise  and  pulse 
distortion,  p  is  less  than  1 .  In  some  extreme  situation,  noise  and  pulse  overlapping  (spreading) 
make  this  pulse  so  dissimilar  to  pit)  that  p  is  a  negative  amount.  In  such  a  case  the  threshold 
detector  decides  that  -pit)  has  been  received,  thus  causing  a  detection  error.  In  the  same  way, 
if  -pit)  is  transmitted,  channel  noise  and  pulse  distortion  could  cause  a  positive  correlation, 
resulting  in  a  detection  error.  Our  task  is  to  make  sure  that  the  transmitted  pulses  have  sufficient 
energy  to  ensure  that  the  damage  caused  by  noise  and  other  imperfections  remains  within  a 
limit,  with  the  result  that  the  error  probability  is  below  some  acceptable  bound.  In  the  ideal 
case,  the  margin  provided  by  the  correlation  p  for  distinguishing  the  two  pulses  is  2  (from  1  to 
- 1 .  and  vice  versa).  Noise  and  channel  distortion  reduce  this  margin.  That  is  why  it  is  important 
to  start  with  as  large  a  margin  as  possible.  This  explains  why  the  antipodal  scheme  has  the  best 
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performance  in  terms  of  guarding  against  channel  noise  and  pulse  distortion.  For  other  reasons, 
however,  as  mentioned  earlier,  schemes  such  as  an  orthogonal  scheme,  where  p  0,  are  also 
used  even  when  they  provide  a  smaller  margin  (from  0  to  1,  and  vice  versa)  in  distinguishing 
the  pulses.  A  quantitative  discussion  of  correlation  in  digital  signal  detection  is  presented  in 
Chapter  1 1 . 

In  later  chapters  we  shall  discuss  pulse  dispersion  and  pulse  distortion  during  transmission, 
as  well  as  the  calculation  of  error  probability  in  the  presence  of  noise. 

2.6.3  Correlation  Functions 

We  should  revisit  the  application  of  correlation  to  signal  detection  in  radar,  where  a  signal 
pulse  is  transmitted  to  detect  a  suspected  target.  By  detecting  the  presence  or  absence  of  the 
reflected  pulse,  we  confirm  the  presence  or  absence  of  a  target.  By  measuring  the  time  delay 
between  the  transmitted  pulse  and  received  (reflected)  pulse,  we  determine  the  distance  of  the 
target.  Let  the  transmitted  and  the  reflected  pulses  be  denoted  by  g(f)  and  •;(/),  respectively,  as 
shown  in  Fig.  2. 17.  If  we  were  to  use  Eq.  (2.51)  directly  to  measure  the  correlation  coefficient 
p,  we  would  obtain 


(2.58) 


Thus,  the  correlation  is  zero  because  the  pulses  are  disjoint  (nonoverlapping  in  time).  The 
integral  in  Eq.  (2.58)  will  yield  zero  value  even  when  the  pulses  are  identical  hut  with  relative 


„  "  . ,  m  ...  . .  "  ’ ,  ,  U3,,,s  UIC  ,u«g™  on  me  right-hand  side  of  Eq.  (2.58)  we 

and  z(t )  defined  by  ^  ^'zg{T)'  th°  cross*corre,at'on  function  of  two  complex  signals  g(/) 


(2.59) 


rherefore,  ^( * )  is  an  indication  of  sii 
shifted)  by  r  seconds. 


milarity  (correlation)  of  g(t)  with  z(t)  advanced  (left- 


2.6.4  Autocorrelation  Function 


■  The  autocorrelation  function 


(2.60) 


with  its  own  displaced  version.  In  Chapter  3,  we 
provides  valuable  spectral  information  about  the 
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Figure  2.1  7 

Physical 
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correlation 
function. 
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2.7  ORTHOGONAL  SIGNAL  SETS 


In  this  section  we  show  a  way  of  representing  a  signal  as  a  sum  of  an  orthogonal  set  of  signals. 
In  effect,  this  orthogonal  set  of  signals  form  a  basis  for  the  specific  signal  space.  Here  again 
we  can  benefit  from  the  insight  gained  from  a  similar  problem  in  vectors.  We  know  that  a 
vector  can  be  represented  as  a  sum  of  orthogonal  vectors,  which  form  the  coordinate  system 
of  a  vector  space.  The  problem  in  signals  is  analogous,  and  the  results  for  signals  are  parallel 
to  those  for  vectors.  For  this  reason,  let  us  review  the  case  of  vector  representation. 


2.7.1  Orthogonal  Vector  Space 

Consider  a  multidimensional  Cartesian  vector  space  described  by  three  mutually  orthogonal 
vectors  X|,  X2,  and  X3,  as  shown  in  Fig.  2.18  for  the  special  case  of  three-dimensional  vec¬ 
tor  space.  First,  we  shall  seek  to  approximate  a  three-dimensional  vector  g  in  terms  of  two 
orthogonal  vectors  xi  and  X2: 


g  ~  C)X|  +  C2X2 

The  error  e  in  this  approximation  is 

e  =  g  —  (C]X|  +c2x2) 


or  equivalently 


g  =  cixi  +C2X2  -He 

Building  on  our  earlier  geometrical  argument,  it  is  clear  from  Fig.  2. 1 8  that  the  length  of  error 
vector  e  is  minimum  when  it  is  perpendicular  to  the  (xj,  X2)  plane,  and  when  t  |X|  and  C2X2 
are  the  projections  (components)  of  g  on  X|  and  \2,  respectively.  Therefore,  the  constants  c\ 
and  C2  are  given  by  formula  in  Eq.  (2.27). 

Now  let  us  determine  the  best  approximation  to  g  in  terms  of  all  the  three  mutually 
orthogonal  vectors  X| ,  X2,  and  X3: 


g  ~  C|X|  +C2X2  +  C3X3 


(2.61) 
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Figure  2.18  shows  that  a  unique  choice  of  c\,  ci ,  and  C3  exists,  for  which  Eq.  (2.61 )  is  no 
longer  an  approximation  but  an  equality: 


g  =  ClXi  +C2X2  +  C3X3 


In  this  case,  ciXi,ox2,  and  C3X3  are  the  projections  (components)  of  g  on  xj,X2,  and  X3, 
respectively.  Note  that  the  approximation  error  e  is  now  zero  when  g  is  approximated  in  terms 
of  three  mutually  orthogonal  vectors:  xi ,  X2,  and  X3.  This  is  because  g  is  a  three-dimensional 
vector,  and  the  vectors  xj,X2,  and  X3  represent  a  complete  set  of  orthogonal  vectors  in  three- 
dimensional  space.  Completeness  here  means  that  it  is  impossible  in  this  space  to  find  any  other 
vector  X4,  which  is  orthogonal  to  all  the  three  vectors  xj,X2,  and  X3.  Any  vector  in  this  space 
can  therefore  be  represented  (with  zero  error)  in  terms  of  these  three  vectors.  Such  vectors  are 
known  as  basis  vectors,  and  the  set  of  vectors  is  known  as  a  complete  orthogonal  basis  of 
this  vector  space.  If  a  set  of  vectors  { x, }  is  not  complete,  then  the  approximation  error  will 
generally  not  be  zero.  For  example,  in  the  three-dimensional  case  just  discussed,  it  is  generally 
not  possible  to  represent  a  vector  g  in  terms  of  only  two  basis  vectors  without  an  error. 

The  choice  ot  basis  vectors  is  not  unique.  In  fact,  each  set  of  basis  vectors  corresponds  to  a 
particular  choice  ot  coordinate  system.  Thus,  a  three-dimensional  vector  g  may  be  represented 
in  many  different  ways,  depending  on  the  coordinate  system  used. 

To  summarize,  if  a  set  of  vectors  {x, )  is  mutually  orthogonal,  that  is,  if 


x„> 


|0  m^n 

I  l^ml"  tn  =  n 


and  if  this  basis  set  is  complete,  a  vector  g  in  this  space  can  be  expressed  as 


8  —  ciX|  +C2X2  +  C3X3 

(2.62) 

where  the  constants  c,  are  given  by 

_  <8  */> 

—  - - - 

<X/,  X/> 

“  iff <8' Xi>  i  =  l'2'3 

(2.63a) 

(2.63b) 
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2.7.2  Orthogonal  Signal  Space 

We  continue  with  our  signal  approximation  problem  using  clues  and  insights  developed  for 
vector  approximation.  As  before,  we  define  orthogonality  of  a  signal  set^i(r),  *2(0,  •  •  •  xn(0 
over  a  time  domain  0  (may  be  an  interval  [t i,  ^2])  as 


xm{t)x*(t)dt 


0  m  n 

E„  m  =  n 


(2.64) 


If  all  signal  energies  are  equal  E„  =  1 ,  then  the  set  is  normalized  and  is  called  an  orthonormal 
set.  An  orthogonal  set  can  always  be  normalized  by  dividing  x„(t)  by  for  all  «•  Now,  con¬ 
sider  the  problem  of  approximating  a  signal  g(f)  over  the  (-)  by  a  set  of  N  mutually  orthogonal 
signals xi(f),jr2(0.  ...,xnU): 


g(t)  ~  cixi(r)  +  C2*2(0  H - 1-  cnxn(0  (2.65a) 

N 

=  ^pcvc„(r)  te&  (2.65b) 

n=l 


It  can  be  shown  that  Ee ,  the  energy  of  the  error  signal  e{t)  in  this  approximation  is  minimized 
if  we  choose 


f  g(t)x*(t)  dt 

.  _  Jie& _ 

f  \x„(t)\2dt 
Jtee 

=  —  f  g(t)x„(t)dt 

En  Je 


n  =  1,  2, ..  .,N 


(2.66) 


Moreover,  if  the  orthogonal  set  is  complete,  then  the  error  energy  Ee  0,  and  the  represen¬ 
tation  in  (2.65)  is  no  longer  an  approximation,  but  an  equality.  More  precisely,  let  the  A-term 
approximation  error  be  defined  by 


N 

es(t)  =  g(t)  —  [ciJt|(/)  +  C2X2O)  +  •  •  •  +  Cfl/XN(t)]  =  g(t)  —  c„x„(t)  t  €  (~)  (2.67) 

n=l 


If  the  orthogonal  basis  is  complete,  then  the  error  signal  energy  converges  to  zero,  that  is, 

lim  [  \eN(t)\2dt  =  0  (2.68) 

n->oc 

Although  strictly  in  a  mathematical  sense,  a  signal  may  not  converge  to  zero  even  though  its 
energy  does.  This  is  because  a  signal  may  be  nonzero  at  some  isolated  points.*  Still,  for  all 
practical  purposes,  signals  are  continuous  for  all  t ,  and  the  equality  (2.68)  states  that  the  error 
signal  has  zero  energy  as  N  -*  00.  Thus,  for  N  00,  the  equality  (2.65)  can  be  loosely 


Known  as  a  measure-zero  set 
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written  as 


g(t)  =  C|JC|  (/)  +  C2X2  (/)  +  •  •  •  +  c„x„(t)  +  ■  •  • 

00 

=  Y/c„x„(t)  te@  (2.69) 

n=l 


where  the  coefficients  cn  are  given  by  Eq.  (2.66).  Because  the  error  signal  energy  approaches 
zero,  it  follows  that  the  energy  of  g(t)  is  now  equal  to  the  sum  of  the  energies  of  its  orthogonal 
components. 

The  series  on  the  right-hand  side  of  Eq.  (2.69)  is  called  the  generalized  Fourier  series  of 
g(t)  with  respect  to  the  set  {*„(/)).  When  the  set  {*„(?)}  is  such  that  the  error  energy  £/v  -*  0 
as  N  -►oo  for  every  member  of  some  particular  signal  class,  we  say  that  the  set  {*„(/)}  is 
complete  on  [t :  0}  for  that  class  of  g(f),  and  the  set  {jt„(r)}  is  called  a  set  of  basis  functions 
or  basis  signals.  In  particular,  the  class  of  (finite)  energy  signals  over  0  is  denoted  as  L?(&}. 


2.7.3  Parseval's  Theorem 

Recall  that  the  energy  ot  the  sum  of  orthogonal  signals  is  equal  to  the  sum  of  their  energies. 
I  heretore,  the  energy  of  the  right-hand  side  of  Eq.  (2.69)  is  the  sum  of  the  energies  of  the 
individual  orthogonal  components.  The  energy  of  a  component  c„x„(t)  is  czEn.  Equating  the 
energies  of  the  two  sides  of  Eq.  (2.69)  yields 

Eg  =  C\E\  +  c\E2  +  c\E-i  -\ - 

=  (2.70) 


(area  under  the  ,,,  ,  ,  *  7  °'  Farseval’s theorem  Recall  that  the  signal  energy 

the  vector-sign^analogy11  In6 vector  space* wTk^°8°Uh  ‘°  T  °f  the  'ength  °f  “  VeCt°r 

equal  to  the  sum  of  the  squares  of  the  lengths  of  its  or!^  T**  ^  ^  'ength  °f  “  VeC,°r  “ 
(2.70)  is  the  statement  of  this  fact  as  applied  to  C°mp°nentS-  Parseval’S  the°rem 

2.7.4  Some  Examples  of  Generalized  Fourier  Series 

2 -  ^  ^  *!"  -  ^na,  are  vectors  in  every 

depending  upon  the  choice  of  a  coordinate  s  a|SUm  °f  .,tS  comPonents  in  a  variety  of  ways, 
its  components  in  a  variety  of  ways  J  ^  ^  3  S'^na*  can  be  represented  as  a  sum  of 

mutually  orthogonal  vectors,  we  also  ha**S  ^  "?  *1ave  vector  coordinate  systems  formed  by 

a  variety  of  sets  of  mutually  orthogonal  sienaf"^001^'.11316  systems  (basis  signals)  formed  by 
sets  that  can  he  used  as  basis  signals  for  *  CX*sts  a  'ar£e  number  of  orthogonal  signal 
sets  are  trigonometric  (sinusoid)  function?"6™  F0Urier  Series-  Some  well-known  signal 
Rcssel  functions,  Legendre  polynomials  I  eXponent'a*  (sinusoid)  functions.  Walsh  functions, 

polynomials,  and  Chebyshev  polynomial’s  fUnCtions’  Jacob'  polynomials,  Hermitian 
(he  trigonometric  and  the  exponential  sinusoids  huh'0”8  ^  COncern  us  most  'n  tb's  bo0^  are 
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2.8  TRIGONOMETRIC  FOURIER  SERIES 

We  first  consider  the  class  of  (real  or  complex)  periodic  signals  with  period  7o.  This  space  of 
periodic  signals  of  period  To  has  a  well-known  complete  orthogonal  basis  formed  by  real-valued 
trigonometric  functions.  Consider  a  signal  set: 

{1,  cos  coot ,  cos  2coot,  . . . ,  cos  ncoot ,  . . . ;  sin  coo t,  sin  2 coot,  . . . ,  sin  ncoot ,  . . .} 

(2.71) 

A  sinusoid  of  angular  frequency  ncoo  is  called  the  A/th  harmonic  of  the  sinusoid  of  angular 
frequency  coo  when  n  is  an  integer.  Naturally  the  sinusoidal  frequency  fy  (in  hertz)  is  related 
to  its  angular  frequency  coo  via 


coo  =  27t/0 


These  terms  offer  different  conveniences  and  are  equivalent.  We  note  that  <^o  and  Info  are 
equally  commonly  used  in  practice.  Neither  offers  any  distinct  advantages.  For  this  reason, 
we  will  be  using  coo  and  fo  interchangeably  in  this  book  according  to  the  convenience  of  the 
particular  problem  in  question. 

The  sinusoid  of  angular  frequency  coo  serves  as  an  anchor  in  this  set,  called  the  funda¬ 
mental  tone  of  which  all  the  remaining  terms  are  harmonics.  Note  that  the  constant  term  1 
is  the  zeroth  harmonic  in  this  set  because  cos  (0  x  coot)  =  1.  We  can  show  that  this  set  is 
orthogonal  over  any  interval  of  duration  7o  =  2n/coo,  which  is  the  period  of  the  fundamental. 
This  follows  from  the  equations  (proved  in  Appendix  A): 


and 


0 

Zb 

2 


I  cos  ncoot  cos  mcoot  dt  = 
JTq 


f  sin  ncoot  sin  mcoot  dt  =  I  7^ 

Jt0  l  2 


/  sin  ncoot  cos  mcoot  dt  —  0 
Jtq 


n^m 
m  =  n  7^  0 

n  ^  m 
n  =  m  ^  0 


for  all  n  and  m 


(2.72a) 

(2.72b) 


(2.72c) 


The  notation  fT  means  “integral  over  an  interval  from  t  =  t\  to  t\  +  7o  for  any  value  of  fi” 
These  equations  show  that  the  set  (2.71)  is  orthogonal  over  any  contiguous  interval  of  duration 
7o-  This  is  the  trigonometric  set,  which  can  be  shown  to  be  a  complete  set.3’ 4  Therefore,  we 
can  express  a  signal  g(t)  by  a  trigonometric  Fourier  series  over  any  interval  of  duration  7o 
seconds  as 


g(t)  =  ao  +  a]  cos  coot  +  a2  cos  2 oot  H - 

+  b\  sin  coot  +  £2  sin  2 coot  H -  t\  <  t  <  t\  +  To  (2.73a) 

or 

00 

g(t)  =  ao  +  ^  (tin  cos  ncoot  +  bn  sin  noot)  t\  <  t  <  t\  +  To  (2.73b) 

n=\ 
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where 


2  n  ,  1 

t oo  =  Info  =  —  and  /o  - 
To 


To 


(2.74) 


We  can  use  Eq.  (2.66)  to  determine  the  Fourier  coefficients  ao,  a„,  and  b„.  Thus 


/  g(t)  cos  no)otdt 

_  Ju _ 

a"~  r»+T0 

/  cos“  ncDQt  dt 

Jt\ 


(2.75) 


The  integral  in  the  denominator  of  Eq.  (2.75)  as  seen  from  Eq.  (2.72a)  (with  m  =  n)  is  7o/2 
when  n  ^  0.  Moreover,  for  n  =  0,  the  denominator  is  7q.  Hence 


1  ft\+To 

ao  =  —  g(t)dt 

To  Jti 


(2.76a) 


and 


2  /*' >+7o  2  rh+ro 

an  =  ToJ,  80) cos  moot dt  =  —  J  g(t) cos  nlnfot dt  n  =  1,2,3,... 

(2.76b) 


By  means  of  a  similar  argument,  we  obtain 
2  r'l+Tb 


2  2  /,,'+7b 

bn~  T0  Jtl  g(t) S1"  dt  =  To],  g(,) sin  n27r/o' *  «  =  1 , 2, 3, . . . 


(2.76c) 

^  signals;6  trig°n°me,ric  F°Urier  series  of  (2-73>  applies  to  both  real  and  complex 

2.8.1  Compact  Trigonometric  Fourier  Series 

(2  76)  are  also  Tconlu  "f  Fourier  series  coefficka*  of  E* 

*  «* «-»  real-valued  sine 

same  fluency  using  «*  *el|.k„ow„"ngonolfc  'n  *  Single  °' "" 


sin  nlnfot  =  C„  cos  (ribtfot  +  0„) 

(2.77) 

'  ~  \Jan2  +  bn2 

(2.78a) 

(2.78b) 

where 
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For  consistency  we  denote  the  dc  term  ao  by  Co,  that  is, 

Co  =  ao  (2.78c) 

From  the  identity  (2.77),  the  trigonometric  Fourier  series  in  Eq.  (2.73)  can  be  expressed  in  the 
compact  form  of  the  trigonometric  Fourier  series  as 

00 

g(t)  =  Co  +  ^  c„  cos  ( nlirfot  +  6n)  t\  <  t  <  t\  +  7o  (2.79) 

n=  i 


where  the  coefficients  C„  and  9n  are  computed  from  a„  and  b„  by  using  Eq.  (2.78). 

Equation  (2.76a)  shows  that  a0  (or  Co)  is  the  average  value  of  g(t)  (averaged  over  one 
period).  This  value  can  often  be  determined  by  direct  inspection  of  g(/). 


Example  2.7  Find  the  compact  trigonometric  Fourier  series  for  the  exponential  e  shown  in  Fig.  2.19a 
over  the  interval  0  <  t  <  n. 

Because  we  are  required  to  represent  g(t)  by  the  trigonometric  Fourier  series  over  the 
interval  0  <  t  <  tt.  To  =  n,  and  the  fundamental  angular  frequency  and  the  fundamental 
frequency  are 


2jt  1  1 

ruo  =  —  =2  rad/s  and  /o  =  —  =  z 
To 


To  71 


respectively.  Therefore 

00 

g(t)  =  ao  +  ^  an  c°s  2nf  +  bn  sin  2 nt  0  <  t  <  tt 


n=  1 


where  [from  Eq.  (2.76)] 

00  =  1  r  e~,/2dt  =  0.504 

X  J  0 

an  =  —  f  e~r/2  cos  2 nt  dt  =  0.504  ( 
*  Jo  V 


1  +  16/7 


0 


and 


b-  =  \  [  e~'n  sin  lm  d‘  -  °'504  ( ITTfe3 ) 


Therefore 


g(t)  =  0.504 


i  +  E 


n=  1 


1  +  \6n2 


(cos  2 nt  +  4 n  sin  2 nt) 


0  <  t  <  n 
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Figure  2.19 

(a,  b)  Periodic 
signal  and  (c,  d) 
its  Fourier 
spectra. 
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To  find  the  compact  Fourier  series,  we  use  Eq.  (2.78)  to  find  its  coefficients,  as 


C0  =  fl0  =  0.504 


C«  =  Jal  +  %  =  0.504^ 


64n2 


(1  +  16n2)2  (1  +  16/i2)2 

en  =  tan’1  =  tan-l(-4n)  =  -  tan"1 4 n 


(2.80) 


The  amplitude  and  phases  of  the  dc 
Eq.  (2.80)  and  displayed  in  Table  2.1. 
in  the  compact  trigonometric  Fourier 


and  the  first  seven  harmonics  are  computed  from 

We  can  use  these  numerical  values  to  express  g(t) 
series  as 


g(t)  =  0.504  +  0.504  V' _ 1  cn,n  ' 

16^2  (2"r~tan  4«)  0<t<7T  (2.81a) 

=  0.504  +  0.244 cos  (2r  -  75  96°\  _l  n  ns  <a 

/j.vo  )  +  U.l 25 cos (4?  —  82.87°) 

+  0.084  cos  (6r  -  85  24°  t  4-  n  oat 

> +  0  063  cos  (8/  -  86.42°)  -\ _ 


0  <t  <n 
(2.81b) 
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TABLE  2.1 


n  0  1  2  3  4  5  6  7 


C„  0.504  0.244  0.125  0.084  0.063  0.0504  0.042  0.036 

6„  0  -75.96°  -82.87°  -85.24°  -86.42°  -87.14°  -87.61°  -87.95° 


2.8.2  Periodicity  of  the  Trigonometric  Fourier  Series 

We  have  shown  how  an  arbitrary  signal  g(t)  may  be  expressed  as  a  trigonometric  Fourier  series 
over  any  continuous  interval  of  To  seconds.  The  Fourier  series  is  equal  to  g(f )  over  this  interval 
alone,  as  shown  in  Fig.  2.19a  and  b.  Outside  this  interval,  the  series  is  not  necessarily  equal  to 
g{t).  It  would  be  interesting  to  find  out  what  happens  to  the  Fourier  series  outside  this  interval. 
We  now  show  that  the  trigonometric  Fourier  series  is  a  periodic  function  of  period  To  (the 
period  of  the  fundamental).  Let  us  denote  the  trigonometric  Fourier  series  on  the  right-hand 
side  of  Eq.  (2.79)  by  Therefore 

OO 

(p(t)  =  Co  +  ^2  Cn  cos  (n2nfot  +  0n)  for  all  t 

ti— l 

and 

OO 

(p(t  +  To)  =  Co  +  ^  cn  cos  [n2nfo(t  +  7o)  +  9„] 

n=  l 
OO 

=  Co  +  E  C„  cos  [nlnfyt  +  2nn  +  0n] 

n—  1 

oo 

=  Co  +  ^2  c„  cos  {nlnfot  +  9n) 

n=  l 

=  <p(t)  for  all  t  (2.82) 

This  shows  that  the  trigonometric  Fourier  series  is  a  periodic  function  of  period  To  (the  period 
of  its  fundamental).  For  instance,  <p(r),  the  Fourier  series  on  the  right-hand  side  of  Eq.  (2.8 1  b) 
is  a  periodic  function  in  which  the  segment  of  g(t)  in  Fig.  2.19a  over  the  interval  (0  <  t  <  n) 
repeats  periodically  every  n  seconds  as  shown  in  Fig.  2.19b.* 

Thus,  when  we  represent  a  signal  g(t)  by  the  trigonometric  Fourier  series  over  a  certain 
interval  of  duration  To,  the  function  g(t)  and  its  Fourier  series  <p(t)  need  be  equal  only  over 
that  interval  of  T0  seconds.  Outside  this  interval,  the  Fourier  series  repeats  periodically  with 
period  T0.  Now  if  the  function  g(t)  were  itself  to  be  periodic  with  period  T0,  then  a  Fourier 
series  representing  g(t)  over  an  interval  T0  will  also  represent  g(t)  for  all  /  (not  just  over  the 
interval  7b).  Moreover,  such  a  periodic  signal  g(t)  can  be  generated  by  a  periodic  repetition 
of  any  of  its  segment  of  duration  T0.  Therefore,  the  trigonometric  Fourier  series  representing 
a  segment  of  g(t)  of  duration  T0  starting  at  any  instant  represents  g(f)  for  all  t.  This  means  in 
computing  the  coefficients  a0 ,  an,  and  bn ,  we  may  use  any  value  for  t\  in  Eq.  (2.76).  In  other 


*  In  reality,  the  series  convergence  at  the  points  of  discontinuity  shows  about  9%  overshoot  (Gibbs  phenomenon). 
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words,  we  may  perform  this  integration  over  any  interval  of  T0.  Thus,  the  Fourier  coefficients 
of  a  series  representing  a  periodic  signal  g(t)  (for  all  t)  can  be  expressed  as 


(2.83a) 


—  [  g(t)  cos  nlnfatdt 
To  Jt0 


(2.83b) 


n  =  1,2,3,... 


and 


b„  =  J-  f  g(0sin  nlnfotdt 
fo  Jtq 


(2.83c) 


n  =  1,2,3, ... 


where  JTi  means  that  the  integration  is  performed  over  any  interval  of  Tq  seconds. 


2.8.3  The  Fourier  Spectrum 

The  compact  trigonometric  Fourier  series  in  Eq.  (2.79)  indicates  that  a  periodic  signal  g(t)  can 

be  expressed  as  a  sum  of  sinusoids  of  frequencies  0  (dc),/o,  2/o, . . . ,  m/o, _ whose  amplitudes 

are  Co,  Cj,  C2, . . . ,  C„,  —  and  whose  phases  are  0,  Q\,  62, . . . ,  0„, ... ,  respectively.  We  can 
readily  plot  amplitude  Cn  versus /  (amplitude  spectrum)  and  6n  versus/  (phase  spectrum). 
These  two  plots  of  magnitude  and  phase  together  are  the  frequency  spectra  of  #(0* 

Figure  2.19c  and  d  show  the  amplitude  and  phase  spectra  for  the  periodic  signal  <p(t) 
in  Fig.  2.19b.  These  spectra  tell  us  at  a  glance  the  frequency  composition  of  (p(t):  that  is, 
the  amplitudes  and  phases  of  various  sinusoidal  components  of  </?(/).  Knowing  the  frequency 
spectra,  vve  can  reconstruct  or  synthesize  (p(t)y  as  shown  on  the  right-hand  side  of  Eq.  (2.81a). 
Therefore,  the  frequency  spectra  in  Figs.  2.19c  and  d  provide  an  alternative  description— the 
frequency  domain  description  of  The  time  domain  description  of  <p(t)  is  shown  in 
Fig.  2.19b.  A  signal,  therefore,  has  a  dual  identity:  the  time-domain  identity  <p{t)  and  the 
frequency  domain  identity  (Fourier  spectra).  The  two  identities  complement  each  other;  taken 
together,  they  provide  a  better  understanding  of  a  signal. 

Series  Convergence  at  Jump  Discontinuities 


integrable  over  one  period;  that  is. 


(2.84) 
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This  is  known  as  the  weak  Dirichlet  condition.  If  a  function  g(t)  satisfies  the  weak  Dirichlet 
condition ,  the  existence  of  a  Fourier  series  is  guaranteed ,  but  the  series  may  not  converge 
at  every  point.  For  example,  if  a  function  g(t)  is  infinite  at  some  point,  then  obviously 
the  series  representing  the  function  will  be  nonconvergent  at  that  point.  Similarly,  if  a 
function  has  an  infinite  number  of  maxima  and  minima  in  one  period,  then  the  function 
contains  an  appreciable  amount  of  the  infinite  frequency  component  and  the  higher  coef¬ 
ficients  in  the  series  do  not  decay  rapidly,  so  that  the  series  will  not  converge  rapidly  or 
uniformly.  Thus,  for  a  convergent  Fourier  series,  in  addition  to  condition  (2.84),  we  require 
that 

2.  The  function  g(t)  can  have  only  a  finite  number  of  maxima  and  minima  in  one  period, 
and  it  may  have  only  a  finite  number  of  finite  discontinuities  in  one  period. 

These  two  conditions  are  known  as  the  strong  Dirichlet  conditions.  We  note  here  that  any 
periodic  waveform  that  can  be  generated  in  a  laboratory  satisfies  strong  Dirichlet  conditions 
and  hence  possesses  a  convergent  Fourier  series.  Thus,  a  physical  possibility  of  a  periodic 
waveform  is  a  valid  and  sufficient  condition  for  the  existence  of  a  convergent  series. 


Example  2.8  Find  the  compact  trigonometric  Fourier  series  for  the  periodic  square  wave  w(t)  shown  in 
Fig.  2.20a,  and  sketch  its  amplitude  and  phase  spectra. 


The  Fourier  series  is 


w(t)  =  do  +  ^  an  cos  nlnfot  4-  b„  sin  nlnfyt 


Figure  2.20 

(a)  Square  pulse 
periodic  signal 
and  (b)  its  Four¬ 
ier  spectrum. 


-371 


-271 


wit) 


2k 


371 


(a) 


(b) 
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where 


ao  =  I  w(r)Jr 

h  JTo 

In  the  foregoing  equation,  we  may  integrate  \v(t)  over  any  interval  of  duration  7o.  In  this 
particular  case,  the  signal  period  is  7o  =  2jt.  Thus, 


Thus,  it  is  simpler  to  use  coo  to  represent  the  Fourier  series.  Figure  2.20a  shows  that  the 
best  choice  for  a  region  of  integration  is  from  -7o/2  to  7o/2.  Because  w(t)  =  1  only  over 


and  w(t)  =0  over  the  remaining  segment, 


(2.85a) 


We  could  have  found  ao,  the  average  value  of  w(r),  to  be  0.5  merely  by  inspection  of  w(t) 
in  Fig.  2.20a.  Also, 


0  n  even 


? 


(2.85b) 


(2.85c) 


In  these  derivations  we  used  the  fact  that  f0T0  =  1.  Therefore 


(2.86) 
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Figure  2.21 

Bipolar  square 
pulse  periodic 
signal. 


radians.  This  can  be  seen  from  the  trigonometric  identity* 

—  cos  x  =  cos  ( x  —  n) 

Using  this  fact,  we  can  express  the  series  in  Eq.  (2.86)  as 


w(t) 


1  2  r 

=  2  +  *  [' 


COS  ft*)t  +  -  COS  (3ft*) t  —  7r)  +  -  COS  5 ft*)/ 


4-  \j  cos  (7 ft*)/  —  ji)  +  -  cos  9 ft*)/  + 


This  is  the  desired  form  of  the  compact  trigonometric  Fourier  series.  The  amplitudes  are 


and  the  phases  are 


0n  = 


C0  = 

2 

1 0  n  even 

Cn  = 

I  ^  n  odd 

0 

n^  3,7,11,15 

—Tt 

n  =  3,7, 11,15 

We  could  plot  amplitude  and  phase  spectra  using  these  values.  We  can,  however, 
simplify  our  task  in  this  special  case  if  we  allow  amplitude  C„  to  take  on  negative  values. 
If  this  is  allowed,  we  do  not  need  a  phase  of  -n  to  account  for  the  sign.  This  means  that 
the  phases  of  all  components  are  zero,  and  we  can  discard  the  phase  spectrum  and  manage 
with  only  the  amplitude  spectrum,  as  shown  in  Fig.  2.20b.  Observe  that  there  is  no  loss  of 
information  in  doing  so  and  that  the  amplitude  spectrum  in  Fig.  2.20b  has  the  complete 
information  about  the  Fourier  series  in  (2.86).  Therefore,  whenever  all  sine  terms  vanish 
(b„  =0),  it  is  convenient  to  allow  C„  to  take  on  negative  values.  This  permits  the  spectral 
information  to  be  conveyed  by  a  single  spectrum— the  amplitude  spectrum.  Because  C„ 
can  be  positive  as  well  as  negative,  the  spectrum  is  called  the  amplitude  spectrum  rather 
than  the  magnitude  spectrum. 

Another  useful  function  that  is  related  to  the  periodic  square  wave  is  the  the  bipolar 
square  wave  w0(f)  shown  in  Fig.  2.21.  We  encounter  this  signal  in  switching  applications. 


w0(0 


1 

- 1 - 

- 1 - 

-To 

-1- 

To 

2  T0  t- 

*  Because  cos  (x  ±  n)  =  -  cos  x,  we  could  have  chosen  the  phase  jt  or  - jr.  In  fact,  cos  (x  ±  Nit)  ~  -  cos  x  for  any 
odd  integral  value  of  N.  Therefore,  the  phase  can  be  chosen  as  ±Nir  where  N  is  any  convenient  odd  integer. 
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Note  that  w„(t)  basically  w(f)  minus  its  dc  component.  It  is  easy  to  see  that 

w„(t)  =  2 [w(t)  -  0.5] 


Hence,  from  Eq.  (2.86),  it  follows  that 

w„(t )  =  -  ^cos  wot  ~  ^  cos  3wo t  +  ^  cos  5wot  -  ^  cos  Icoot  H - ^  (2.87) 

Comparison  of  Eq.  (2.87)  with  Eq.  (2.86)  shows  that  the  Fourier  components  of  w„(t)  are 
identical  to  those  of  w(f)  [Eq.  (2.86)]  in  every  respect  except  for  doubling  the  amplitudes 
and  loss  of  dc. 


Example  2.9 


Find  the  trigonometric  Fourier  series  and  sketch  the  corresponding  spectra  for  the  periodic 
impulse  train  8r0(t)  shown  in  Fig.  2.22a. 


The  trigonometric  Fourier  series  for  8j0(t)  is  given  by 


&T0(t)  =  Co  +  ^C„  cos  (nlnfot  +  6n)  wq  =  — 

To 


We  first  compute  ao,  a„  and  b„: 


1  [T 1 
ao  =  —  8(t)  dt  =  — 

To  J-To/2  To 

2  fb>/2  2 

a"  =  —  8(t)  cos  nlnfotdt  =  — 

To  J-Tq/2  To 


Figure  2.22 

(a)  Impulse  train 
and  (b)  its 
Fourier  spectrum. 
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-To 
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(a) 
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This  result  follows  from  the  sampling  property  (2.19a)  of  the  impulse  function.  Similarly, 
by  using  the  sampling  property  of  the  impulse,  we  obtain 


To/2 

8(t)  sin  nlnfotdt  =  0 

-To/2 


Therefore,  Co  =  1  /7o,  C„  =  2/Tq  and  0n  =  0.  Thus 


(2.88) 


n=  1 


Figure  2.22b  shows  the  amplitude  spectrum.  The  phase  spectrum  is  zero. 


2.8.4  The  Effect  of  Symmetry 

The  Fourier  series  for  the  signal  g(t)  in  Fig.  2.19a  (Example  2.7)  consists  of  sine  and  cosine 
terms,  but  the  series  for  the  signal  w(t)  in  Fig.  2.20a  (Example  2.8)  consists  of  cosine  terms 
only.  In  some  cases  the  Fourier  series  consists  of  sine  terms  only.  None  ot  this  is  accidental. 
It  can  be  shown  that  the  Fourier  series  of  any  even  periodic  function  g(t)  consists  of  cosine 
terms  only  and  the  series  for  any  odd  periodic  function  g(r)  consists  of  sine  terms  only  (see 
Prob.  2.8-1). 


2.9  THE  EXPONENTIAL  FOURIER  SERIES 


We  noted  earlier  that  orthogonal  signal  representation  is  NOT  unique.  While  the  trigonometric 
Fourier  series  allows  a  good  representation  of  all  periodic  signals,  here  we  provide  a  new 
orthogonal  representation  of  periodic  signals  that  is  equivalent  to  the  trigonometric  Fourier 
series  but  has  a  simpler  form. 

First  of  all,  it  is  clear  that  the  set  of  exponentials  e Jnw°'  («  =  0.  ±  1 ,  ±2, . . .)  is  orthogonal 
over  any  interval  ot  duration  7q  =  2tt/(oo,  that  is. 


m  ^  n 
m  =  n 


(2.89) 


Moreover,  this  set  is  a  complete  set.3-4  From  Eqs.  (2.66)  and  (2.69),  it  follows  that  a  signal 
g(r)  can  be  expressed  over  an  interval  of  7b  second(s)  as  an  exponential  Fourier  series 


00 


S<0  =  £ 


oo 


=  Y,  Dnjn2nh' 


(2.90) 


62 


SIGNALS  AND  SIGNAL  SPACE 


where  |see  Eq.  (2.66)] 

Dn  =  l  f  g(t)e-jn2llfo,dt  (2.91) 

To  JTo 

The  exponential  Fourier  series  is  basically  another  form  of  the  trigonometric  Fourier  series. 
Each  sinusoid  of  frequency /  can  be  expressed  as  a  sum  of  two  exponentials,  e i  and  e  f . 
As  a  result,  the  exponential  Fourier  series  consists  of  components  of  the  form  e)n~7TU]l  with 
n  varying  from  —  oo  to  oo.  The  exponential  Fourier  series  in  Eq.  (2.90)  is  periodic  with 
period  7o. 

To  see  its  close  connection  with  the  trigonometric  series,  we  shall  rederive  the  exponential 
Fourier  series  from  the  trigonometric  Fourier  series.  A  sinusoid  in  the  trigonometric  series  can 
be  expressed  as  a  sum  of  two  exponentials  by  using  Euler’s  formula: 


C„  cos  (n2nfot  +  0n)  =  ^  +  e  7(n2jr/o^n)  j 

=  (^Yei0n  j  e?n2*kt  +  e~jn2nfot 


where 


=  Dne>nlnfQt  +  D-ne~jn27Tfot 


=  ~CnA 
D-n  =  \cne-#» 


OO 


(2.92) 


(2.93) 


Recall  that  g(t)  is  given  by 

g(0  =  Co  +  ^  cos  (nlnfot  +  0„) 

n=  1 

Use  of  Eq.  (2.92)  in  the  foregoing  equation  (and  letting  C0  =  D0)  yields 

OO 

g(t)  =  Do  +  ^  ^D„e,n2irA'  +  D-ne~jn27Tf°'\ 

n=\  V  / 

OO 

=  A)  +  ^  D„e’n2n^t 

n=-oo(n^0) 

which  is  precisely  equivalent  to  Eq  (2  90)  derive  „Q  c  . 

connection  between  the  coefficients  of  the  trio  d  Equatlon  (2  93>  shows  the  cloSC 
Observe  the  compactness  of  expressions  (§2  90 w!  exponential  Fourier  serieS' 

sions  corresponding  to  the  trigonometric  Fr,  \and  (2.91)  and  compare  them  withexpres- 

dearly  the  principJvirtue  SSSSST Se"eS'  ?**. tW0  equations  demonstrate  very 
compact.  Second,  the  mathematical  exnre«  Series‘ First’ the  form  of  the  series  is  more 

compact.  It  is  much  more  convenient  to  h  *  hi  °L  eriving  the  coefficients  of  the  series  is  also 
one.  In  system  analysis  also,  the  exponenti  *!  f  6  *  C  exPonent>al  series  than  the  trigonometric 
ric  form.  For  these  reason  we  shall  use  ex  orm  P’’oves  more  convenient  than  the  trigonomet- 
ot  signals  in  the  rest  of  the  book  F  fentia  (rather  than  trigonometric)  representation 
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Example  2.10  Find  the  exponential  Fourier  series  for  the  signal  in  Fig.  2.19b  (Example  2.7). 
In  this  case,  7q  =  7r,  2nfo  =  2n/To  =  2,  and 


<p(t)  =  ^  Dnei 


2m 


where 


and 


Dn  =  ±  f  <p(t)e-*"dt 

*0  Jt0 

=  1  re-^e-^dt 
n  Jo 

= ~  r  e-(i+j2n)'  dt 
n  Jo 


-1 


n  (3  +j2nj 

0.504 
1  +  jAn 


7—(j+j2n)t 


jlnt 


= 0  504  [‘ +  +  rhs**4' + 


1 


1  +J12 


e*6’  + 


+ 


1  ~j 4 


e-j*  + 


1  -ys 


e-j41  + 


1  -yi2 


e-'6'  + 


(2.94) 


(2.95a) 


(2.95b) 


Observe  that  the  coefficients  Dn  are  complex.  Moreover,  D„  and  D_„  are  conjugates  as 
expected  [see  Eq.  (2.93)]. 


Exponential  Fourier  Spectra 

In  exponential  spectra,  we  plot  coefficients  Dn  as  a  function  of  co.  But  since  Dn  is  complex  in 
general,  we  need  two  plots:  the  real  and  the  imaginary  parts  of  Dn  or  the  amplitude  (magnitude) 
and  the  angle  of  D„.  We  prefer  the  latter  because  of  its  close  connection  to  the  amplitudes  and 
phases  of  corresponding  components  of  the  trigonometric  Fourier  series.  We  therefore  plot 
|D„|  versus  w  and  ZD„  versus  co.  This  requires  that  the  coefficients  D„  be  expressed  in  polar 

form  as  \Dn\eJ^D*. 

Comparison  of  Eqs.  (2.76a)  and  (2.91)  (for  n  =  0)  shows  that  Do  =  <*o  =  Co.  Equation 
(2.93)  shows  that  for  a  real  periodic  signal,  the  twin  coefficients  Dn  and  D_„  are  conjugates. 
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Thus, 


|D„|  =  |D_n| 

—  1  04 

II 

0 

(2.96a) 

ZD„  =  6„ 

and 

1 

II 

e 

1 

Q 

N 

(2.96b) 

D«  =  \D„\ej6" 

and 

D-n  =  \Dn\e~je” 

(2.97) 

Note  that  \D„  \  are  the  amplitudes  (magnitudes)  and  ZD„  are  the  angles  of  various  expo¬ 
nential  components.  From  Eq.  (2.96)  it  follows  that  the  amplitude  spectrum  (|D„|  vs./)  is  an 
even  function  of /  and  the  angle  spectrum  (ZD„  vs./)  is  an  odd  function  of /  when  g(t)  is  a 
real  signal. 

For  the  series  in  Example  2.10  [see  Eq.  (2.93)],  for  instance, 


D0  =  0.504 
0.504 


D|  =  — —  =  0.122e_-'75%°  =>  |D||  =  0.122,  ZD,  =  -75.96° 


1  +  jA 
0.504 


and 


D-\  =  y— -  =  0.122e'7596°  =>  |D_,|  =  0.122,  ZD_i  =  75.96° 


_  0.504  fi70 

=  y^jj  =  0.0625e  1  -■  =>  |D2|  =  0.0625,  ZD2  =  -82.87° 

D~ 2  =  =  °  °625^8-  87  =»  |D_2|  =  0.0625,  ZD_2  =  82.87° 

and  so  on.  Note  that  D„  and  D_„  are  conjugates,  as  expected  [see  Eq.  (2.96)]. 

Figure  2.23  shows  the  frequency  spectra  (amplitude  and  angle)  of  the  exponential  Fourier 
senes  for  the  periodic  signal  <p{t)  in  Fig.  2.19b. 

„  i™"es,in,8  fea,“res  of  ,hese  speclra'  Rm- lhe  spectra  exist  for  positive 

“  “  “f”™  val“f  «t frequency).  Second,  the  amplitude  spectrum  is  an  even 

between  th  s  „  !  a"el'sP'®'™  lsan  odd  lu,lclln"  of/.  Finally,  we  see  a  close  connection 
rtrX  ^d T2  '  SPM,ra  °f  “"“‘*>"*"8  trigonometric  Fourier  scries  fo, 
spectra  IC  and  »  )  with  fqU'"'OI"r  %al  shows  ,he  connection  between  the  trigonometric 
rSfcaH?  tah  1  Tm  SP“'ra  <ID>I  ln“  ZD">  Th'  *  components  Do  and  Co 
the  trigonometric  ?*»•  ™  »  “f 

is  identical  to  the  trigonometric  phase  sictnin,  h  1  e*P°"nllal  an-'le  spectrum  ID, 
exponential  s*ctra  merely  by  a'n”  “ 

"  *l;l*  noes  Nepative  Frequency  Mean0 
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Figure  2.23 

Exponential 

Fourier  spectra 
for  the  signal  in 

Fig.  2. 1 9a. 
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of  a  negative  frequency  -/o  can  be  expressed,  by  borrowing  a>o  =  2nfo,  as 

cos  (—coot  +  9)  =  cos  (coot  —  9) 

This  clearly  shows  that  the  frequency  of  a  sinusoid  cos  (—coot  +  9)  is  |cuo|,  which  is  a  positive 
quantity.  The  common  ideas  that  a  frequency  must  be  positive  comes  from  the  traditional 
notion  that  frequency  is  associated  with  a  real-valued  sinusoid  (such  as  a  sine  or  a  cosine). 
In  reality,  the  concept  of  frequency  for  a  real-valued  sinusoid  describes  only  the  rate  of  the 
sinusoidal  variation,  without  concern  for  the  direction  of  the  variation.  This  is  because  real¬ 
valued  sinusoidal  signals  do  NOT  contain  information  on  the  direction  of  its  variation. 

The  concept  of  negative  frequency  is  meaningful  only  when  we  are  considering  complex 
sinusoids  for  which  the  rate  and  the  direction  of  variation  are  meaningful.  Observe  that 

_  cos  ^  ±  j  sjn 

This  relationship  clearly  shows  that  positive  or  negative  a>o  both  lead  to  periodic  variation  of 
the  same  rate.  However,  the  resulting  complex  signals  are  NOT  the  same.  Because  | e±jm'  1=1, 
both  e+jw°'  and  are  unit  length  complex  variables  that  can  be  shown  on  the  complex 

plane.  We  illustrate  the  two  exponential  sinusoids  as  unit  length  complex  variables  that  are 
varying  with  time  t  in  Fig.  2.24.  Thus,  the  rotation  rate  for  both  exponentials  e±JW°'  is  |cuo|.  It 
is  clear  that  for  positive  frequency,  the  exponential  sinusoid  rotates  counterclockwise,  whereas 
for  negative  frequency,  the  exponential  sinusoid  rotates  clockwise.  This  illustrates  the  actual 
meaning  of  negative  frequency. 

There  exists  a  good  analogy  between  positive/negative  frequency  and  positive  and  negative 
velocity.  Just  as  people  are  reluctant  to  use  negative  velocity  in  describing  a  moving  object, 
they  are  equally  unwilling  to  accept  the  notion  of  “negative”  frequency.  However,  once  we 
understand  that  negative  velocity  simply  refers  to  both  the  negative  direction  and  the  actual 
speed  of  a  moving  object,  negative  velocity  makes  perfect  sense.  Likewise,  negative  frequency 
does  NOT  describe  the  rate  of  periodic  variation  of  a  sine  or  a  cosine.  It  describes  the  direction 
of  rotation  of  a  unit  length  exponential  sinusoid  and  its  rate  ot  revolution. 
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Figure  2.24 

(a)  Unit  length 


Im 


Im 


complex  variable 
with  positive 
frequency  (rotat¬ 
ing  counterclock¬ 
wise)  and  (b)  unit 
length  complex 
variable  with 
negative 
frequency 
(rotating 


clockwise). 


(b) 


(a) 


Another  way  of  looking  at  the  situation  is  to  say  that  exponential  spectra  are  a  graphical 
representation  of  coefficients  Dn  as  a  function  off .  Existence  of  the  spectrum  at  f  —  —  nfo 


is  merely  an  indication  that  an  exponential  component  e  jn2jTf^  exists  in  the  series.  We  know 
that  a  sinusoid  of  frequency  nco o  can  be  expressed  in  terms  of  a  pair  of  exponentials  e l,U0[)t 


and  e~^najQt  [Eq.  (2.92)].  That  both  sine  and  cosine  consist  of  positive-  and  negative-frequency 
exponential  sinusoidal  components  clearly  indicates  that  we  are  NOT  at  all  able  to  use  coo  alone 
to  describe  the  direction  of  their  periodic  variations.  Indeed,  both  sine  and  cosine  functions 
of  frequency  coq  consist  of  two  equal-sized  exponential  sinusoids  of  frequency  zfccuo-  Thus, 
the  frequency  of  sine  or  cosine  is  the  absolute  value  of  their  two  component  frequencies  and 
denotes  only  the  rate  of  the  sinusoidal  variations. 


Example  2.11  Find  the  exponential  Fourier  series  for  the  periodic  square  wave  w(t)  shown  in  Fig.  2.20a. 


00 


w(t)  =  E  Dnein2nf°' 


where 
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Figure  2.25 

Exponential 
Fourier  spectrum 
of  the  square 
pulse  periodic 
signal. 


Example  2.12  Find  the  exponential  Fourier  series  and  sketch  the  corresponding  spectra  for  the  impulse  train 
Sf0{t)  shown  in  Fig.  2.26a. 

The  exponential  Fourier  series  is  given  by 

00  . 

STo(0  =  J2  Dnein27rf0‘  fo  =  -  (2.98) 

n=—oo 


where 


Dn  =  ±r  f  8T0(t)e-jn2”f°' dt 
To  Jtq 

Choosing  the  interval  of  integration  (^,  and  recognizing  that  over  this  interval 
STo(t)  =  we  write 

1  rT°'2 

D„  =  —  /  8(t)e~jn2nfo'  dt 
To  J-To/2 

In  this  integral,  the  impulse  is  located  at  t  =  0.  From  the  sampling  property  of  the  impulse 
function,  the  integral  on  the  right-hand  side  is  the  value  of  e~jn2n^‘  at  t  =  0  (where  the 
impulse  is  located).  Therefore 


Dn  =  T  (2.99) 

To 

and 

1  00  1 

Sto(')  =  f  E  ein2nf0'  f°  =  V  (2-,00) 

T°  nt'oo  T° 

Equation  (2.100)  shows  that  the  exponential  spectrum  is  uniform  (D„  =  1/T0)  for  all  the 
frequencies,  as  shown  in  Fig.  2.26.  The  spectrum,  being  real,  requires  only  the  amplitude 
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Figure  2.26 

(a)  Impulse  train 
and  (b)  its 
exponential 
Fourier  spectrum. 


plot.  All  phases  are  zero.  Compare  this  spectrum  with  the  trigonometric  spectrum  shown 
in  Fig.  2.22b.  The  dc  components  are  identical,  and  the  exponential  spectrum  amplitudes 
are  half  of  those  in  the  trigonometric  spectrum  for  all /  >  0. 
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ParsevaPs  Theorem  in  the  Fourier  Series 

A  periodic  signal  #(f)  is  a  power  signal,  and  every  term  in  its  Fourier  series  is  also  a  power 
signal.  The  power  of  g(t)  is  equal  to  the  power  of  its  Fourier  series.  Because  the  Fourier 
series  consists  of  terms  that  are  mutually  orthogonal  over  one  period,  the  power  of  the  Fourier 
series  is  equal  to  the  sum  of  the  powers  of  its  Fourier  components.  This  follows  from  Parse  val  s 
theorem.  We  have  already  demonstrated  this  result  in  Example  2.2  for  the  trigonometric  Fourier 
series.  It  is  also  valid  for  the  exponential  Fourier  series.  Thus,  for  the  trigonometric  Fourier 
series 


00 

£0)  Co  -I-  Cfi  cos  (nojQt  -P  0n) 

n=  I 


the  power  of  g(t)  is  given  by 


Pg  —  Co* 


For  the  exponential  Fourier  series 


(2.101) 


00 

g(t)  =  Do  +  ^2  Dne'n‘Uo' 

*=- oo,  n£0 


the  power  is  given  by  (see  Prob.  2.1-7) 


00 

p*=  E  io„i2 

*=-00 


(2.102a) 
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For  a  real  g(t),  |D_„|  =  \D„\.  Therefore 


OO 

Pg=D02  +  2J2\D„\2 

n=\ 


(2.102b) 


Comment:  Parseval’s  theorem  occurs  in  many  different  forms,  such  as  in  Eqs.  (2.70), 
(2.101),  and  (2.102a).  Yet  another  form  is  found  in  the  next  chapter  for  nonperiodic  signals. 
Although  these  forms  appear  different,  they  all  state  the  same  principle:  that  is,  the  square  of 
the  length  of  a  vector  equals  the  sum  of  the  squares  of  its  orthogonal  components.  The  form 
(2.70)  applies  to  energy  signals,  the  form  (2.101)  applies  to  periodic  signals  represented  by 
the  trigonometric  Fourier  series,  and  the  form  (2.102a)  applies  to  periodic  signals  represented 
by  the  exponential  Fourier  series. 


2.10  MATLAB  EXERCISES 

In  this  section,  we  provide  some  basic  MATLAB  exercises  to  illustrate  the  process  of  signal 
generation,  signal  operations,  and  Fourier  series  analysis. 

Basic  Signals  and  Signal  Graphing 

Basic  functions  can  be  defined  by  using  MATLAB’s  m-files.  We  gave  three  MATLAB  programs 
that  implement  three  basic  functions  when  a  time  vector  t  is  provided: 

•  ustep  .  m  implements  the  unit  step  function  u(t) 

•  rect .  m  implements  the  standard  rectangular  function  rect(f) 

•  triangl  .m  implements  standard  triangle  function  A(/) 


%  (file  name:  ustep. m) 

%  The  unit  step  function  is  a  function  of  time  't'  . 

%  Usage  y  =  ustep (t) 

% 

%  ustep  (t)  =0  if  t  <  0 

%  ustep(t)  =1,  if  t  >=  1 

% 

%  t  -  must  be  real -valued  and  can  be  a  vector  or  a  matrix 
% 

function  y=ustep(t) 

y  =  ( t >=0 ) ; 

end 


%  (file  name:  rect.m) 

%  The  rectangular  function  is  a  function  of  time  't'  . 

% 

%  Usage  y  =  rect(t) 

%  t  -  must  be  real -valued  and  can  be  a  vector  or  a  matrix 
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Figure  2.27 

Graphing  a 
signal. 


^rm  lime  domain 


%  rect(t)  =  1,  if  |t|  <  0.5 

%  rect(t)  =  0,  if  |t|  >  0.5 

% 

function  y=rect(t) 

y  = (sign(t+0 .5) -sign(t-0 .5)  >0)  ; 

end 


%  (file  name:  triangl.m) 

%  The  triangle  function  is  a  function  of  time  't' 

% 

%  triangl ( t )  =  l-|t|,  if  |t|  <  l 
%  triangl (t)  =  0,  if  |tj  >  l 

% 

%  Usage  y  =  triangl (t) 

%  t  -  must  be  real -valued  and  can  be  a  vector  or  a  matrix 

function  y=triangl(t) 

y=  (1-abs (t) ) . * (t>=-i) . * (t<l) ; 

end 


We  now  show  how  to  use  MATLAR  tn  o.n.„la  ,  . 

P-ognm,  s iggraf  . «.  I„  ,h,s  e*an,ple.  4  con^c.ZptaTst^'  'h,°U8h  " 

y(t)  =  exp(— f)  sin  (6nt)u(t  +  \) 

The  resulting  figure  is  shown  in  Fig.  2.27. 


(file  name:  siggraf.m) 

To  graph  a  signal,  the  first  sten  i „  . 
the  x-axis  and  the  y-axis  to  plot  °  determine 

%  We  can  first  decide  the  length  of  x-axi*  *  , 

t- 1-2: 0.01:3];  %  „t„  is  f  3X18  to  Pl°t 

%  Then  evaluate  the  signal  over  theTrange^f ' inCrement 

a  t  to  plot 


2. 10  MATLAB  Exercises  71 


y=exp ( -t) . *sin  (10*pi*t)  .*ustep(t+l) ; 

figure (1);  f igl=plot (t , y) ;  %  plot  t  vs  y  in  figure  1 

set ( f igl , ’ Linewidth' , 2 ) ;  %  choose  a  wider  line-width 

xlabel('\it  t');  %  use  italic  ' t '  to  label 

x-axis 

ylabel  ( ' {\bf  y}({\it  t  })');  %  use  boldface  »y'  to  label  y-axis 

title!' {\bf  y}_{\rm  time  domain}'); 

%  can  use  subscript 


Signal  Operations 

Some  useful  signal  operations  introduced  in  Sec.  2.3  can  be  implemented  and  illustrated  by 
means  of  MATLAB.  We  provide  the  example  program  sigtransf  .m  to  demonstrate  the 
basic  operations.  First,  we  generate  a  segment  of  an  exponentially  decaying  function 

y(0  =  exp(-|r|/4)[«(/)  -  u(t  -  4)] 

We  then  apply  time  scaling  and  time  shifting  to  generate  new  signals 

yi  (t)  =  y(2 1)  y2(t)  =  y(t  +  2) 

Finally,  time  scaling,  time  inversion,  and  time  shifting  are  all  utilized  to  generate  a  new  signal 

73(0  =  y( 2  -  It) 

The  original  signal  y(t)  and  its  transformations  are  all  given  in  Fig.  2.28. 


%  (file  name  sigtransf. m) 

%  To  apply  a  transform  time-domain  transformation  on  a  signal 
%  y=g(t)#  simply  redefine  the  x-axis 
%  a  signal,  the  first  step  is  to  determine 
%  the  x-axis  and  the  y-axis  to  plot 
%  We  can  first  decide  the  range  of  ' t '  for  g(t) 

t= [-3 : 0 . 002 : 5]  ;  %  "t"  is  from  -3  to  5  in  0.002  increment 
%  Then  evaluate  the  signal  over  the  range  of  "t"  to  plot 

y=exp (-abs (t) /4) . * (ustep (t) -ustep (t-4) ) ; 

%  Form  the  corresponding  signal  vector 
%  y=g(t)  =  exp(-t)  sin (10*pi*t )  [u (t ) -u (t-4) ] 


figure (1) ; 

subplot (221) ; f ig0=plot (t,y) ; 
set (f igO , ' Linewidth' ,2) ; 
xlabel('\it  t');  %  use 

ylabel  (' {\bf  y} ({\it  t  }) ') ; 


%  plot  y  vs  t  in  subfigure  1 
%  choose  a  wider  line-width 
italic  't'  to  label  x-axis 
%  use  boldface  'y'  to  label  y-axis 
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Figure  2.28 

Examples  of 
basic  signal 
operations. 


Original  signal  y(t) 


t 


Time  scaling  y(2t) 


t 


Time  shifting  y(t+2)  y{2-2t) 


'  t 


% 

% 


title ( 'original  signal  y(t)'); 

Now  we  evaluate  the  dilated  signal 

%  Scale  in  time 

Form  the  corresponding  signal  vector 

yl=exp ( -abs ( t 1 ) /4 ) . * (ustep (tl) -ustep (tl-4 ) ) ; 
figure (1); 

subplot  (222 )  ;  f igl=plot  (t , yl)  ;  %  piot  y  vs  t  in  subfigure  2 

set (figl, ' Linewidth' ,2) ;  %  choose  a  wider  line-width 

X  T  '  ^  t,)?,  ,  %  USe  italic  *t'  to  label  x-axis 

ylabel ( ' { \bf  y_l}({\it  t  })');  %  use  boldface 

%y'  to  label  y-axis 

title ('time  scaling  y(2t),); 


% 

% 


Now  we  evaluate  the  dilated  signal 

*  shift  in  time 

Form  the  corresponding  signal  vector 

y2  =exp ( - abs ( t2 ) /4 ) . * (ustep ( t2 ) -ustep ( t2 - 4 ) ) 
figure (l); 


subplot (223) ; f ig2=plot (t, y2) ; 
set(fig2, 'Linewidth' ,2) ; 
xlabel('\it  t');  %  use 

ylabel ('(\bf  y_2}({\it  t  })');  - 

Y'  to  label  y-axis 
title('time  shifting  y(t+2)r); 


■s  plot  y  vs  t  in  subfigure  2 
•s  choose  a  wider  line-width 
italic  't'  to  label  x-axis 
%  use  boldface 
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%  Now  we  evaluate  the  dilated  signal 
t3=2 - 1*2  •  o  .  . 

0  #  *  time- reversal +Scale  in  time 

o  Form  the  corresponding  signal  vector 

y3=exp(-abs ( t 3 ) /4) .* (ustep(t3) -ustep(t3-4) ) ; 
figure (1) ; 

subplot (224) ;fig3=plot(t,y3);  %  plot  y  vs  t  in  subfigure  2 

set(fig3,  Linewidth' , 2 ) ;  %  choose  a  wider  line-width 

xlabel ( ' \it  t');  %  use  italic  ’t'  to  label  x-axis 

ylabel  ( ' {\bf  y_3}({\it  t  })');  %  use  boldface 

'y'  to  label  y-axis 
title ( 'y (2-2t) ' ) ; 

subplot (221) ;  axis ([-3  5  -.5  1.5]);  grid; 
subplot (222) ;  axis ((-3  5  -.5  1.5]);  grid; 
subplot  (223) ;  axis ([-3  5  -.5  1.5]);  grid; 
subplot  (224) ;  axis ([-3  5  -.5  1.5]);  grid; 


Periodic  Signals  and  Signal  Power 

Periodic  signals  can  be  generated  by  first  determining  the  signal  values  in  one  period  before 
repeating  the  same  signal  vector  multiple  times. 

In  the  following  MATLAB  program,  PfuncEx.m,  we  generate  a  periodic  signal  and 
observe  its  behavior  over  2 M  periods.  The  period  of  this  example  is  T  =  6.  The  program  also 
evaluates  the  average  signal  power,  which  is  stored  as  a  variable  y_power,  and  the  signal 
energy  in  one  period,  which  is  stored  in  the  variable  y_energyT. 


%  (file  name:  PfuncEx.m) 

%  This  example  generates  a  periodic  signal,  plots  the  signal 
%  and  evaluates  the  averages  signal  power  in  y_power  and  signal 
%  energy  in  1  period  T:  y_energyT 

echo  of f; clear; cl f; 

%  To  generate  a  periodic  signal  g_T(t), 

%  we  can  first  decide  the  signal  within  the  period  of  'T'  for  g(t) 
Dt=0 . 002 ;  %  Time  interval  (to  sample  the  signal) 

T=6;  %  period=T 

M=3;  %  To  generate  2M  periods  of  the  signal 

t= [0 : Dt : T-Dt] /  %"t"  goes  for  one  period  [0,  T]  in  Dt  increment 

%  Then  evaluate  the  signal  over  the  range  of  "T" 

y=exp ( -abs (t) /2) . *sin (2*pi*t ) .*(ustep(t) -ustep(t-4) ) ; 

%  Multiple  periods  can  now  be  generated. 
time=  []  ; 
y_periodic=  []  ; 
for  i=-M : M- 1 , 

time= [time  i*T+t] ; 
y__periodic=  [y_periodic  y]  ; 

end 

figured)  ;  fy=plot  (time,  y_periodic)  ; 
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Figure  2.29  | 

Generating  a 
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set (fy, 'Linewidth' ,2) ;xlabel('  /' ) ; 
echo  on 

%  Compute  average  power 

y_power=sum(y_periodic*y_periodic' )  *Dt/  (max (time)  -min  (time)  ) 

%  Compute  signal  energy  in  1  period  T 
y_energyT=sum(y.*conj (y) ) *Dt 


The  program  generates  a  figure  of  periodic  signal  in  Fig.  2.29  and  numerical  answers: 


Y_P°wer  = 

0.0813 


y_energyT  = 
0.4878 


Signal  Correlation 

The  concept  of  signal  correlation  introduced  in  Sec  •  , 

-  ~~  F, The 


'  name:  sign_cor.m) 

clear 

Dt.0.01,  %dti^eiScr^^"*k“ithi"  the  P"iod  °£  'T'  tor  9(cl 


9_5(t) ; 
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Figure  2.30 

Signals  from 
Example  2.6. 
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T-6.0;  %  time  duration  =  T 

t=[-l:Dt:T];  %”t"  goes  between  [-1,  T]  in  Dt  increment 

*  Then  evaluate  the  signal  over  the  range  of  "t"  to  plot 
x=ustep (t) -ustep (t-5) ; 
gl=0.5* (ustep (t) -ustep (t-5) ) ; 
g2=- (ustep (t) -ustep (t-5) ) ; 
g3=exp(-t/5) .* (ustep (t) -ustep (t-5) ) ; 
g4=exp ( -t) .* (ustep (t) -ustep (t-5) ) ; 
g5=sin (2*pi*t) .* (ustep(t) -ustep(t-5) ) ; 
subplot (231) ;  sigl=plot (t,x, 'k' ) ; 
xlabel ('/');  ylabel ( '  x (  /  )  ' )  ;  %  Label  axis 

set (sigl, 'Linewidth' ,2) ;  %  change  linewidth 

axis  ([-.5  6  -1.2  1.2]);  grid  %  set  plot  range 

subplot (232) ;  sig2=plot(t,gl, 'k' ) ; 
xlabel ( '  t' ) ;  ylabel ( '  g_l  ( / )  ' )  , 
set (sig2 , 'Linewidth' ,2) ; 
axis (  [- . 5  6  -1.2  1.2]);  grid 
subplot (233) ;  sig3=plot (t,g2, >k' ) ; 
xlabel ( '  r ) ;  ylabel ( '  g_2  (/)'); 
set (sig3, 'Linewidth' ,2) ; 
axis ([-.5  6  -1.2  1.2]);  grid 
subplot (234) ;  sig4=plot (t,g3 , 'k' ) ; 
xlabel ( '  t' ) ;  ylabel ( '  g_ 3  (/)'); 
set  (sig4, 'Linewidth' ,2) ; 
axis ([-.5  6  -1.2  1.2]);  grid 
subplot (235) ;  sig5=plot (t , g4 , 'k' ) ; 
xlabel ( '  f  ) ;  ylabel ( '  g_4 ( t )  ' )  ; 
set  (sig5, 'Linewidth' , 2) ;grid 
axis ( [- . 5  6  -1.2  1.2]); 
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subplot  (236) ;  sig6=plot (t ,  g5,  'k'  )  ; 
xlabel  ('/');  ylabel  ( #  g_S  (/)'); 
set (sig6 , 'Linewidth' , 2) ;grid 
axis ([-.5  6  -1.2  1.2]); 

%  Computing  signal  energies 
E0=sum (x . *conj (x) ) *Dt ; 

El=sum(gl . *conj (gl) ) *Dt; 

E2=sum(g2 . *conj (g2) ) *Dt; 

E3=sum(g3 . *conj (g3) ) *Dt; 

E4=sum(g4 . *conj (g4) ) *Dt; 

E5=sum(g5 . *conj (g5) ) *Dt; 

cO=sum(x. *conj (x) ) *Dt/ (sqrt (E0*E0) ) 
cl=sum (x . *conj (gl) ) *Dt/ (sqrt (E0*E1)  ) 
c2  =  sum (x. *conj (g2) ) *Dt / (sqrt (E0*E2)  ) 
c3  =  sum (x . *conj (g3) ) *Dt/ (sqrt (E0*E3)  ) 
c4=sum (x . *conj (g4) ) *Dt/ (sqrt (E0*E4) ) 
c5=sum (x . *conj (g5) ) *Dt/ (sqrt (E0*E5)  ) 


The  six  correlation  coefficients  are  obtained  from  the  program  as 


cO  = 

1 

cl  = 

1 

c2  = 

-1 

c3  = 

0.9614 

C4  = 

0.6282 

c5  = 

8 . 6748e- 17 


numerical  Computation  of  Coefficients  D 

There  are  several  ways  to  numerically  compute  the  P  ■ 

use  MATLAB  ,o  presen,  ,w„  dillerem  me,hX  SSJLT"'  ““  «*«*««*  D -  We 
evaluate  the  Fourier  series  bv  means  nfn  ■  ne  We  Wl  *  sh°w  how  to  numeric 

y  TCa,,S  °f  numenc‘‘l  lntegration  and  fas,  Fourier  rransfomr. 


2.10  MATLAB  Exercises  77 


Numerical  Integration:  The  direct  method  is  to  carry  out  a  direct  numerical  integration 
ot  Eq.  (2.91)  by  defining  the  symbolic  function  g(t)  along  with  its  period.  The  first  step  is  to 
define  the  symbolic  expression  of  the  signal  g(t)  under  analysis.  We  use  the  triangle  function 
A(r)  in  the  following  example. 


%  (funct_tri.m) 

%  A  standard  triangle  function  of  base  -1  to  1 
function  y  =  funct_tri(t) 

%  Usage  y  =  func__tri  ( t ) 

%  t  =  input  variable  i 
y=( (t>-l)-(t>l) ) . * (1-abs ( t ) ) ; 


Once  the  file  funct_tri  .m  has  defined  the  function  y  =  g(t ),  we  can  directly  carry 
out  the  necessary  integration  of  Eq.  (2.91)  for  a  finite  number  of  Fourier  series  coefficients 
\Dn,  n  =  —N,  ...,—1,0,  1,  . . . ,  N).  We  provide  a  MATLAB  program  called  FSexp_a .  m 
to  evaluate  the  Fourier  series  of  A(r/2)  with  period  [a,  b]  ( a  =  -2.  b  =  2).  In  this  example, 
Af  =  1 1  is  selected.  Executing  this  short  program  in  MATLAB  will  generate  Fig.  2.3 1  with 
both  amplitude  and  angle  of  Dn. 


%  (file  name:  FSexp_a.m) 

%  This  example  shows  how  to  numerically  evaluate 
%  the  exponential  Fourier  series  coefficients  Dn 
%  directly. 

%  The  user  needs  to  define  a  symbolic  function 
%  g(t).  In  this  example,  g ( t ) =funct_tri ( t ) . 

echo  off;  clear;  elf; 

j=sqrt(-l);  %  Define  j  for  complex  algebra 
b=2;  a=-2;  %  Determine  one  signal  period 

tol=l.e-5;  %  Set  integration  error  tolerance 
T=b-a;  %  length  of  the  period 

N=ll;  %  Number  of  FS  coefficients 

%  on  each  side  of  zero  frequency 
Fi= [-N:N] *2*pi/T;  %  Set  frequency  range 

%  now  calculate  D_0  and  store  it  in  D(N+1); 

Func=  @(t)  funct_tri (t/2) ; 

D (N+l ) =1/T*quad (Func ,  a,  b, tol ) ;  %  Using  quad.m  integration 

for  i=l:N 


%  Calculate  Dn  for  n=l,...,N  (stored  in  D(N+2)  ...  D(2N+1) 

Func=  @(t)  exp ( - j  *2*pi*t*i/T) . *funct_tri ( t/2 ) ; 

D(i+N+1) =1/T*quad (Func, a, b, tol) ; 

%  Calculate  Dn  for  n=-N,  .  .  . , -1  (stored  in  D ( 1 )  ...  D(N) 

Func=  @ (t)  exp ( j *2*pi*t* (N+l-i) /T) . *func_tri (t/2) ; 

D ( i ) =l/T*quad(Func, a, b, tol) ; 

end 

figure (1 ) ; 

subplot (211 ) ; sl=stem( [-N:N] , abs (D) ) ; 
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Figure  2.31 

Exponential 
Fourier  series 
coefficients  of  a 
repeated  A(//2) 
with  period 
7  =  4  showing 
(a)  amplitude 
and  (b)  angle  of 
Du • 


Amplitude  of  Dn 


title ( 'Amplitude  of  D  n' ) 
subplot  (212,  [■»,„,  ,,„gle(D, ,  . 

set  (s2,  'Linewidth'  ,2) ;  ylabel('<£>  „■>' 
title ('Angle  of  D_n' )  s  ~  1 
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samples  of  g(t),  consider  Eq.  (2.91) 


(2.103) 


where  g(kTs)  is  the  kth  sample  of  g(r)  and 


fto  =  (ooTs  N0  =  — 

*  S 


s 


(2.104) 


In  practice,  it  is  impossible  to  make  Ts  — >  0  in  computing  the  right-hand  side  of  Eq. 
(2.103).  We  can  make  it  small,  but  not  zero,  because  this  would  increase  the  data  without 
limit.  Thus,  we  shall  ignore  the  limit  on  Ts  in  Eq.  (2. 103)  with  the  implicit  understanding  that 
Ts  is  reasonably  small.  This  results  in  some  computational  error,  which  is  inevitable  in  any 
numerical  evaluation  of  an  integral.  The  error  resulting  from  nonzero  Ts  is  called  the  aliasing 
error,  which  is  discussed  in  more  detail  in  Chapter  6.  Thus,  we  can  express  Eq.  (2.103)  as 


(2.105) 


This  equation  shows  that  D„+Nq  =  D„.  Hence,  Eq.  (2.105)  yields  the  Fourier  spectrum  D„ 
repeating  periodically  with  period  N0.  This  will  result  in  overlapping  of  various  components. 
To  reduce  the  effect  of  such  overlapping,  we  need  to  increase  N0  as  much  as  practicable.  We 
shall  see  later  (Sec.  6.1)  that  the  overlapping  appears  as  if  the  spectrum  above  the  ((Vo/2)nd 
harmonic  has  folded  back  at  this  frequency  Nocoq/I.  Hence,  to  minimize  the  effect  of  this 
spectral  folding,  we  should  make  sure  that  D„  for  n  >  No/2  is  negligible.  The  DFT  (or  FFT) 
gives  the  coefficients  D„  for  n  >  0  up  ton  =  N0/2.  Beyond/;  =  N0/2,  the  coefficients  represent 
the  values  for  negative  n  because  of  the  periodicity  property  Dn+N()  =  Dn.  For  instance, 
when  No  =  32,  D\i  =  D-is,  D |g  =  D_  14,  ...,£>31  =  D_j.  The  cycle  repeats  again  from 
n  =  32  on. 

We  can  use  the  efficient  fast  Fourier  transform  (FFT)  to  compute  the  right-hand  side  of 
Eq.  (2.105).  We  shall  use  MATLAB  to  implement  the  FFT  algorithm.  For  a  this  purpose,  we 
need  samples  of  g(t)  over  one  period  starting  at  /  =  0.  For  a  general  FFT  algorithm,  it  is  also 
preferable  (although  not  necessary)  that  No  be  a  power  of  2;  that  is.  No  =  2"',  where  m  is  an 
integer. 

COMPUTER  EXAMPLE 

Compute  and  plot  the  trigonometric  and  exponential  Fourier  spectra  for  the  periodic  signal  in  Fig.  2. 19b 
(Example  2.7). 

The  samples  of  g(t)  start  at  /  =  0  and  the  last  (/V0th)  sample  is  at  t  =  T0  -  Ts  (the  last  sample  is 
not  at  t  =  To  because  the  sample  at  t  =  0  is  identical  to  the  sample  at  t  =  7q.  and  the  next  cycle  begins 
at  t  —  To).  At  the  points  of  discontinuity,  the  sample  value  is  taken  as  the  average  of  the  values  of  the 
function  on  two  sides  of  the  discontinuity.  Thus,  in  the  present  case,  the  first  sample  (at  t  =  0)  is  not  1  but 
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(f-*/2  |  )/2  =  0.604.  To  determine  No,  we  require  £>«  for  n  >  No/2  to  be  relatively  small.  Because 

g(i)  has  a  jump  discontinuity,  Dn  decays  rather  slowly  as  l/n.  Hence,  choice  ot  No  =  200  is  acceptable 
because  the  (/V0/2)nd  ( 100th)  harmonic  is  about  0.01  (about  1%)  of  the  fundamental.  However,  we  also 
require  No  to  be  power  of  2.  Hence,  we  shall  take  Nq  =  256  =  2*\ 

We  write  and  save  a  MATLAB  program  c2 1 .  m  to  compute  and  plot  the  Fourier  coefficients. 


%  (c21.m) 

%M  is  the  number  of  coefficients  to  be  computed 
T0=pi ;N0=256 ;Ts=T0/N0  ;M=10 ;  %  whatever 

t=0:TS:Ts* (N0-1) ;  t=t'; 
g=exp(-t/2) ;g (1) =0 . 604 ; 

%  fft(g)  is  the  FFT  [the  sum  on  the  right-hand  side  of 
Eq.  (2.105)] 

Dn=f f t (g) /NO 

[Dnangle, Dnmag] =cart2pol (real (Dn) , imag (Dn) ) ; 
k=0: length (Dn) -l;k=k' ; 
subplot (211) , stem (k, Dnmag) 
subplot (212) ,  stem (k, Dnangle) 


To  compute  trigonometric  Fourier  series  coefficients,  we  recall  the  program  c2 1 .  m  along 
with  commands  to  convert  D„  into  C„  and  6n. 


C21 ; elf 

C0=Dnmag (1) ;  Cn=2*Dnmag (2 :M) ; 

Amp li tudes = [C0;Cn] 

Angles=Dnangles (1 :M) ; 
Angles=Angles* (180/pi) ; 
disp( 'Amplitudes  Angles') 
[Amplitudes  Angles] 

%  To  Plot  the  Fourier  coefficients 
k=0 : length (Amplitudes) -1 ;  k=k' • 
subplot (211) , stem(k, Amplitudes) 
subplot (212) ,  stem(k/ Angles) 

ans  = 


Amplitudes  Angles 

0.5043 

0 

0.2446 

-75.9622 

0.1251 

-82.8719 

0.0837 

-85.2317 

0.0629 

-86.4175 

0.0503 

-87.1299 

0.0419 

-87.6048 

0.0359 

-87.9437 

0.0314 

-88.1977 

0.0279 

-88 . 3949 
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PROBLEMS 


2.1-1  Find  the  average  power  of  the  signals  in  Fig.  P2.1-1. 


Figure  P.2.1-1 


2.1-2  (a)  Find  Ex  and  Ey,  the  energies  of  the  signals  x(t)  and  y(t)  shown  in  Fig.  P2.1-2a.  Sketch  the 
signals  x(f)  +  y(t)  and  x(t)  —  y(t).  Show  that  the  energies  of  either  of  these  two  signals  is  equal 
to  Ex  +  Ey.  Repeat  the  procedure  for  signal  pair  in  Fig.  P2.  l-2b. 


(b)  Now  repeat  the  procedure  for  signal  pair  in  Fig.  P2.1-2c.  Are  the  energies  of  the  signals 
x(t)  +  y(t)  and  x(t)  -  y(t)  identical  in  this  case? 


Figure  P.2.1-2 
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Figure  P.2.1- 


Figure  P.2.1-6 


2. 1  -3  Redo  Example  2.2a  to  find  the  power  of  a  sinusoid  C  cos  (cuq/ + 6)  by  averaging  the  signal  energy 
over  one  period  Itt/cdq  (rather  than  averaging  over  the  infinitely  large  interval). 

2.1- 4  Show  that  if  co  j  =  the  power  of  g(t)  =  C\  cos  {co\t  +  #i)  +  Ci  cosiest  -f  (H)  is  [Cj2  + 

C22  4*  2C|  C2  cos  (0\  -  #2)1/2,  which  is  not  equal  to  (C\ 2  +  C22)/2. 

2.1- 5  Find  the  power  of  the  periodic  signal  g(/)  shown  in  Fig.  P2.1-5.  Find  also  the  powers  and  the  rms 

values  of  (a)  -#(/);  (b)  2 g(t)\  (c)  cg(f).  Comment. 


2.1-6 


Find  the  energies  of  the  signals  shown  in  Fig.  P2.1-1 
change,  time  shift,  or  doubling  of  the  signal.  What 
multiplied  by  kl 


•  Comment  on  the  effect  on  energy  of  sign 
is  the  effect  on  the  energy  if  the  signal  is 


2.1-7  Show  that  the  power  of  a  signal  g(t)  given  by 


g(0  =  D*?'"*' 

k=m 


wi  ^  to*  for  all  i  5*  k 


is  (Parseval's  theorem) 


n 


=  £  \Dk  I2 

k=m 


Figure  P.2.3- 


Figure  P.2.3-2 
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2.1-8  Determine  the  power  and  the  rms  value  for  each  of  the  following  signals: 


(a)  10  cos  ^  1 00/  +  j) 

(b)  10  cos  (lOOr-l-  +  16  sin  (l50r  + 

(c)  (10-1-2  sin  3/)  cos  10/ 


(d)  10  cos  5/  cos  10/ 

(e)  10  sin  5/  cos  10/ 

(f)  eiat  cos  coQt 


2.1-9 


Find  the  power  and  the  rms  value  for  the  signals  in  (a)  Fig.  P2.8-2a;  (b)  Fig  P2  8-2c* 
(c)  Fig.  P2.8-?* 


2.2-1  Show  that  an  exponential  e  at  starting  at  -oo  is  neither  an  energy  nor  a  power  signal  for  any 
real  value  of  a.  However,  if  a  is  imaginary,  it  is  a  power  signal  with  power  />  =  1  regardless  of 
the  value  of  a. 


2.2- 2  Determine  whether  the  exponential  signal  e~ct  starting  at  -oo  is  an  energy  or  a  power  signal  for 

a  complex  value  c  which  has  nonzero  real  part. 

2.3- 1  In  Fig.  P2.3-I.  the  signal  g,  (t)  =  g(-f).  Express  signals  g2(t).  g3(r),  g4(t),  and  g5(t)  in  terms 

ol  signals  g(t),  gj  (/),  and  their  time-shifted,  time-scaled,  or  time-inverted  versions.  For  instance, 
82(0  =  g(t  —  T)  +  g\(t  —  T)  for  some  suitable  value  of  T.  Similarly,  both  g3(r)  and  g4(r)  can 
be  expressed  as  g{t  ~T)  +  g(t  +  T)  for  some  suitable  value  of  T ;  and  g5(t)  can  be  expressed 
as  g(r)  time-shifted,  time-scaled,  and  then  multiplied  by  a  constant.  (These  operations  may  be 
performed  in  any  order.) 


2.3-2  For  the  signal  g(/)  shown  in  Fig.  P2.3-2, 

(a)  Sketch  the  signals  (i)  g(-t );  (ii)  g(t  -I-  12);  (iii)  g(3/);  (iv)  g( 6  -  It). 

(b)  Find  the  energies  of  the  signals  in  part  (a). 
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Figure  P.2.3-3 


2.3-3  For  the  signal  g(t)  shown  in  Fig.  P2.3-3,  sketch  (a)  g(t  -  4);  (b)  g  (-p^) ;  (c)  g(2t  —  4); 
(d)  g(2  —  t) .  Hint:  Recall  that  replacing  t  with  t  —  T  delays  the  signal  by  T.  Thus.  g(2t  —  4)  is 
g(2t)  with  t  replaced  by  t  -  2.  Similarly,  g( 2  - 1)  is  g(-t )  with  t  replaced  by  t  -  2. 


2.3- 4  II  gu)  is  a  periodic  signal  with  period  T  and  average  power  Pg,  show  what  kind  of  signal  e(at) 

is  and  what  average  power  it  has. 

2.3- 5  For  an  energy  signal  g(,)  with  energy  Eg.  show  that  the  energy  of  any  one  of  the  signals 

is  F  /  8tw'  Td  *  T)  ,S  Eg‘  Sh°W  alS°  that  the  ener?y  of  S(at)  as  well  as  g(at  -  b ) 

oiherhl’  h:S  ShOWS  ,hat  ne',her  time  inversion  nor  time  shifting  affects  signal  energy.  On  the 

the  efft,  C°mPreSS,0n  °ffa  S'®nal  ^  a  a  reduces  the  energy  by  the  factor  a.  What 

...  f'.  ,°"  s,gnal  energy  lf  the  signal  is  (a)  time-expanded  by  a  factor  a  (a  >  I)  and 

(b)  multiplied  by  a  constant  a?  '  l>  dr  u 

2.4- 1  Simplify  the  following  expressions: 


(a) 

/  tan  /  \ 

u+,b> 

(d) 

(b) 

(sets)*"1 

(e) 

(c) 

\e~'  cos  (3r  -  jt/3)]  8(t  +  n) 

(D 

H«r.  Vx  Eq.  (2.18b).  Fo,pan  (f,use  LHopii.r, 

2.4-2  Prove  that 


1) 


S(2t  +  3) 


8(at)  =  -la(f) 

\<*\ 

Hence  show  that 

S((t>)  =  brS(f)  where  <o  =  2  Ttf 

Hint:  Show  that 


/oo 

<t>U)8(at)dt  =  _L 

■°°  la| 


0(0) 
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2.4-3  Evaluate  the  following  integrals: 


(a) 

[OO 

/  g(r  +  a)S(t  -r)dr 

7—oo 

(e) 

ro o 

y  5(/  +  3)e~'  dt 

(b) 

roc 

/  S(T)g(t-T)dz 

7—00 

(f) 

f2 

J  ^(/3  +4)3(1  -t)dt 

(c) 

/•OO 

/  &(t)e~jM'  dt 

7—00 

(g) 

roc 

/  g(2  -  t)S(3  —  t)dt 

7—oo 

(d) 

f  8(t  -  2)  sin  ntdt 

7  —  00 

(h) 

f°°  71 

/  cos  -(x  -  5)5(2* ■- 
7-oo  2 

Hint:  8(x)  is  located  at  x  =  0.  For  example,  8(  1  -  t)  is  located  at  1  -  t  =  0,  that  is,  at  t  =  1 ,  and 
so  on. 

2.5- 1  Derive  Eq.  (2.27)  in  an  alternate  way  by  observing  that  e  =  (g-cx),  and 

|e|2  =(g-cx)(g-cx)  =|g|2  -f-  c2|x|2  -  2cg  •  x 

To  minimize  |e|2,  equate  its  derivative  with  respect  to  c  to  zero. 

2.5- 2  For  the  signal  g(t)  and  x(t)  shown  in  Fig.  P2.5-2,  find  the  component  of  the  form  jc(/)  contained 

in  g(t).  In  other  words,  find  the  optimum  value  of  c  in  the  approximation  g(t)  %  cx(t)  so  that  the 
error  signal  energy  is  minimum.  What  is  the  error  signal  energy? 


2.5- 3  For  the  signal  g(t)  and  x(t)  shown  in  Fig.  P2.5-2,  find  the  component  of  the  form  g(t)  contained 

in  x(t).  In  other  words,  find  the  optimum  value  of  c  in  the  approximation  x(t)  ^  cg(t)  so  that  the 
error  signal  energy  is  minimum.  What  is  the  error  signal  energy? 

2.5- 4  Repeat  Prob.  2.5-2  if  x (/)  is  a  sinusoid  pulse  as  shown  in  Fig.  P2.5-4. 
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2.5-5  Energies  of  the  two  energy  signals  x(t)  and  y(t)  are  Ex  and  Ey ,  respectively. 

(a)  If  x (/)  and  y(t)  are  orthogonal,  then  show  that  the  energy  of  the  signal  .v(r)  +  y(t)  is  identical 
to  the  energy  of  the  signal  jc(/)  -  y(f),  and  is  given  by  Ex  +  £v. 

(b)  Ifjc(/)  andy(/)  are  orthogonal,  find  the  energies  of  signals  C{X(t)+C2y(t)  and  cjjc(/)  —C2y(t). 

(c)  We  define  Exy,  the  correlation  of  the  two  energy  signals  ,r(/)  and  y(t)  as 


If  z(t)  =  x(t)  ±  y(/),  then  show  that 


Ez-Ex+Ey±  ( Exy  +  EyX) 


2.6-1  Find  the  correlation  coefficient  p  between  of  signal  x(t)  and  each  of  the  four  pulses  g,  (/),  g2(t), 
g^U),  and  g4(r)  shown  in  Fig.  P2.6-1.  To  provide  maximum  margin  against  the  noise  along  the 
transmission  path,  which  pair  of  pulses  would  you  select  for  a  binary  communication? 


Figure  P.2.6-1 


(a) 


p  W) 


0.707 


(d) 


0.707 


(e) 


0 


0  0.5 


-0.707 


2.6- 2  Find  the  autocorrelation  function  of  the  si: 

2.6- 3  Find  the  autocorrelation  function  of  the  si. 


signal  u(t)  -  u(t  -  2). 
signal  g(t)  =  exp(—2t)u(f) 


umt  energy  signals  orthogonal 


This  signal  can  be 
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(a)  Determine  the  vector  representation  of  the  following  six  signals  in  this  two-dimensional 


vector  space: 


(i)  g\(t)=lx\(t)-x2(t) 

(ii)  g2(t)  =  -X]  (t)  +  2x2(t) 

(iii)  g3(t)  =  -x2(r) 


(iv)  g4(t)  =x,(r)  +  2x2(r) 

(v)  gsU)  =  2x\(t)  +  x2(t) 

(vi) g6(0  =  3xi(r) 


(b)  Point  out  pairs  of  mutually  orthogonal  vectors  among  these  six  vectors.  Verify  that  the  pairs 
of  signals  corresponding  to  these  orthogonal  vectors  are  also  orthogonal. 

2.7- 2  (a)  For  the  five  signals  in  Fig.  P.2.6-1,  determine  a  set  of  orthonormal  basis  functions  of 

dimension  4. 

(b)  Determine  the  vector  representation  of  the  five  signals  for  the  orthonormal  basis  in  part  (a). 

2.8- 1  If  a  periodic  signal  satisfies  certain  symmetry  conditions,  the  evaluation  of  the  Fourier  series 

components  is  somewhat  simplified.  Show  that: 

(a)  If  g(t)  =  g(-t)  (even  symmetry),  then  all  the  sine  terms  in  the  Fourier  series  vanish  (6, ,  =0). 

(b)  If  g(t)  =  -g(-t)  (odd  symmetry),  then  the  dc  and  all  the  cosine  terms  in  the  Fourier  series 
vanish  (aq  =  an  =  0). 

Further,  show  that  in  each  case  the  Fourier  coefficients  can  be  evaluated  by  integrating 
the  periodic  signal  over  the  half-cycle  only.  This  is  because  the  entire  information  of  one 
cycle  is  implicit  in  a  half-cycle  due  to  symmetry. 

Hint:  If  ge(t)  and  gQ(t )  are  even  and  odd  functions,  respectively,  of/,  then  (assuming  no 
impulse  or  its  derivative  at  the  origin) 


and 


Also  the  product  of  an  even  and  an  odd  function  is  an  odd  function,  the  product  of  two  odd 
functions  is  an  even  function,  and  the  product  of  two  even  functions  is  an  even  function. 


2.8-2  For  each  of  the  periodic  signals  shown  in  Fig.  P2.8-2,  find  the  compact  trigonometric  Fourier 


series  and  sketch  the  amplitude  and  phase  spectra.  If  either  the  sine  or  cosine  terms  are  absent  in 
the  Fourier  series,  explain  why. 

2.8-3  (a)  Show  that  an  arbitrary  function  g(t)  can  be  expressed  as  a  sum  of  an  even  function  ge(t)  and 
an  odd  function  g0(t): 


g(0  =  ge(0  +  g0(t) 


Hint:  g(t)  =  -lg(t)  +  g(-/)]  +  -lg(t)  -  g(-/)] 


ge(0 


go(0 


(b)  Determine  the  odd  and  even  components  of  functions  (i)  u(t);  (ii)  e~at u(t)\  (iii)  e>1 . 


2.8-4  If  the  two  halves  of  one  period  of  a  periodic  signal  are  identical  in  shape  except  that  the  one  is 
the  negative  of  the  other,  the  periodic  signal  is  said  to  have  a  half-wave  symmetry.  If  a  periodic 
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Figure  P.2.8-2 


-7 

-5 

-3 

-1 

0 

1 

3 

5 

7 

t 

-1 


i - •  u 


.•mu- wave 


^nimetry  condition. 


g(0 

In  (his  case,  show  that  all  the  even-numbered  ham™  • 

harmonic  coefficients  are  given  by  n,cs  vanish,  and  that  the  odd-numbered 


an 


f7o/2 


fTo/2 


4  fT 

'~ToJo  ^OcosnajQtdt  and  bn  =  i.  f1  o(t,  . 

0  "  t0J0  8(0  sm  na>ot  dt 

Use  these  results  to  find  the  Fourier  series  for  rh„ 

he  anodic  signals  in  Fig.  P2.8-4. 

2.8-5  Over  a  finite  interval,  a  sienal  ran  h» 

Fourier  series.  For  instance,  if  we  ^  ^onomtixic  (or exponential 

ouner  senes  with  fundamental  frequency  co  o  '  *  ~  '  °Ver  an  interval  0  <  /  <  1  by  f 

, .  a (e„  ,he  “xr  ■  pr /,,) = r 

so  inat  T0  =  n  and  coq  =  2.  If 


over  the 
we  want 
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the  fundamental  frequency  (o0  to  be  4,  we  repeat  the  pulse  every  tt/2  seconds.  If  we  want  the 
series  to  contain  only  cosine  terms  with  coq  =  2,  we  construct  a  pulse/ (r)  =  |/|  over  —  1  <  t  <  1 , 
and  repeat  it  every  n  seconds.  The  resulting  signal  is  an  even  function  with  period  tt.  Hence,  its 
Fourier  series  will  have  only  cosine  terms  with  coq  =  2.  The  resulting  Fourier  series  represents 
f(t)  =  t  over  0  <  t  <  1  as  desired.  We  do  not  care  what  it  represents  outside  this  interval. 
Represent/ (/)  =  /  over  0  <  t  <  1  by  a  Fourier  series  that  has 

(a)  coq  =  %  and  contains  all  harmonics,  but  cosine  terms  only 

(b)  (oq  =  2  and  contains  all  harmonics,  but  sine  terms  only 

(c)  (oq  =  ^  and  contains  all  harmonics,  which  are  neither  exclusively  sine  nor  cosine 

(d)  coo  =  1  and  contains  only  odd  harmonics  and  cosine  terms 

(e)  (oq  =  and  contains  only  odd  harmonics  and  sine  terms 

(f)  coq  =  1  and  contains  only  odd  harmonics,  which  are  neither  exclusively  sine  nor  cosine 

Hint:  For  parts  (d),  (e),  and  (f),  you  need  to  use  half-wave  symmetry  discussed  in  Prob.  2.8-2. 
Cosine  terms  imply  a  possible  dc  component. 

2.9- 1  For  each  of  the  periodic  signals  in  Fig.  P2.8-2,  find  exponential  Fourier  series  and  sketch  the 

corresponding  spectra. 

2.9- 2  A  periodic  signal  g(t)  is  expressed  by  the  following  Fourier  series: 

g(t)  =  3  sin  r  +  cos  +2 cos  ^8/  + 

(a)  Sketch  the  amplitude  and  phase  spectra  for  the  trigonometric  series. 

(b)  By  inspection  of  spectra  in  part  (a),  sketch  the  exponential  Fourier  series  spectra. 

(c)  By  inspection  of  spectra  in  part  (b),  write  the  exponential  Fourier  series  for  g(r). 

2.9- 3  (a)  Sketch  the  signal  g(t)  =  r  and  find  the  exponential  Fourier  series  to  represent  g{t)  over  the 

interval  (—1,  1).  Sketch  the  Fourier  series  cp(t)  for  all  values  of  t. 

(b)  Verify  ParsevaFs  theorem  [Eq.  (2.102a)]  for  this  case,  given  that 


oo  .  4 

El  _ 

«4  =  90 


n=\ 


90 
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2.9-4  (a)  Sketch  the  signal  g(t )  =  I  and  find  the  exponential  Fourier  series  to  represent  g(t)  over  the 
interval  (— 7r,  tt).  Sketch  the  Fourier  series  (pit)  for  all  values  of  t. 

(b)  Verify  Parseval’s  theorem  |Eq.  (2. 102a)]  for  this  case,  given  that 


I  i 

El  7i 

^2  =  T 


n=  1 


2.9-5  Show  that  the  coefficients  of  the  exponential  Fourier  series  of  an  even  periodic  signal  are  real  and 
those  of  an  odd  periodic  signal  are  imaginary. 


Q  ANALYSIS  AND 
O  TRANSMISSION  OF 
SIGNALS 


Electrical  engineers  instinctively  think  of  signals  in  terms  of  their  frequency  spectra  and 
of  systems  in  terms  of  their  frequency  responses.  Even  teenagers  know  about  audio 
signals  having  a  bandwidth  of  20  kHz  and  good-quality  loudspeakers  responding  up  to 
20  kHz.  This  is  basically  thinking  in  the  frequency  domain.  In  the  last  chapter  we  discussed 
spectral  representation  of  periodic  signals  (Fourier  series).  In  this  chapter  we  extend  this 
spectral  representation  to  aperiodic  signals. 


3.1  APERIODIC  SIGNAL  REPRESENTATION  BY 
FOURIER  INTEGRAL 


We  now  apply  a  limiting  process  to  show  that  an  aperiodic  signal  can  be  expressed  as  a 
continuous  sum  (integral)  of  everlasting  exponentials.  To  represent  an  aperiodic  signal  g(t) 
such  as  the  one  shown  in  Fig.  3.1a  by  everlasting  exponential  signals,  let  us  construct  a 
new  periodic  signal  gTo(t)  formed  by  repeating  the  signal  g(t)  every  T0  seconds,  as  shown  in 
Fig.  3. 1  b.  The  period  To  is  made  long  enough  to  avoid  overlap  between  the  repeating  pulses.  The 
periodic  signal  gr0(t)  can  be  represented  by  an  exponential  Fourier  series.  If  we  let  To  -»  oo, 
the  pulses  in  the  periodic  signal  repeat  after  an  infinite  interval,  and  therefore 


lim  gT0(O=gU) 

/()— »OC 


Thus,  the  Fourier  series  representing  gj0(t )  will  also  represent  g(t)  in  the  limit  To 
The  exponential  Fourier  series  forger)  is  given  by 


oo 


57b  (0  =  ^2  D„ejl 


JiuoqI 


(3.1) 


in  which 


(3.2a) 
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2n 

wo  =  —  =  2;r/o 
'0 


over 


is  ,he  same  as  in,efra,ins  *(,) 
co 0 

(3.2c) 


)n~  Tq  f  8^e  jnW°' dt 

l  z100 

=  T0J_J^-J2nnf0,dr 


1 0  «/-oo 

It  is  interesting  to  see  how  the  nature  of  rho 

this  fascinating  behavior  let  us  define  ri  /  >  mm  c^an?es  as  7o  increases.  To  understand 
8  ’  Ct  US  define  G(f  >• a  c°ntinuous  function  of  as 


/OO 

g(t)e-j“'dt 

-oo 
) 

g(t)e-J^ftdt 


-jT 

in  which  <u  =  2ff/.  A  glance  a,  Eqs  (3.2c)  and  (3.3>  then  shows  that 


(3.3 

(3.4 


D"  ~  J}G(nf()) 


(3. 


uniformly  Spaced  a,  <'/r0  times)  the  samples  of  C 

WILe  JS  many  components  (samples)  in  the^  3016,1131  ,requency/o-  so  that  there 
- -  Pectrum.  However,  by  doubling  7o, 

For  the  sake  of  simplicity  we  n 
val.d  for  complex  D„  (or  C(/)].  *  °n  and  theref<«  G(f)  in  Fi  3  2 

** redl*  The  argument,  however,  is  i 


Figure  3.2 

Change  in  the 
Fourier  spectrum 
when  the  period 
7o  in  Fig.  3.1  is 
doubled. 
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— ^  1 1 1 1 1 1 1 1 1 

L 

Ll 

>* 

ll 

1 

ll 

-  Envelope 

T  G(f) 

1  0  1 

Min . , 

0 

/-♦ 

have  halved  the  envelope  (l/7o)G(/),  as  shown  in  Fig.  3.2b.  If  we  continue  this  process 
of  doubling  To  repeatedly,  the  spectrum  progressively  becomes  denser  while  its  magnitude 
becomes  smaller.  Note,  however,  that  the  relative  shape  of  the  envelope  remains  the  same 
[proportional  to  G(/)  in  Eq.  (3.3)].  In  the  limit  as  T0  -+  oo ,/„  ->  0  and  D„  -  0.  This  means 
that  the  spectrum  is  so  dense  that  the  spectral  components  are  spaced  at  zero  (infinitesimal) 
interval.  At  the  same  time,  the  amplitude  of  each  component  is  zero  (infinitesimal).  We  have 
nothing  of  every  thing,  yet  we  have  something!  This  sounds  like  Alice  in  Wonderland ,  but  as 
we  shall  see,  these  are  the  classic  characteristics  of  a  very  familiar  phenomenon.* 

Substitution  of  Eq.  (3.5)  in  Eq.  (3.1)  yields 

*b«>  =  t  (3.6I 

n=-oo  0 

As  T0  -*  oo,/o  =  1  /7o  becomes  infinitesimal  (/0  0).  Because  of  this,  we  shall  use  a  more 

appropriate  notation.  A/ ,  to  replace /q.  In  terms  of  this  new  notation,  Eq.  (3.2b)  becomes 


and  Eq.  (3.6)  becomes 


oo 

8T0(t)=  [G(nAf)Af]  gO'2?rnA/)' 

n=-o o 


(3.7a) 


Equation  (3.7a)  shows  that  gTo(t)  can  be  expressed  as  a  sum  of  everlasting  exponentials  of 
frequencies  0.  ±Af ,  ±2  Af ,  ±3  A / ,  . . .  (the  Fourier  series).  The  amount  of  the  component 
of  frequency  nAf  is  [G(nAf)Af],  In  the  limit  as  7o  ->  oo,  Af  -*■  Oand  gT  (t)  g(t). 

Therefore, 


00 

g(t)  =  ^Um^gTo(t)  =  Jim(|  G(nAf)eij27rnAf)t  Af 

A'^°  n=-oo 


(3.7b) 


*  You  may  consider  this  as  an  irrefutable  proof  of  the  proposition  that  0%  ownership  of  everything  is  better  than 
100%  ownership  of  nothing! 
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The  sum  on  the  right-hand  side  of  Eq.  (3.7b)  can  be  viewed  as  the 
G(f)ejZ7TJ',  as  shown  in  Fig.  3.3.  Therefore, 


area  under  the  function 


8(t) 


=  r  G(f)eWdf 

J- 00 


(3.8) 


in  representing  an  aperiodic  signal  e(t)  bv  F *  F  integral.  We  have  now  succeeded 
This  integral  is  basically  a  Fourier  lntegra  (rather  than  a  Fourier  series), 

seen  from  Eq  (3  7b)  The  am™  ,  fT  “  ^  ,mit)  With  fundamenlal  frequency  A/  — ►  0,  as 

Of  Gif).  The  same  informationiTconreySbv  th  “"'J  gU)  llK  inverMf  founcr  transfomi 
transform  pair.  Symbolically,  this  is  expressed^  C  *  a*6"16"1  ***  G(f)  are  3  F°Urier 


C(/>  FlgU)}  and  g(r)  =  Jr~l  [C(/)J 
or 


To  recapitulate. 


g(t)  <=>  G(f) 


and 


G(f  )  =  r  gU)e~fi”fidt 
J-00 

/oo 

G(f)ej^fidf 


(3.9a) 


Fourier 


lijs  helpful  to  keep  in  mind  that  the  Fouri*  • 

SeneS  W'th  fundamental  frequency  A/  anni^f  **  Eq‘  (3%)  is  of  ttie  nature  01  a 

J  aPProaching  zero  [Eq.  (3.7b)].  Therefore,  most 


(3.9b) 
the  nature  of  a 


;ThisshoUid„„,beconsjderedtobear.; 


appear. 1 
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of  the  discussion  and  properties  of  Fourier  series  apply  to  the  Fourier  transform  as  well.  We 


can  plot  the  spectrum  G(/)  as  a  function  off.  Since  G(/)  is  complex,  we  have  both  amplitude 
and  angle  (or  phase)  spectra: 

G(f)  =  |  G(/)|<?'W) 


and  angle  (or  phase)  spectra: 


in  which  |G(/)|  is  the  amplitude  and  9g(f)  is  the  angle  (or  phase)  of  G(f ).  From  Eq.  (3.9a), 


OO 


g(t)eJ2nfi  dt 


—  OO 


/  versus  co 

Traditionally,  two  equivalent  notations  of  angular  frequency  co  and  frequency/  are  often  used 
in  representing  signals  in  the  frequency  domain.  There  is  no  conceptual  difference  between 
the  use  of  angular  frequency  w  (in  radians  per  second,  rad/s)  and  frequency  /  (in  hertz,  Hz). 
Because  of  their  direct  relationship,  we  can  simply  substitute  (o  =  2 nf  into  G(f)  to  arrive  at 
the  Fourier  transform  relationship  in  the  cu-domain: 


Because  of  the  additional  lit  factor  in  the  variable  co  used  by  Eq.  (3.10),  the  inverse  transform 
as  a  function  of  to  requires  an  extra  division  by  In.  Therefore,  the  notation  of /  is  slightly 
favored  in  practice  when  we  write  Fourier  transforms.  For  this  reason,  we  shall,  for  the  most 
part,  denote  the  Fourier  transforms  of  signals  as  functions  of  G(/).  On  the  other  hand,  the 
notation  of  angular  frequency  co  can  also  offer  some  convenience  when  we  are  dealing  with 
sinusoids.  Thus,  in  later  chapters,  whenever  it  is  convenient  and  nonconfusing,  we  shall  use 
the  two  equivalent  notations  interchangeably. 


Conjugate  Symmetry  Property 

From  Eq.  (3.9a),  it  follows  that  if  g(t)  is  a  real  function  oft,  then  G(/)  and  G(-f )  are  complex 
conjugates,  that  is,*  H 


G(— /)  =  G*(/) 


(3.11) 


Therefore, 


|G(-/)I  =  |G(/)| 

#*(-/)  =  -og(f) 


(3.12a) 

(3.12b) 


Thus,  for  real  g(t).  the  amplitude  spectrum  |G(/)|  is  an  even  function,  and  the  phase  spectrum 
0g(f)  is  an  odd  function  of/.  This  property  (the  conjugate  symmetry  property)  is  valid  only 
for  real  g(t).  These  results  were  derived  earlier  for  the  Fourier  spectrum  of  a  periodic  signal 
in  Chapter  2  and  should  come  as  no  surprise.  The  transform  G(f)  is  the  frequency  domain 
specification  ofg(t). 


*  Hermitian  symmetry  is  the  term  used  to  describe  complex  functions  that  satisfy  Eq.  (3.11) 
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Example  3.1 


Figure  3.4 

(a)  e~a,u(t)  and 

(b)  its  Fourier 
spectra. 


Find  the  Fourier  transform  of  e  atu(t). 


By  definition  [Eq.  (3.9a)], 

/OO  /*0° 

^)e-^  dt  =  jo  dt  =  -+j2nf 


-1 


,-(a+j2nf  )t 


oo 


But  \e  |  =  1.  Therefore,  as  t  — <■  oo,  e  (a+fW)'  =  e  °'e  =  0  if  a  >  0. 

Therefore, 


G(f)  =  — —  a  >  0 

a+jco 


(3.13a) 


where  co  =  2nf.  Expressing  a  +  jco  in  the  polar  form  as  s/a1  +  w1  eJ  ,an  ' ( a  * ,  we  obtain 


G(f)  =  =e~jtan~'  {2nf/a) 

x/a2  +  (27 r/)2 


(3.13b) 


Therefore, 


|C(/)|  =  — 

+  (2tt/)2 


OherPm?^^t,rUm  |G(/)I  and  the  phase  spectrum  W>  are  shown  in  Fig.  3.4b. 
e*S  f)  15  ^  CVen  fUnCti°n  °f/’  and  W  is  an  odd  Unction  of/,  as 


in  p  ,  o  ™uner  Transform 

xample  3. 1  we  observed  that  when  a  <  0  thP  p  • 
verge.  Hence,  the  Fourier  transform  forego  h  mtegral  for  e~a'u(t)  does  not  c< 

“(0  does  not  exist  if  «  <  0  (growing  exponents 
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Clearly,  not  all  signals  are  Fourier  transformable.  The  existence  of  the  Fourier  transform  is 
assured  for  any  g(t)  satisfying  the  Dirichlet  conditions  given  in  Eq.  (2.84).  The  first  of  these 
conditions  is* 


\g{t)\dt  <  oo 


(3.14) 


To  show  this,  recall  that  \e  =  1 .  Hence,  from  Eq.  (3.9a)  we  obtain 


|C(/)I  < 


\g(t)\dt 


This  shows  that  the  existence  of  the  Fourier  transform  is  assured  if  condition  (3. 14)  is  satisfied. 
Otherwise,  there  is  no  guarantee.  We  saw  in  Example  3.1  that  for  an  exponentially  growing 
signal  (which  violates  this  condition)  the  Fourier  transform  does  not  exist.  Although  this 
condition  is  sufficient,  it  is  not  necessary  for  the  existence  of  the  Fourier  transform  of  a  signal. 
For  example,  the  signal  (sin  at)/t,  violates  condition  (3. 14)  but  does  have  a  Fourier  transform. 
Any  signal  that  can  be  generated  in  practice  satisfies  the  Dirichlet  conditions  and  therefore  has 
a  Fourier  transform.  Thus,  the  physical  existence  of  a  signal  is  a  sufficient  condition  for  the 
existence  of  its  transform. 


Linearity  of  the  Fourier  Transform  (Superposition  Theorem) 

The  Fourier  transform  is  linear;  that  is,  if 

g\(t)  <=>  Gi(/)  and  g2(t)  <<=>  G2(f) 

then  for  all  constants  a\  and  a2,  we  have 

a\g\{t)  +  a2g2(t)  <=>  a}Gi(f)  +  a2G2(f)  (3.15) 

The  proof  is  trivial  and  follows  directly  from  Eq.  (3.9a).  This  theorem  simply  states  that  linear 
combinations  of  signals  in  the  time  domain  correspond  to  linear  combinations  of  their  Fourier 
transforms  in  the  frequency  domain.  This  result  can  be  extended  to  any  finite  number  of 
terms  as 


y  Am(0^==»  TakGk(f) 

k  k 

for  any  constants  { a *}  and  signals  {#*(/)}. 

Physical  Appreciation  of  the  Fourier  Transform 

To  understand  any  aspect  of  the  Fourier  transform,  we  should  remember  that  Fourier  repre¬ 
sentation  is  a  way  of  expressing  a  signal  in  terms  of  everlasting  sinusoids,  or  exponentials. 
The  Fourier  spectrum  of  a  signal  indicates  the  relative  amplitudes  and  phases  of  the  sinu¬ 
soids  that  are  required  to  synthesize  that  signal.  A  periodic  signal’s  Fourier  spectrum  has  finite 
amplitudes  and  exists  at  discrete  frequencies  (fo  and  its  multiples).  Such  a  spectrum  is  easy 


*  The  remaining  Dirichlet  conditions  are  as  follows.  In  any  finite  interval,  g(t)  may  have  only  a  finite  number  of 
maxima  and  minima  and  a  finite  number  of  finite  discontinuities.  When  these  conditions  are  satisfied,  the  Fourier 
integral  on  the  right-hand  side  of  Eq.  (3.9b)  converges  to  g(t)  at  all  points  where  g(r)  is  continuous  and  converges 
to  the  average  of  the  right-hand  and  left-hand  limits  of  g(f)  at  points  where  g(t)  is  discontinuous. 
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Figure  3.5 

Analogy  for 

Fourier 

transform. 


.  um  ....  ^UUIH  U1  an  aperiodic  signal  is  not  easy  to  visualize  because  it  has  n 

continuous  speclnim  that  exists  a,  every  freqtrency.  The  continis  spec‘s  co“  p,  cl  be 

{acond^ 

Wr-j^D, 

i=  1 

S“af°Wn  "  Fi^  35b'  In  **>  -e, 

not  mean  that  there  is  no  load  on  the  be-m  a  ’  ? d  1  any  one  po,nt  ,s  zero-  This  does 

not  the  load  at  a  point,  but  rather  the  loading  Tfa”‘ng  u  measure  of  load  in  this  situation  is 
be  the  loading  density  per  unit  length  of  beam  S'  **  ““If  le"gth  *  tHa'  P°int'  Let  G« 
A,  (A*  -  0)  a,  some  point  x  is  “  ?“  the  ]°ad  °Ver  a  beam  le"gtb 

beam  into  segments  of  interval  A*  (A*  ->  ot  Th  1  C  j°t3  °ad  on  tbe  ^eam’  we  divide  the 
Aa  is  [G(/tAx)J  Ax.  The  total  load  WT  is  given  byC  ^  °Ver  ”th  SUCh  segment  of  length 


*1 

/*n 

G(x)dx 

i 

In  the  case  of  discrete  loading  (Fig  3  sal  thA  ,  . 

^ints  there  is  no  load.  On  the  other  hand,  in  thecoTj^ \  T*  "  diSCrete  points- At  other 
^Very  po,nt’  ^ut  at  any  specific  point  x  the  load  i«  ^  y  oaded  case,  the  load  exists  at 
however,  is  |G(„Ax)|  Ax  (Fig.  Wb)  i  n  ^  k  ^  0W  *  s™"  Ax. 

relative  load  at  that  point  is  G(x).  Ugb  tbe  oad  at  a  point  x  is  zero,  the 

An  exactly  analogous  situation  exists  in  ih» 

penodlc,  the  spectrum  is  discrete,  and  g(t)  can  be  exore  °  i  Sig"al  Spectrum-  when  g(t)  is 
W  b  bn.te  amplitudes:  expressed  as  a  sum  of  discrete  exponentials 


meiraf ' that  iS’ the  Spectrum  exists  for 

measure  here  ,s  not  the  amplitude  of  a  component  of  ?PeCtrUm  is  “fo.  The  meaningful 

her  am,  bandwidth.  From  e,,  (3.7b) ,  is  £  “  “ f  ™  bu,  the  spectra,  density 

gU)  ,S  &y"thes,zed  by  adding  exponentials 


Figure  3.6 

Time-limited 

pulse. 
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c 


b 


of  the  form  eJ  ft,  in  which  the  contribution  of  any  one  exponential  component  is  zero.  But 


the  contribution  of  exponentials  in  an  infinitesimal  band  A /  located  at/  =  nAf  is  G(nAf)  Af 
and  the  addition  of  all  these  components  yields  g(t)  in  integral  form: 


OO 


The  contribution  of  components  within  the  band  df  is  G(/)  df ,  in  which  df  is  the  bandwidth 
in  hertz.  Clearly  G(/ )  is  the  spectral  density  per  unit  bandwidth  (in  hertz).  This  also  means 
that  even  it  the  amplitude  of  any  one  component  is  zero,  the  relative  amount  of  a  component 
of  frequency/  is  G(/).  Although  G(/)  is  a  spectral  density,  in  practice  it  is  customarily  called 
the  spectrum  of  g(t)  rather  than  the  spectral  density  of  g(t).  Deferring  to  this  convention,  we 
shall  call  G(f)  the  Fourier  spectrum  (or  Fourier  transform)  of  g(t). 

A  Marvelous  Balancing  Act 

An  important  point  to  remember  here  is  that  g(t)  is  represented  (or  synthesized)  by  exponentials 
or  sinusoids  that  are  everlasting  (not  causal).  This  leads  to  a  rather  fascinating  picture  when 
we  try  to  visualize  the  synthesis  of  a  time-limited  pulse  signal  g(t)  (Fig.  3.6)  according  to  the 
sinusoidal  components  in  its  Fourier  spectrum.  The  signal  g(t)  exists  only  over  an  interval 
(a,  b)  and  is  zero  outside  this  interval.  The  spectrum  of  g(t)  contains  an  infinite  number 
of  exponentials  (or  sinusoids),  which  start  at  t  =  -oo  and  continue  forever.  The  amplitudes 
and  phases  of  these  components  are  such  that  they  add  up  exactly  to  g(t)  over  the  finite 
interval  (a,  b)  and  add  up  to  zero  everywhere  outside  this  interval.  Juggling  with  such  a 
perfect  and  delicate  balance  of  amplitudes  and  phases  of  an  infinite  number  of  components 
boggles  the  human  imagination.  Yet,  the  Fourier  transform  accomplishes  it  routinely,  without 
much  thinking  on  our  part.  Indeed,  we  become  so  involved  in  mathematical  manipulations  that 
we  fail  to  notice  this  marvel. 


3  2  TRANSFORMS  OF  SOME  USEFUL  FUNCTIONS 


For  convenience,  we  now  introduce  a  compact  notation  for  some  useful  functions  such  as 
rectangular,  triangular,  and  interpolation  functions. 

Unit  Rectangular  Function 

We  use  the  pictorial  notation  n  (*)  for  a  rectangular  pulse  of  unit  height  and  unit  width,  centered 
at  the  origin,  as  shown  in  Fig.  3.7a: 


nw  =  0.5  |jc|  =  x- 

o  W  >  ^ 


(3.16) 
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Figure  3.7 

Rectangular 

pulse. 


Figure  3.8 

Triangular  pulse. 


1 

nw 

1 

nef) 

_ 

0  J 

- 1 

c_ 

0  I  .v 

(a) 


(b) 


Notice  that  the  rectangular  pulse  in  Fig.  3.7b  is  the  unit  rectangular  pulse  ITU)  expanded 
by  a  factor  t  and  therefore  can  be  expressed  as  THx/t)  (see  Chapter  2).  Observe  that  the 
denominator  r  in  fl(jr/T)  indicates  the  width  of  the  pulse. 


Unit  Triangular  Function 

We  use  a  pictorial  notation  A(*)  for  a  triangular  pulse  of  unit  height  and  unit  width,  centered 
at  the  origin,  as  shown  in  Fig.  3.8a: 


A«  = 


1  -  2 |.t| 
0 


M  <  ^ 


M>  2 


(3.17) 


Observe  that  the  pulse  in  Fig.  3.8b  is  A(*/r).  Observe  that  here,  as  for  the  rectangular  pulse, 
the  denominator  r  in  A(*/r)  indicates  the  pulse  width. 


Sine  Function  sinc(jt) 

T  he  function  sin  x/x  is  the  “sine  over  argument”  function  denoted  by  sine  (*).* 
I  his  function  plays  an  important  role  in  signal  processing.  We  define 


sine  (jc)  = 


sin  x 
x 


(3.18) 


•  The  function  sine  W  is  also  denoted  by  Sa  « in  the  literature.  Some  authors  define  sine  (,)  as 


sine  (jc)  = 


sin  7tx 

7 TX 


Figure  3.9 

Sine  pulse. 
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1 


x 


(a) 


sine  (“^°) 


-14?rs — S-ln  0  7rcx ^1471 

3  3  3  3  « 


(b) 


Inspection  of  Eq.  (3.18)  shows  that 

1 .  sine  ( x )  is  an  even  function  of  jc. 

2.  sine  (x)  =  0  when  sin  *  =  0  except  at  *  =  0,  where  it  is  indeterminate.  This  means  that 
sine  (jc)  =  0  for  t  =  ±n,  ±2n ,  ±3;r, .... 

3.  Using  L’Hopital’s  rule,  we  find  sine  (0)  =  1. 

4.  sine  (jc)  is  the  product  of  an  oscillating  signal  sin  *  (of  period  2n)  and  a  monotonically 
decreasing  function  \/x  .  Therefore,  sine  (jc)  exhibits  sinusoidal  oscillations  of  period  2 tt, 
with  amplitude  decreasing  continuously  as  1/jc. 

5.  In  summary,  sine  (jc)  is  an  even  oscillating  function  with  decreasing  amplitude.  It  has  a  unit 
peak  at  jc  =  0  and  zero  crossings  at  integer  multiples  of  n. 

Figure  3.9a  shows  sine  (jc).  Observe  that  sine  (jc)  =  0  for  values  of  jc  that  are  positive  and 

negative  integral  multiples  of  tt.  Figure  3.9b  shows  sine  (3o>/7).  The  argument  3cu/7  =  n  when 

co  =  In/ 3  or/  =  7/6.  Therefore,  the  first  zero  of  this  function  occurs  at  to  =  ln/3  (f  =  7/6). 


Example  3.2  Find  the  Fourier  transform  of  g(t)  =  FI(//r)  (Fig.  3.10a). 


We  have 
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Since  n(//r)  =  I  for  |f|  <  r/2,  and  since  it  is  zero  for  |f|  >  r/2, 

rr/ 2 

G(f)  =  /  e~j2*fidt 
J-x/2 

1  (e-MT_c7>/r)  =  2sin(7r/r) 


=  r- 


sin  (nf  t) 
(xfr) 


2  nf 


=  r  sine  (nf  r) 


Therefore, 


n 


«=>  r  sine 


(y )  =  t  sinc  (tt/t) 


(3.19) 


Recall  that  sine  (x)  =  0  when  *  =  ±/ot.  Hence,  sine  (<ur/2)  =  0  when  wr/2  =  ±nn\  that 
is,  when  /  =  ±Az/r(rt  =  1,2,3,...),  as  shown  in  Fig.  3.10b.  Observe  that  in  this  case 
Gif)  happens  to  be  real.  Hence,  we  may  convey  the  spectral  information  by  a  single  plot 
of  G(f)  shown  in  Fig.  3.10b,  without  having  to  show  |G(/)|  and  ZG(/)|. 


Figure  3.10 

(a)  Rectangular 
pulse  and  (b)  its 
Fourier  spectrum 


Bandwidth  of  IK^) 

□  <X“T„w£V"  Fi8'13l0lPeakS  : =°  “d  *  h'ghor  frequencies.  Therefore. 

ZZZXlZZX*  T'  °f  “*  Slgna‘  e"ergy  in  lower  ^uency  components, 
(^nl  Z?"  r  "  in  m  !  e'"  ""  l'iglMS' (Significaml  frequency  anti  the  lowest 

mm' 0  ,o  ’ T ZX 1  SPeC!mm'  StriC'ly  Speaking-  the  spectrum  extends 

:^r;:z7-r,T::r much  of  *?  >«>— is 

ir  ~  riXshma,:  if, z 


*/  trigonometric 

mathematical  convenience.  Each  sinusoid  cos  w„i  appears  is  a  sum  of  ^  USe  exponentlal  sPectra  for 

and  with  frequencies  d  respectively  Rnr  in  i  T°  C°mp  **  eXponemial  components  */■** 
frequency  u>„.  resPectlve'y-  But  m  reahty,  there  is  only  one  real  component  of 
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Example  3.3  Find  the  Fourier  transform  of  the  unit  impulse  signal  S(t). 

We  use  the  sampling  property  of  the  impulse  function  [Eq.  (2.19)],  to  obtain 

/OO 

me-j2*fidt  =  e-W-0  =  1 

*00 


Figure  3.1 1 

(a)  Unit  impulse 
and  (b)  its 
Fourier  spectrum. 


or 


5(f)  1 

Figure  3. 1 1  shows  5(f)  and  its  spectrum. 


(a) 


(3.20a) 


(3.20b) 


1 

g(t)  =  5  (/) 

L 

1 

G(/)=  1 

o  /-► 

0  /- 

(b) 


Example  3.4  Find  the  inverse  Fourier  transform  of  8(2izf)  =  ^5 (/). 

From  Eq.  (3.9b)  and  the  sampling  property  of  the  impulse  function, 

/OO  i  /»00 

8(2  nf)e™  df  =  —  I  8(2nf)eJ2^  d(2nf) 

-OO  J  —  OO 

2n  2  7T 


Therefore, 


or 


&  ^  w) 


8(f) 


(3.21a) 


(3.21b) 


This  shows  that  the  spectrum  of  a  constant  signal  g(t)  =  I  is  an  impulse  8(f)  =  2n8(2nf), 
as  shown  in  Fig.  3.12. 
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Figure  3.12 

(a)  Constant  (dc) 
signal  and  (b)  its 
Fourier  spectrum. 


(a) 


(b) 


The  result  [Eq.  (3.21b)]  also  could  have  been  anticipated  on  qualitative  grounds. 
Recall  that  the  Fourier  transform  of  g(t)  is  a  spectral  representation  of  g(t)  in  terms  of 
everlasting  exponential  components  of  the  form  ,  Now  to  represent  a  constant  signal 
g(f)  =  1,  we  need  a  single  everlasting  exponential  ej2nfi  with  /  =  0.  This  results  in  a 
spectrum  at  a  single  frequency/  =  0.  Another  way  of  looking  at  the  situation  is  that 
“  &(*)  =  1  is  a  dc  signal  that  has  a  single  frequency/  =  0  (dc). 


...  I!  *mPu^se  ® 's  a  spectrum  of  a  dc  signal,  what  does  an  impulse  at  f  =fo  represent? 

We  shall  answer  this  question  in  the  next  example. 


Example  3.5  Find  the  inverse  Fourier  transform  of  6(/  —fy). 

From  the  sampling  property  of  the  impulse  function,  we  obtain 


-i  f°° 

T  [<S(/-/o)]=  /  S(f  -f0)ej2nft  df  =  ej2nfo' 
J -00 


Therefore, 


eJlnfo'  *=>  Hf  -fo) 


(3.22a) 


t7=/  rre  chr:arrm|0fan  erlaSting  exp°nential  ^  »  a  single  impu. 

^~T"„IqualiTive  reasonin8' To  rep:esc"' « 

Therefore  the  ,  gle  everlasting  exponential  e^f'  with  /  =  , 

Fmm  F  ,TST  “"SISts  °f  a  sin8,e  component  at  frequency/  =  ft 
From  Eq.  (3.22a)  it  follows  that  y/  j0‘ 


e-j2nf0l  ^  8{f  +/o) 


(3.22b) 
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Example  3.6  Find  the  Fourier  transforms  of  the  everlasting  sinusoid  cos  2 jrf0t. 

Recall  the  Euler  formula 


cos  2nfot  =  -  (ej2*f°l  -f 


Upon  adding  Eqs.  (3.22a)  and  (3.22b),  and  using  the  preceding  formula,  we  obtain 


cos  2nf0t  <=»  -[<$(/  +/0)  +  S(f  -f0)] 


(3.23) 


The  spectrum  of  cos  2nf0t  consists  of  two  impulses  at  /„  and  -f0  in  the /-domain,  or, 
two  impulses  at  ±cuo  =  ±2jr/0  in  the  cu-domain  as  shown  in  Fig.  3.13.  The  result  also 
follows  from  qualitative  reasoning.  An  everlasting  sinusoid  cos  coot  can  be  synthesized  by 
two  everlasting  exponentials,  e™1  and  e~^‘ .  Therefore,  the  Fourier  spectrum  consists 
of  only  two  components,  of  frequencies  coq  and  -coo. 


Figure  3.13 

(a)  Cosine  signal 
and  (b)  its 
Fourier  spectrum. 


i  Gif) 

0.5 

0.5 

~fo 

0  j 

Fo  /■ 

(b) 


Example  3.7 


Find  the  Fourier  transform  of  the  sign  function  sgn  (/)  (pronounced  signum  t),  shown  in 
Fig.  3.14.  Its  value  is  +1  or  —1,  depending  on  whether  /  is  positive  or  negative: 


sgn  (/)  = 


1 

0 

-1 


/  >  0 
/  =  0 
t  <  0 


(3.24) 


We  cannot  use  integration  to  find  the  transform  of  sgn  (/)  directly.  This  is  because 
sgn  (/)  violates  the  Dirichlet  condition  (see  Sec.  3. 1 ).  Specifically,  sgn  (/)  is  not  absolutely 
integratable.  However,  the  transform  can  be  obtained  by  considering  sgn  /  as  a  sum  of  two 
exponentials,  as  shown  in  Fig.  3.14,  in  the  limit  as  a  — *  0: 

sgn  /  =  lim  [e~a,u(t)  —  ea,u(—  /)] 
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Figure  3.14 

Sign  function. 


1 

1 

sgn  (r) 

. .  alu(t) 

. . 

)  f 

-1 

Therefore, 


.F(sgn(/)]  =  lim  \T[e  a,«(/)]  -  f[ea'u(-t)]\ 

=  iLmo  (see  pairs  1  and  2  in  Table  31> 

=  lim  \  =  _L 

<*-* 0  V  a*  +  4 jt2/2  )  jnf 


(3.25) 


3.3  SOME  PROPERTIES  OF  THE  FOURIER 
TRANSFORM 

«T,7  Sm,dJ  SO"“,°f,he  lmp"na"‘ prope,,ieS  of  ,he  Follri'r  transf°rm  and  their  implications 
»  we  1  ns  their  applications.  Before  embarking  on  this  study,  i,  is  impottan,  point  on.  a 
pervasive  aspect  of  the  Fourier  transform— the  time-frequency  duality. 

3.3.1  Time-Frequency  Duality 

Equations  (3.9a)  and  (3.9b)  show  an  interesting  tw  .  ,  .  . 

oneritinrK  qrp  -muriraW,  •  i  1  taCt‘  ^irect  a°d  the  inverse  transform 

then  from  Cl  fu  m  ^  ^  These  operations’  required  to  go  from  g(t)  to  G(f)  and 

then  from  G{f)  log(t),  are  shown  graphically  in  Fi?  l  1  s  tu*  ,.rr  , 

1  he“r  °t  t*  "?  i.n  * 

basis  of  Ae^o-ralleddutdity^ftime'^^frequmicy  *j7i  ?Fl*le  Fwil^er,rans^orm-  I* 's  ^ 

by  analogy  a  photognph  and  Us  negate  A  ph^UgraTcm^ Z,T  JT  beC°™d‘rei 
and  by  using  an  identical  procedure  a  negative  can  2  u  be°?aine?from  lts  ne8ative’ 
any  result  or  relationship  between  g(t)  and  G(  f)  the  btamedfrom  the  Photograph.  For 
obtained  by  interchanging  the  roles' of  g(n  and  G(  f)  ^  reSUk  ^  relationsh,p’ 

minor  modifications  arising  because  ofasi.-n  eh-.  {  c  he  onginal  result  (alonf?  w'th  some 

to  be  proved  later,  states  that  if  g(t)  <=>  G(f)  then  F°reXample’ the  time‘shifting  property, 


g(t  -  t0)  <=>  G(f)e~j2nfi<> 
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TABLE  3.1 


Short  Table  of  Fourier  Transforms 

8(0 

Gif) 

1  e~atu(t) 

1 

a  >  0 

a  +j2nf 

2  eatu(—t) 

a  >  0 

a  —j2nf 

3 

2  a 

a  >  0 

a2  +  (2n f)2 

4  te~atu(t) 

1 

a  >  0 

(a  +j2nf)2 
n\ 

5  tne~atu(t) 

a  >  0 

(a  +  j2nf)n+ 1 

7  1 

8 

9  cos  2 nfat 

10  sin  Infot 

11  u(t) 

12  sgn  / 

13  cos  2nfot  u(t) 

14  sin  2nfot  u(t) 

15  e~at  sin  2 nfot  u(t) 

16  e~at  cos  2nfat  u(t) 

17  n(i) 

1 8  2  B  sine  (2jiBt) 

19  A (I) 

20  flsinc2(;rfl/) 

21  E^L-ooHt-nT) 

22  €_,2/2<t2 


1 

*(/) 

«(/  ~/o) 

0.5  [S(/ +/<,)  +  *(/ -/<,)] 

>0.5  [<5(/  +/0)  -  <5(/  — /o)] 

lw + s' 

2 


yW 


2tt/o 

(o  +j2rtf)2  +  4tt2/02 

a  +j2nf 

( a  +j2nf)2  +  4jt2/02 
r  sine  (;r/r) 


"(4) 


(?) 


^  sine2 


44) 

foEZ-ooS(f-nfo) 

oj2^e~1(anrt 


a  >  0 
a  >  0 


/o  =  - 


The  dual  of  this  property  (the  frequency-shifting  property)  states  that 

«=>  G(/  -/o) 

Observe  the  role  reversal  of  time  and  frequency  in  these  two  equations  (with  the  minor  differ¬ 
ence  of  the  sign  change  in  the  exponential  index).  The  value  of  this  principle  lies  in  the  fact 
that  whenever  we  derive  any  result,  we  can  be  sure  that  it  has  a  dual.  This  assurance  can  lead 
to  valuable  insights  about  many  unsuspected  properties  or  results  in  signal  processing. 

The  properties  of  the  Fourier  transform  are  useful  not  only  in  deriving  the  direct  and  the 
inverse  transforms  of  many  functions,  but  also  in  obtaining  several  valuable  results  in  signal 
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Figure  3.1 5 

Near  symmetry 
between  direct 
and  inverse 
Fourier 
transforms. 


Example  3.8 


Figure  3.16 

Duality  property 
of  the  Fourier 
transform. 


wr^ing'  .y.hthreaide';.sh0Uld  not  fail  to  observe  ‘he  ever-present  duality  in  this  discussion. 
discS  ^  Pr0Pe,1y'  WhiCh  ^  °"e  °f  thC  conse£luences  °t  the  duality  principle 


3.3.2  Duality  Property 

The  duality  property  states  that  if 


g(t)  G(/) 

then 


G(t ) 


S(~f) 


(3.26) 


transform  of  G(/X  ^th/^pIaced^byT,  ^"the  ^-/^which'is  th^  G(j°  the"  the  F°Ur'er 
with  i  replaced  bv  —f  '  '  bltb  s  tbe  ong,nal  time  domain  signal 


replaced  by  -/ 
Proof:  From  Eq.  (3.9b), 


Hence. 


gU) 


/OO 

G(x)eJ2™  dx 

-OO 


/OO 

G(x)e~ilnx‘  dx 

-OO 


Changing  t  to/  yields  Eq.  (3.26). 
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Example  3.8  In  this  example  we  shall  apply  the  duality  property  [Eq.  (3.26)]  to  the  pair  in  Fig.  3.16a. 


Figure  3.16 

Duality  property 
of  the  Fourier 
transform. 


From  Eq.  (3. 19)  we  have 


n 

n 


G) 

G) 


►  r  sine  (;r/r) 
ot  sine  [n fa) 


g(t) 


Gif) 


(3.27a) 

(3.27b) 


Also  G(t)  is  the  same  as  G(f)  with /  replaced  by  u  and  g(-f )  is  the  same  as  g(t)  with  t 
replaced  by  — /.  Therefore,  the  duality  property  (3.26)  yields 


a  sine  {nat) 
G{t) 


n  nG) 

«<-/) 


(3.28a) 


Substituting  r  =  2na ,  we  obtain 


rsinc 


<=>  2  Jr  n 


(3.28b) 


In  Eq.  (3.28)  we  used  the  fact  that  n  (— t)  =  n  (/)  because  n (r)  is  an  even  function. 
Figure  3.16b  shows  this  pair  graphically.  Observe  the  interchange  of  the  roles  of  t  and 
2nf  (with  the  minor  adjustment  of  the  factor  2n).  This  result  appears  as  pair  18  in  Table  3.1 
(with  a  =  2 B). 
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As  an  interesting  exercise,  readers  should  apply  the  duality  property  to  generate  a  dual  of  every 
pair  in  Table  3.1. 


3.3.3  Time-Scaling  Property 

if 


g(t)  <=*  G(/) 


then,  for  any  real  constant  a. 


g(at) 


\a\  \a) 


(3.29) 


Proof:  For  a  positive  real  constant  a, 

ng(at)  ]  =  g(at)e-Mdt  =  -  [°°  g(x)e{~^f  ^  dx  =  i-G 

J-°°  a  J- OO  a  \a ) 

Similarly,  it  can  be  shown  that  if  a  <  0, 


g(at)  <=>  ~G 


Hence  follows  Eq.  (3.29).  . 

Significance  of  the  Time-Scaling  Property 

The  function  ,(«/)  represents  the  function  g(r)  compressed  in  time  by  a  factor  a  (|a|  >  l)(se 

bv  the  same  fv!)  ** '  ti  U?Ctl°n  ^  ^  represents  the  function  G(/)  expanded  in  frequenc 

ind'swmllZ  tme.  8  PWperty  StQteS  that  time  compression  of  a  signal  resuh 

lntmtiTelvweZ,,V'T''  Ti  eXpansi°n  °f,he  ****  ^sults  in  its  spectral  compressio, 

T  i  C0TeSSi0n  in  time  by  a  factor  «  means  that  the  signal  , 
varying  more  rapidly  by  the  same  factor.  To  synthesize  such  a  signal  the  frequencies  of  il 

'l  TSed  by  ba"4,‘  ™plyins  lha‘ ils  M-e-cy  spectra, 
is  expanded  by  the  factor  a.  Similarly,  a  signal  expanded  in  time  varies  more  slowlv  hence  th 

iTciS  tr.S,'!*:  same  “  ,he  si8“'  «*  2-/0/  time -compressed  by 
spectrum  of  the  latler  (impulse  a,  ±/bb  ^'e'eflecl'ofttds  soling  u  demonstrated 

Reciprocity  of  Signal  Duration  and  Its  Bandwidth 

«mu  stirr„rz'!::h'hr if .*« is  -**•  *  is  M,row.  * 

bandwidth  of  a^ignal  is  inverselv  "*  bandwidth’ and  vice  versa.  This  suggests  that  th 

have  already  verified  this  fact  fothT" ^  S'gnal duration  or  width  (in  seconds).  W 
of  a  gate  pul  of  wX  ??  "I""  «  *■*  that  the  bandwtd, 

found  in  the  literature.2  °re  lscusslon  °f  this  interesting  topic  can  b 


Figure  3.1 7 

The  scaling 
property  of  the 
Fourier 
transform. 


Example  3.9 


Figure  3.18 

M  ^a|r|  and  (b) 
its  Fourier 
spectrum. 
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Figure  3.1 7 

The  scaling 
property  of  the 
Fourier 
transform. 


1 

gU) 

T 

2 

0  1 

2 

L  t- 
» 

Example  3.9  Show  that 


g(-t)  <=►  G(—f) 


(3.30) 


Use  this  result  and  the  fact  that  e  alu(t)  <=>  1  /(a  +  jlnf)  to  find  the  Fourier  transforms  of 
ea,u(—t)  and  e~a^. 


Equation  (3.30)  follows  from  Eq.  (3.29)  by  letting  a  =  - 1 .  Application  of  Eq.  (3.30)  to 
pair  1  of  Table  3.1  yields 


Figure  3.18 

W  e~altl  and  (b) 
its  Fourier 
spectrum. 


1 1 2  ANALYSIS  AND  TRANSMISSION  OF  SIGNALS 


3.3.4  Time-Shifting  Property 

if 


then 


Proof:  By  definition. 


<=>  G(f) 


g(t  -  to)  <=>  G(f)e~j2nfio 


/OO 

g(t  -  to)e~j2*ft  dt 

-OO 


Letting  /  —  to  =  x,  we  have 


(3.32a) 


T[g{t  -  f0)]  =  f  g(x)e-j2nf(x+,°)  dx 
J— OO 

/OO 

g(x)e~j2jtfx  dx  =  G(f)e~j2nfi 0  (3.32b) 


rnysicai  explanation  of  the  Linear  Phase 

uhaSe  Shift  in  itS  Spectrum' This  result  can  also  be  derived 
sinusoids  of  certain  amplitudes  and  pha^ef  The  del '^1^  ltS,F°Urier  comPonents>  which  are 
the  same  sinusoidal  components,  each  delayed  bv  /  Ilf  h8"™*  '  ~  ^  C™  be  synthesized  by 
remain  unchanged.  Therefore  the  amnlim  t  y  °  nds' The  amplitudes  of  the  components 

The  time  delay  of  r„  iSZSt?  ’’"T"  °f  *<>  ~  '»> is  “»&•<  >»  that  of  *(/). 
Now.  a  sinusoid  cos  lift  delayed  by  is  giv'enV”  ““  PhaSe  °f  eadl  C‘,mpon"" 

cos  2n f  (t  -  t0)  =  cos  (2 nft  _  2;r ft0) 

a  linear  function  of/Zeaning^that  highS UCnCy/  manifests  3S  a  phase  delay  of2*fio- This  is 
larger  phase  shifts  to  achieve  the  same  time^l^^Th™1301161115  must  undergo  proportionately 
sinusoids,  the  frequency  ^  *  Sh°Wn  in  Fi§-  319  with  *"° 

delay  t()  amounts  to  a  phase  shift  of  jt/2  in  the  ,W'te  tbat  of  tbe  uPPer-  The  same  time 

sinusoid.  This  verities  the  fact  that  to  achip  •  - sinusoid  and  a  phaseshift  of  n  in  the  lower 
must  undergo  proportionately  larger  phase  Ihiftf^  ^  delay’  hl  She  r  frequency  sinusoids 
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Figure  3.19 

Physical 
explanation  of 
the  time-shifting 
property. 


Example  3. 10  Find  the  Fourier  transform  of  e  a ^  /0L 


This  function,  shown  in  Fig.  3.20a,  is  a  time-shifted  version  of  e  a ^  (shown  in  Fig.  3. 1 8a). 
From  Eqs.  (3.3 1 )  and  (3.32)  we  have 


Figure  3.20 

Effect  of  time 
shifting  on  the 
Fourier  spectrum 
°f  o  signal. 


e-a\t-to\ 


2  a 

a 2  +  (2  nf)2 


e-j2nfto 


(3.33) 


The  spectrum  of  £-al'-,ol  (Fig.  3.20b)  is  the  same  as  that  of  e~a^  (Fig.  3.18b),  except  for 
an  added  phase  shift  of  —2nfto  . 

Observe  that  the  time  delay  to  causes  a  linear  phase  spectrum  —InftQ.  This  example 
clearly  demonstrates  the  effect  of  time  shift. 
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Example  3.11  Show  that 


g(t  -T)  +  g(t  +  T)  <=>  2  •  G(f)  cos  27 tJT  (3.34) 


Figure  3.21 

Amplitude  modu¬ 
lation  of  a 
signal  causes 
spectral  shifting. 


This  follows  directly  from  Eqs.  (3.32a)  and  (3.32b). 


3.3.5  Frequency-Shifting  Property 

if 

5(0  «=►  G(f) 


then 


g(t)ej2*f0‘  <=*  G(f  -f0)  (3.35) 


This  property  is  also  called  the  modulation  property. 
Proof:  By  definition. 


F[g(t)eM»' ]  =  r  =  f°°  g(t)e~^f- =  G(f  -f0) 

J  00  J — 00 

rf™~  -TSr  Tltipl'Cati0n  °f  3  signal  by  a  factor  eJ2*h'  shifts  the  spectrum 
properties  ^  °  °'e  ^  dUahty  between  the  time-shifting  and  the  frequency-shifting 

Changing /o  to  -/0  in  Eq.  (3.35)  yields 


is  actovedby Tulti^v'in/  Tm  that  Can  be  generated'  frequency  shifting  in  practice 

P  y  ng  5(0  with  a  sinusoid.  This  can  be  seen  from  the  fact  that 

5(0  cos  2nf0t  =  I  [g(t)ej27If°'  +  g(t)e-j2*/ a'j 

From  Eqs.  (3.35)  and  (3.36),  it  follows  that 


£(/)COS  27r/()/ 


2  \.G(f  —fo)  +  G(f  +/o)] 


(3.37) 


spectrum  G(f)  of TsinLdd cos  fKqUency  fo  shiftS  the 

the  earner,  the  signal  g(t)  is  the  modulating  signal,  and  the 
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Figure  3.21 

Amplitude  modu¬ 
lation  of  a 
signal  causes 
spectral  shifting. 


signal  g(t )  cos  2nfot  is  the  modulated  signal.  Modulation  and  demodulation  will  be  discussed 
in  detail  in  Chapters  4  and  5. 

To  sketch  a  signal  g(r)  cos  2nfot,  we  observe  that 


g(t)  cos  2n f0t  = 


|*(0 

i  -g(t) 


when  cos  2nfyt  =  1 
when  cos  2 nfot  =  —  1 


Therefore.  g(t)  cos  2 nfot  touches  g(t)  when  the  sinusoid  cos  2nfot  is  at  its  positive  peaks  and 
touches  -g(t)  when  cos  2nfot  is  at  its  negative  peaks.  This  means  that  g(t)  and  -g(t)  act  as 
envelopes  for  the  signal  g{t)  cos  2nfyt  (see  Fig.  3.21c).  The  signal  -g(t)  is  a  mirror  image 
of  g(t)  about  the  horizontal  axis.  Figure  3.21  shows  the  signals  g(t),  g(t)  cos  2 nfot  and  their 
spectra. 


Shifting  the  Phase  Spectrum  of  a  Modulated  Signal 

We  can  shift  the  phase  of  each  spectral  component  of  a  modulated  signal  by  a  constant  amount 
do  merely  by  using  a  carrier  cos  (2nfot  +  Oo)  instead  of  cos  2^r/o/.  If  a  signal  g(t)  is  multiplied 
by  cos  (2nfot  +  Oo ).  then  using  an  argument  similar  to  that  used  to  derive  Eq.  (3.37),  we  can 
show  that 


g(t)  cos  (2nfot  +  Oo)  ■<=>  -  ^G(f  —fo)  +  G(/  +/o)  e  j 


(3.38) 
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Example  3.1 


Figure  3.22 

Example  of 
spectral  shifting 
by  amplitude 
modulation. 


For  a  special  case  when  60  =  -7t/2,  Eq.  (3.38)  becomes 

g(t)  sin  2jt/0/  <=>  [G(f  -f0)  e~jn/2  +  G(f  +/0)  eJn/2l  (3.39) 

Observe  that  sin  2nfnt  is  cos  2nfot  with  a  phase  delay  of  tt/2.  Thus,  shifting  the  carrier  phase 
by  tt/2  shifts  the  phase  of  every  spectral  component  by  tc/2.  The  signal  g(t)  sin  2nfot  and  its 
spectrum  are  shown  in  Figure  3.2  le  and  f. 


2  Find  and  sketch  the  Fourier  transform  of  the  modulated  signal  g(t)  cos  Info  in  which  g(t)  is 
a  rectangular  pulse  fl  (t/T),  as  shown  in  Fig.  3.22a. 


The  pulse  g(t)  is  the  same  rectangular  pulse  shown  in  Fig.  3. 10a  (with  r  =  T).  From  pair 
17  of  Table  3.1,  we  find  G(/),  the  Fourier  transform  of  g(r),  as 


n  T  sine  (nfT) 


This  spectrum  G(f)  is  shown  in  Fig.  3.22b.  The  signal  g(r)cos  2nf0t  is  shown  in 
Fig.  3.22c.  From  Eq.  (3.37)  it  follows  that 


g(t)  cos  27Tf0t  «=►  I  [Gif  +/o)  +  G(f  -fo)] 


/„  and ’idsoTot  he  rfehtb iSh°b,ain.ed  by  shiftin8  G(/>  in  Fig.  3.22b  to  the  left  by 
/o  also  the  nght  by/„  and  then  mult.plying  them  by  half,  as  shown  in  Fig.  3.22d. 


*(/) 


(a) 


Kin  cos  W0/ 
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Application  of  Modulation 

Modulation  is  used  to  shift  signal  spectra.  Spectrum  shifting  is  necessary  in  the  following 
situations  (there  are  others). 

1.  If  several  signals,  each  occupying  the  same  frequency  band,  are  transmitted  simul¬ 
taneously  over  the  same  transmission  medium,  they  will  all  interfere;  it  will  be  impossible 
to  separate  or  retrieve  them  at  a  receiver.  For  example,  if  all  radio  stations  decide  to  broad¬ 
cast  audio  signals  simultaneously,  a  receiver  will  not  be  able  to  separate  them.  This  problem  is 
solved  by  using  modulation,  whereby  each  radio  station  is  assigned  a  distinct  carrier  frequency. 
Each  station  transmits  a  modulated  signal,  thus  shifting  the  signal  spectrum  to  its  allocated 
band,  which  is  not  occupied  by  any  other  station.  A  radio  receiver  can  pick  up  any  station  by 
tuning  to  the  band  of  the  desired  station.  The  receiver  must  now  demodulate  the  received  signal 
(i.e.,  undo  the  effect  of  modulation).  Demodulation  therefore  consists  of  another  spectral  shift 
required  to  restore  the  signal  to  its  original  band.  Note  that  both  modulation  and  demodulation 
implement  spectral  shifting.  Consequently,  demodulation  operation  is  similar  to  modulation 
(see  Prob.  3.3-9).  This  method  of  transmitting  several  signals  simultaneously  over  a  channel 
by  using  its  different  frequency  bands  is  known  as  frequency  division  multiplexing  (FDM). 

2.  For  effective  radiation  of  power  over  a  radio  link,  the  antenna  size  must  be  on  the  order 
of  the  wavelength  of  the  signal  to  be  radiated.  Audio  signal  frequencies  are  so  low  (wavelengths 
are  so  large)  that  impractically  large  antennas  will  be  required  for  radiation.  Here,  shifting  the 
spectrum  to  a  higher  frequency  (a  smaller  wavelength)  by  modulation  solves  the  problem. 

Bandpass  Signals 

We  have  seen  that  if  gc(t )  and  gs(t)  are  low-pass  signals,  each  with  a  bandwidth  B  Hz  or  2 jtB 
rad/s,  then  the  signals  gc(t)  cos  2nfot  and  g.v(r)  sin  2nfot  are  both  bandpass  signals  occupying 
the  same  band  (Fig.  3.2  Id,  0,  and  each  having  a  bandwidth  of  2 B  Hz.  Hence,  a  linear  combi¬ 
nation  of  both  these  signals  will  also  be  a  bandpass  signal  occupying  the  same  band  as  that  of 
either  signal,  and  with  the  same  bandwidth  (2 B  Hz).  Hence,  a  general  bandpass  signal  gbp(0 
can  be  expressed  as* 


gbp(0  =  gcU)  cos  2jr/0/  +  gs(t)  sin  2 nf0t  (3.40) 

The  spectrum  of  gbp(0  is  centered  at  ±/o  and  has  a  bandwidth  2 B,  as  shown  in  Fig.  3.23b. 
Although  the  magnitude  spectra  of  both  gc(t)  cos  2nfot  and  gs(t)  sin  2nfot  are  symmetrical 
about  ±/o,  the  magnitude  spectrum  of  their  sum,  gbp(0.  is  not  necessarily  symmetrical  about 
±/o.  This  is  because  the  amplitudes  of  the  two  signals  do  not  add  directly  with  different  phases 
for  the  reason  that 


a\e ^  («i  +  ci2)e^'Pl+<P2^ 

A  typical  bandpass  signal  gbp(0  and  its  spectra  are  shown  in  Fig.  3.23.  Using  a  well-known 
trigonometric  identity,  Eq.  (3.40)  can  be  expressed  as 

gbp(r)  =  £(/)  cos  [2nf0t  +  \fr(t)]  (3.41 ) 


*  See  Sec.  9.9  for  a  rigorous  proof  of  this  statement. 
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I  «*</>! 


where 


-  +\fgc(0  +g}(t) 

M)  =  -  tan-1 

Lfc(f)J 


(3.42a) 

(3.42b) 


£(0  is  nonne^tivef[Ec^3^a)tSitSfollow  T  ^  ^  "  also  ^-pass  signals.  Because 
envelope  and  ^(0  isaslowly  va^ingphaseoflheha  <341>  ,h“  's  *  *»■>  'arying 

of  .he  time-varying 

about  the  center  frequency /0.  H  Y  "  S'tiusoid  also  vanes  slowly*  with  time 


Example  3.13  Find  the  Fourier  transform  of  a  general  neriodir  •  ,  - 

•he  Fonder  Won.  „f  ,de  periodic  |mpulsc 


A  periodic  signal  g(t)  can  be  expressed 


as  an  exponential  Fourier  series 


as 


g(0  =  E  D«  ejn2nf0t  ^  __  1 
"*-00  ~~  T0 


Therefore, 


00 

£(')<=>  F[Dn  ejn2*fo<] 

n=-oo 

Now  from  Eq.  (3.22a),  it  follows  that 


*(')*=>  E  D»S(f-nf0) 

n=— oo 


(3.43) 


*— •'a-vaiisi-ss*, 
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Figure  3.24 

(a)  Impulse  train 
and  (b)  its 
spectrum. 


A  A  A  A  A  A  A  A  A 


£(/) 


G(/)  =  7b  6/o(/) 


A  A  A 


-37b  -7b  0  27b  47b 


-2/o  -/o  0  f0  2/o 


(a) 


(b) 


Equation  (2.100)  shows  that  the  impulse  train  <S70(r)  can  be  expressed  as  an  exponential 
Fourier  series  as 


Here  Dn  =  \/Tq.  Therefore,  from  Eq.  (3.43), 


T0  Sf,(f)  f°  To 


(3.44) 


Thus,  the  spectrum  of  the  impulse  train  also  happens  to  be  an  impulse  train  (in  the 
frequency  domain),  as  shown  in  Fig.  3.24b. 


3.3.6  Convolution  Theorem 

The  convolution  of  two  functions  g(t)  and  w(t),  denoted  by  g(r)  *  w(r),  is  defined  by  the 
integral 


The  time  convolution  property  and  its  dual,  the  frequency  convolution  property,  state 
that  if 


£i(0  <=>  G\(f)  and  g2(t)  <=>  G2(f) 


then  (time  convolution) 


gl(U  *£’(0  <=>  Gi(f)G2(f) 


(3.45) 


and  (frequency  convolution) 


£|(0#2(0  <=>■  G\(f)  *  G2 (/) 


(3.46) 
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These  two  relationships  of  the  convolution  theorem  state  that  convolution  of  two  signals  in  the 
time  domain  becomes  multiplication  in  the  frequency  domain,  whereas  multiplication  of  two 
signals  in  the  time  domain  becomes  convolution  in  the  frequency  domain. 

Proof:  By  definition. 


*>*' (,)  *  *<»>l  =  /“  ^2">  [/”  -  rwrl  dt 

r  c°° 

£i(*)  /  e 

00  U-oo 

The  inner  integral  is  the  Fourier  transform  of  2i(t  —  t \  aiv^n  k*,  rr  • 

(3.32a)]  G2(f)e-j2nfx.  Hence.  ’ g  by  tt,me"shlfting  property  in  Eq, 


-jlnfi82(t-x)d^dr 


Flgl(t)  * gl(t)] 


/OO 

g,(T)e~j2^zG2(f)dT 

•00 

G2{f)j_J^~j2nfTdr  =  Gl(/)G2(/) 


The  frequency  convolution  property  (3.46)  can  be  proved 
the  roles  of  g(t)  and  G(f).  F 


in  exactly  the  same  way  by  reversing 


Bandwidth  of  the  Product  of  Two  Signals 

?i  (0  and  g2(t)  have  bandwidths  fi.  and  B-,  h7  r«  ,•  . 

B\  +  B2  Hz.  This  result  follows  from  the  aoDlicatin^fT  bandwidth  of£l0)S2(0  is 
to  Eq.  (3.46).  This  property  states  that  the  width  of  "  °  '  C  Wldth  proPerty  of  convolution3 
Consequently,  if  the  bandwidth  of  g(t)  js  »  uv  *  f  X*yJS  ,he.sum  of  the  widths  of  x  and.y. 


~  aiaiw  ine  WKJth  of  v  *  *  *  - ~ - 

Consequently,  if  the  bandwidth  of «?(/)  k  ft  u,  1  ’  the  SUm  of  the  widths  of  jr  an< 

bandwidth  of  gn(i)  js  nB  Hz.*  ’  en  the  bandwidth  of  g2(t)  js  2B  Hz,  and 


the 


Example  3.14  Use  the  time  convolution 


property  to  show  that  if 


then 


g(t)  «=»  G(f) 


|  Because 


f  g(x)dx  <=>  ^£}_ 
J~°°  j2nf 


^O(0)S(f) 


u(t  -  r)  =  1 1  r  ^  t 
|0  r  >  t 


(3.47) 


>*f  two  functions^  anowra*-  d0eS  hold  in  some  pathological  cases— f 

however,  the  property  holds  even  in  ihi  Wh!n  bo,h  funct'ons  are  nonzero  fe  example-  when  the  convoluti 

*““*  *•  ■**  *■■■  «*  «~«2iT4r  ,he  '• »'  «l  i35t£:  str 
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it  follows  that 

g(t)*u(t)=  f  g(T)u(t -T)dz  =  f  g(z)dz 
J-oc  J- 00 

Now  from  the  time  convolution  property  [Eq.  (3.45)],  it  follows  that 
g(t)*u(t)  <=>  G(f)U(f) 

=cH^+H 

G(f)  1 

-  tk  +  iGm^ 

In  deriving  the  last  result  we  used  pair  11  of  Table  3.1  and  Eq.  (2.18a). 
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3.3.7  Time  Differentiation  and  Time  Integration 

if 


g(t)  G(/) 

then  (time  differentiation)* 

dg(t) 

<=*j2nfG(f)  (3.48) 

and  (time  integration) 

J  g(z)dz  <=>  +  ^G(0)<$(/)  (3.49) 

Proof:  Differentiation  of  both  sides  of  Eq.  (3.9b)  yields 

—p-  =  HjlM^df 

dt  J- oo 

This  shows  that 


dg(t) 

dt 


^j2nfG(f) 


Repeated  application  of  this  property  yields 


dng(t) 

dtn 


(j2jzf)nG(f) 


(3.50) 


The  time  integration  property  [Eq.  (3.49)]  already  has  been  proved  in  Example  3.14.  ■ 


Valid  only  if  the  transform  of  dg(t)/dt  exists. 


1 22  ANALYSIS  AND  TRANSMISSION  OF  SIGNALS 


Example  3.15 


Figure  3.25 

Using  the  time 
differentiation 
property  to  find 
the  Fourier 
transform  of  a 
piecewise-l  inear 
signal. 


Use  the  time  differentiation  property  to,  find  the  Fourier  transform  of 
A(r/r)  shown  in  Fig.  3.25a. 


the  triangular  pulse 


~4_  \ 

* 


in  Fig.  3.25b  and  c.  The  second  derivativ^consrststfr!"11316  *  SUfCcessive|y’  as  shown 
in  Fig.  3.25c.  Recall  that  the  derivative  of  »  c  ■  secluence  of  impulses,  as  shown 
of  strength  equal  to  the  amount  of  iunm  The  f^”3-  3t  3  JUmP  ^*scontinuity  is  an  impulse 
- '  =  and  a  =  0  1 JSC* has  *  P"*"  *»p  w 

d2gU )  2  r  /  r, 

dt 2  ~7r(,  +  2)~2*('>  +  3('-^)] 

From  the  time  differentiation  property  (3.50), 

“  W)2G(/)  = 

Also,  from  the  time-shifting  property  [Eq.  (3.32a)], 


(3.51) 


d2g 
dt 2 


(3.52a) 


Ht  -  t0)  <=>  e-j2nfio 


(3.52b) 


Taking  the  Fourier  transform  of  Eg  n  sn  a  • 

'  )  3  US,"g  the  results  in  Eq.  (3.52),  we  obtain 

U2ttf)2G(f)  =  2  Ljxfr  _  2  4 

rV  j  =  ;(cos^/r-i)  =  _!sin2  Mr\ 
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TABLE  3.2 

Properties  of  Fourier  Transform  Operations 


Operation 


g(t ) 


G{f) 


Superposition 
Scalar  multiplication 
Duality 

Time  scaling 
Time  shifting 
Frequency  shifting 
Time  convolution 
Frequency  convolution 

Time  differentiation 


g«  ~  to)  G{f)e~j2jTfi o 

g(t)ejMoi  G(/  -/0) 

Sl(f)*g2(')  Gi(/)G2(/) 


*(a/) 


gl(t)  +  g2(‘)  G|(/)  +  G2(/) 

kgU)  kG(f) 

G(t)  g(-f) 


(j2nf)"G(f) 


Time  integration 


/-  00  Six)  dx  ^£1  +  ^G(0)8(f) 


and 


r  •  2 
=  -  sine 

2 


The  spectrum  G(f)  is  shown  in  Fig.  3.25d.  This  procedure  of  finding  the  Fourier 
transform  can  be  applied  to  any  function  g{t)  made  up  of  straight-line  segments  with 

g(r)-»  Oas  |/|  -»•  oo.  The  second  derivative  of  such  a  signal  yields  a  sequence  of  impulses 

whose  Fourier  transform  can  be  found  by  inspection.  This  example  suggests  a  numerical 
method  of  finding  the  Fourier  transform  of  an  arbitrary  signal  g(t)  by  approximating  the 
signal  by  straight-line  segments. 


To  provide  easy  reference,  several  important  properties  of  Fourier  transform  are  summa¬ 
rized  in  Table  3.2. 


3.4  SIGNAL  TRANSMISSION  THROUGH 
A  LINEAR  SYSTEM 


A  linear  time-invariant  (LTI)  continuous  time  system  can  be  characterized  equally  well  in  either 
the  time  domain  or  the  frequency  domain.  The  LTI  system  model,  illustrated  in  Fig.  3.26,  can 
often  be  used  to  characterize  communication  channels.  In  communication  systems  and  in 
signal  processing,  we  are  interested  only  in  bounded-input-bounded-output  (BIBO)  stable 
linear  systems.  Detailed  discussions  on  system  stability  are  treated  in  the  textbook  by  Lathi.3 
A  stable  LTI  system  can  be  characterized  in  the  time  domain  by  its  impulse  response  h(t), 
which  is  the  system  response  to  a  unit  impulse  input,  that  is, 


y(t)  =  hit)  when  x(t)  =  S(/) 


The  system  response  to  a  bounded  input  signal  x(t)  follows  the  convolutional  relationship 


y(t)  =  h(t)  *  x(r) 


(3.54) 


1 24  ANALYSIS  AND  TRANSMISSION  OF  SIGNALS 


Figure  3.26 

Signal  trans¬ 
mission  through 
a  linear 
time-invariant 
system. 


Input  signal 


Output  signal 


Time-domain 

MO 

LTI  system 

y(t)  = 

hU) 

Frequency-domain 

X(f ) 

//(/) 

y  if)  — 

The  frequency  domain  relationship  between  the  input  and  the  output  is  obtained  by  taking 
Fourier  transform  of  both  sides  of  Eq.  (3.54).  We  let 

*(f  )«=>*(/) 

y(t )  ^  y (/) 

h(t)  «=►  H(f) 


Then  according  to  the  convolution  theorem,  Eq.  (3.54)  becomes 


Y(f)  =  H{f)-X(f)  (3.55) 

The  Fourier  transform  of  the  impulse  response  h(t),  given  as  //(/),  is  generally  referred 
to  as  the  transfer  function  or  the  frequency  response  of  the  LT1  system.  In  general,  H  (/)  is 
complex  and  can  be  written  as 


H{f)  =  \H(f)\eJW) 


3.4.1  Signal  Distortion  during  Transmission 

SSn1 r sisnal ,hrough  a  sys,em  « *«>  ^  ^  ^  m 

r  u  (  S  C  nature  l^ls  change  or  modification.  Here  X ( f )  and  Y(f)  are  the 

of  the  system  Equation  (Tssi  h  ^  mPUt  Spectrum  mu,tiphed  by  the  spectral  response 

signal  by  ,he  systtm.  Equation  (3 TsU  te  m0d,flCa,i°n)  °f ' “ 

\Y(f)\ej6>{f)  =  \X{f)\\H(f)\e^d^+W)] 


Therefore,  we  have  the  amplitude  and  phase  relationships 


l^(/)l  =  |X (/)|  \H (/)| 

oy(f)  =  ex(f)  +  eh(f) 


(3.56a) 

(3.56b) 


‘-'Ml  MIL;  | 


P  ^e  spectrum  0x(f)  is  changed  to  9x(f)  +  6h{f). 
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An  input  signal  spectral  component  of  frequency  /  is  modified  in  amplitude  by  a  factor 
\H (J  )|  and  is  shifted  in  phase  by  an  angle  #/,(/)•  Clearly,  | H (/) |  is  the  amplitude  response, 
and  0h(f  )  is  the  phase  response  of  the  system.  The  plots  of  \H(f)\  and  <?,,(/)  as  functions  of 
f  show  at  a  glance  how  the  system  modifies  the  amplitudes  and  phases  of  various  sinusoidal 
inputs.  This  is  why  H  (/)  is  called  the  frequency  response  of  the  system.  During  transmission 
through  the  system,  some  frequency  components  may  be  boosted  in  amplitude,  while  others 
may  be  attenuated.  The  relative  phases  of  the  various  components  also  change.  In  general,  the 
output  waveform  will  be  different  from  the  input  waveform. 

3.4.2  Distortionless  Transmission 


In  several  applications,  such  as  signal  amplification  or  message  signal  transmission  over  a 
communication  channel,  we  require  the  output  waveform  to  be  a  replica  of  the  input  waveform. 
In  such  cases,  we  need  to  minimize  the  distortion  caused  by  the  amplifier  or  the  communication 
channel.  It  is  therefore  of  practical  interest  to  determine  the  characteristics  of  a  system  that 
allows  a  signal  to  pass  without  distortion  (distortionless  transmission). 

Transmission  is  said  to  be  distortionless  if  the  input  and  the  output  have  identical  wave 
shapes  within  a  multiplicative  constant.  A  delayed  output  that  retains  the  input  waveform  is  also 
considered  distortionless.  Thus,  in  distortionless  transmission,  the  input  x(f)  and  the  output 
y(t)  satisfy  the  condition 


I 


y(t)  =  k  ■  x(t  —  td) 

The  Fourier  transform  of  this  equation  yields 


But  because 


Y(f)  =  kX{f)e-^f“ 


Y(f)=X(f)H(f) 


we  therefore  have 


(3.57) 


H(f)  =  k  e-j27tftd 

This  is  the  transfer  function  required  for  distortionless  transmission.  From  this  equation  it 
follows  that 


\H(f)\=k  (3.58a) 

0h(f)  =  -2jrftd  (3.58b) 


This  shows  that  for  distortionless  transmission,  the  amplitude  response  \H(f)\  must  be  a 
constant,  and  the  phase  response  9h(f)  must  be  a  linear  function  of /  going  through  the  origin 
/  =  0,  as  shown  in  Fig.  3.27.  The  slope  of  Oh  (/)  with  respect  to  the  angular  frequency  co  =  Inf 
is  —  tdy  where  td  is  the  delay  of  the  output  with  respect  to  the  input.* 
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Figure  3.27 

Linear  time- 
invariant  system 
frequency 
response  for 
distortionless 
transmission. 


\m\ 

0 

All-Pass  vs.  Distortionless  System 

In  circuit  analysis  and  filter  designs,  we  sometimes  are  mainly  concerned  with  the  gain  of  a 
system  response.  An  all-pass  system  is  one  that  has  a  constant  gain  for  all  frequencies  [i.e., 
|//(/)|  =  k\,  without  the  linear  phase  requirement.  Note  from  Eq.  (3.58)  that  a  distortionless 
system  is  always  an  all-pass  system,  whereas  the  converse  is  not  true.  Because  it  is  very 
common  for  beginners  to  be  confused  by  the  difference  between  all-pass  and  distortionless 
systems,  now  is  the  best  time  to  clarify. 

To  see  how  an  all-pass  system  may  lead  to  distortion,  let  us  consider  an  illustrative  example. 
Imagine  that  we  would  like  to  transmit  a  recorded  music  signal  from  a  violin-cello  duet.  The 
violin  contributes  to  the  high-frequency  part  of  this  music  signal  while  the  cello  contributes  to 
the  bass  part.  When  transmitting  this  music  signal  through  a  particular  all-pass  system,  both 
parts  have  the  same  gain.  However,  suppose  that  this  all-pass  system  would  cause  a  1 -second 
extra  delay  on  the  high-frequency  content  of  the  music  (from  the  violin).  As  a  result,  the 
audience  on  the  receiving  end  will  hear  a  “music”  signal  that  is  totally  out  of  sync  even  though 
all  signal  components  have  the  same  gain  and  are  all  present.  The  difference  in  transmission 
delay  tor  different  frequency  components  is  contributed  by  the  nonlinear  phase  of  H(f)  in  the 
all-pass  filter. 

To  be  more  precise,  the  transfer  function  gain  \H(f)\  determines  the  gain  of  each  input 
trequency  component,  whereas  ZH(f  )  determines  the  delay  of  each  component.  Imagine  a 
system  input  x(t)  consisting  of  a  sum  of  multiple  sinusoids  (its  spectral  components).  For  the 
output  signal  y(t)  to  be  distortionless,  it  should  be  the  input  signal  multiplied  by  a  gain  k 
and  delayed  by  td.  To  synthesize  such  a  signal,  it  would  be  necessary  fory(r)  to  have  exactly 
the  same  components  as  *(f),  with  each  component  multiplied  by  k  and  delayed  by  td.  This 
means  that  the  system  transfer  function  //(/)  should  be  such  that  each  sinusoidal  component 
encounters  the  same  gam  (or  loss)  k  and  each  component  undergoes  the  same  time  delay  of  td 
seconds.  The  first  condition  requires  that 


\H(f)\=k 


We  saw  earlier  (Sec.  3.3)  that  to  achieve  the  same  time  HpUv.  t 

"I— . -  delay  2*  (Fig.  3 I  l"h 

Oh(f)  =  —2 nftd 

*  only  approximately 

function  of  frequency.  This  slope  can  be  a  function  off  intlT  ?  °'  T  ^  °f  3S  3 

j  in  the  general  case  and  is  given  by 


tAf)  =  _L.d_w> 

2n  df 


(3.59) 
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It  the  slope  of  Oh  is  constant  (i.e.,  if  0h  is  linear  with  respect  to /),  all  the  components  are 
delayed  by  the  same  time  interval  tj.  But  if  the  slope  is  not  constant,  then  the  time  delay  t(j 
varies  with  frequency.  This  means  that  components  of  different  frequencies  undergo  different 
amounts  of  time  delay,  and  consequently  the  output  waveform  will  not  be  a  replica  of  the 
input  waveform  (as  in  the  example  of  the  violin-cello  duet).  For  a  signal  transmission  to  be 
distortionless,  td(f)  should  be  a  constant  tj  over  the  frequency  band  of  interest.* 

Thus,  there  is  a  clear  distinction  between  all-pass  and  distortionless  systems.  It  is  a  common 
mistake  to  think  that  flatness  of  amplitude  response  |  H  (/)|  alone  can  guarantee  signal  quality. 
A  system  may  have  a  flat  amplitude  response  and  yet  distort  a  signal  beyond  recognition  if  the 
phase  response  is  not  linear  (tj  not  constant). 


The  Nature  of  Distortion  in  Audio  and  Video  Signals 

Generally  speaking,  the  human  ear  can  readily  perceive  amplitude  distortion,  although  it  is 
relatively  insensitive  to  phase  distortion.  For  phase  distortion  to  become  noticeable,  the  vari¬ 
ation  in  delay  (i.e.,  in  the  slope  of  Oh)  should  be  comparable  to  the  signal  duration  (or  the 
physically  perceptible  duration,  in  case  the  signal  itself  is  long).  In  the  case  of  audio  signals, 
each  spoken  syllable  can  be  considered  to  be  an  individual  signal.  The  average  duration  of 
a  spoken  syllable  has  a  magnitude  on  the  order  of  0.01  to  0.1  second.  Audio  systems  may 
have  nonlinear  phases,  yet  no  noticeable  signal  distortion  results  because  in  practical  audio 
systems,  the  maximum  variation  in  the  slope  of  0h  is  only  a  small  fraction  of  a  millisecond.  This 
is  the  real  reason  behind  the  statement  that  “the  human  ear  is  relatively  insensitive  to  phase 
distortion.4  As  a  result,  the  manufacturers  of  audio  equipment  make  available  only  \H(f)\,  the 
amplitude  response  characteristic  of  their  systems,  in  published  product  specifications. 

For  video  signals,  on  the  other  hand,  the  situation  is  exactly  the  opposite.  The  human 
eye  is  sensitive  to  phase  distortion  but  is  relatively  insensitive  to  amplitude  distortion.  The 
amplitude  distortion  in  television  signals  manifests  itself  as  a  partial  destruction  of  the  rel¬ 
ative  half-tone  values  of  the  resulting  picture,  which  is  not  readily  apparent  to  the  human 
eye.  The  phase  distortion  (nonlinear  phase),  on  the  other  hand,  causes  different  time  delays 
in  different  picture  elements.  This  results  in  a  smeared  picture,  which  is  readily  apparent 
to  the  human  eye.  Phase  distortion  is  also  very  important  in  digital  communication  systems 
because  the  nonlinear  phase  characteristic  of  a  channel  causes  pulse  dispersion  (spreading 
out),  which  in  turn  causes  pulses  to  interfere  with  neighboring  pulses.  This  interference 
can  cause  an  error  in  the  pulse  amplitude  at  the  receiver:  a  binary  1  may  read  as  0,  and 
vice  versa. 


*  Figure  3.27  shows  that  for  distortionless  transmission,  the  phase  response  not  only  is  linear  but  also  must  pass 
through  the  origin.  This  latter  requirement  can  be  somewhat  relaxed  for  bandpass  signals.  The  phase  at  the  origin 
may  be  any  constant  [#*(/)  =  flo  —  2 nfij  or  <4(0)  =  0q].  The  reason  for  this  can  be  found  in  Eq.  (3.38),  which 
shows  that  the  addition  of  a  constant  phase  to  a  spectrum  of  a  bandpass  signal  amounts  to  a  phase  shift  of  the 
carrier  by  %  The  modulating  signal  (the  envelope)  is  not  affected.  The  output  envelope  is  the  same  as  the  input 
envelope  delayed  by 


1  dOH(f) 

8  2 7T  df 

called  the  group  delay  or  envelope  delay,  and  the  output  carrier  is  the  same  as  the  input  carrier  delayed  by 

,  <W) 

p  2irf 


called  the  phase  delay,  where /o  is  the  center  frequency  of  the  passband. 
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Example  3.16  If  g(/)  and  y(/)  are  the  input  and  the  output,  respectively,  of  a  simple  RC  low-pass  filter 
(Fig.  3.28a),  determine  the  transfer  function  //(/)  and  sketch  |//(/)|,  6^(/),  and  /</(/).  For 
distortionless  transmission  through  this  filter,  what  is  the  requirement  on  the  bandwidth  of  g(t) 
if  amplitude  response  variation  within  2%  and  time  delay  variation  within  5%  are  tolerable? 
What  is  the  transmission  delay?  Find  the  output  y(t). 


Application  of  the  voltage  division  rule  to  this  circuit  yields 

X/jlnfC  1 


where 


Hence, 


H(f)  = 


R  +  (l/jlnfC)  1  +j2nfRC  a  +  j2nf 


a  =  —  =  106 
RC 


\H(f)\  = 


y/a2  +  (2tt/)2 


~  1 


Finally,  the  time  delay  is  given  by  [Eq.  (3.59)] 


|2jt/|  «  a 
|2jt/|  «;  a 


td(f)  =  - 


d&h  _  a  1 

d  (2 nf)  ~  (2^/)2  +  a2  -  -  =  10  6 


|2jt/|  «  a 


delav™!  PlT  T**  Charac,eris,ics  «  given  in  Fig.  3.28b.  The  time 

y  functl0n  off  is  shown  in  Fig.  3.28c.  For  |2tt/|  «  a  (a  -  106)  the 

tt and  ,he  phase  «  * X* 

meanly  results  in  a  constant  time  delay  characteristic.  The  filter  therefore  can  transmit 
low-trequency  s.gnals  with  negligible  distortion 

5%  «£££  U^hTh”  /T"  Wid,in  2%  “d  ,ime  *'»’  variation  within 

and  time  delay  both  at  maximum  when/  =  ()  and  “  3  'P3SS  fi'ter  Wkh  8ain 


|//(0)l  =  1  and  td( 0)  =  -  second 

a 

Therefore,  l«</„)|  >  0,98  and  Wo)  >  0.95 /a,  so  that 


|W(/o)l  = 


Wo)  = 


a 

7(2jt/o)2  +  a2  ~  0  98  ^  2nf°  -  0  203a  =  203, 000  rad/s 


0.95 


12T/o)2  +  a2  ~  a  ^  -  0-2294a  =  229,400  rad/s 


Figure  3.28 

(a)  Simple  RC 
filter,  (b)  Its 
frequency 
response  and 
time  delay. 
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The  smaller  of  the  two  values,/)  =  32.3 1  kHz,  is  the  highest  bandwidth  that  satisfies  both 
constraints  on  \H(f)\  and  td. 

The  time  delay  td  ^  \ /a  =  \0  6  s  =  1  /zs  over  this  band  (see  Fig.  3.28c).  Also  the 
amplitude  response  is  almost  unity  (Fig.  3.28b).  Therefore,  the  output  y(t)  *g(t-  10"6). 


/?=  103 
+  o - VVA/ — 


git) 


-  o- 


-o  + 


c=  10- Vo 


(a) 


3.5  IDEAL  VERSUS  PRACTICAL  FILTERS 

Ideal  filters  allow  distortionless  transmission  of  a  certain  band  of  frequencies  and  suppress 
all  the  remaining  frequencies.  The  ideal  low-pass  filter  (Fig.  3.29),  for  example,  allows  all 
components  below/  =  B  Hz  to  pass  without  distortion  and  suppresses  all  components  above 
/  =  B.  Figure  3.30  shows  ideal  high-pass  and  bandpass  filter  characteristics. 

The  ideal  low-pass  filter  in  Fig.  3.29a  has  a  linear  phase  of  slope  —  td%  which  results  in  a 
time  delay  of  td  seconds  for  all  its  input  components  of  frequencies  below  B  Hz.  Therefore,  if 
the  input  is  a  signal  g(t)  band-limited  to  B  Hz,  the  output  y(t)  is  g(t)  delayed  by  td,  that  is, 


y(t)  =  g(t  -  td) 
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Figure  3.29 

(a)  Ideal 
low-pass  filter 
frequency 
response  and 

(b)  its  impulse 
response. 


Figure  3.30 

Ideal  high-pass 
and  bandpass 
filter  frequency 
responses. 


. 

|H(/)I 

0 

/-> 

-.../v/) 

-fo 

0 

Jo 

f-+ 

The  signal  g(t)  is  transmitted  by  this  system  without  distortion,  but  with  time  delay  td. 
For  this  filter  \H(f)\  =  U(f/2B),  and  dh(f)  =  -2nftd,  so  that 


H(f) 


-"(a) 


(3.60a) 


responsc  H,)  of  ,h,s  m,er  is  found  from  pa,r  18  in  Tab,e  3-‘  ^ ,he 


time- 


—  j- 


n 

L  V2 

J 

=  2Bsinc[2nB(t-td)]  (3  (, 

3.29b  showsacurioILfeKb'lhe^po^e'/rt'o'bee^  'nPU1 8<',;wh'cl' is  aPPlied  at '  =  Fif 
Clearly,  .he  filler  Is  noncausal  and  iherelbre  'T  k  apP“ed  (“ '  = 

impossible,  since  no  sensible  cV>.»e™  ..  .  Zd  .  ’ that  1S>  such  a  system  is  physic 

Similarly,  one  can  show  that  other  ideal  S (suchaTtZd  “  “  aPPUed  ‘°  SySt 

filters  shown  in  Fig.  3.30)  are  also  physically  unrealizable  8  ^  ^ ^  ^  ^ 
For  a  physically  realizable  system,  h(t)  must  be  causal;  that  is, 


h{t)  =  0  for  t  <  0 
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Figure  3.31 

Approximate 
realization  of  an 
ideal  low-pass 
filter  by  trun¬ 
cating  its  impulse 
response. 


In  the  frequency  domain,  this  condition  is  equivalent  to  the  Paley-Wiener  criterion,  which 
states  that  the  necessary  and  sufficient  condition  for  \H (f)\  to  be  the  amplitude  response  of  a 
realizable  (or  causal)  system  is* 


|ln  \H(f)\  | 
1  +  (2  tt/)2 


df  <  oo 


(3.61) 


If  H (/)  does  not  satisfy  this  condition,  it  is  unrealizable.  Note  that  if  |//(/)|  =  0  over  any 
finite  band,  |  !n|// (/)||  =  oo  over  that  band,  and  the  condition  (3.61)  is  violated.  If,  however, 
H(f)  =  0  at  a  single  frequency  (or  a  set  of  discrete  frequencies),  the  integral  in  Eq.  (3.61) 
may  still  be  finite  even  though  the  integrand  is  infinite.  Therefore,  for  a  physically  realizable 
system,  H (/)  may  be  zero  at  some  discrete  frequencies,  but  it  cannot  be  zero  over  any  finite 
band.  According  to  this  criterion,  ideal  filter  characteristics  (Figs.  3.29  and  3.30)  are  clearly 
unrealizable. 

The  impulse  response  h(t)  in  Fig.  3.29  is  not  realizable.  One  practical  approach  to  filter 
design  is  to  cut  off  the  tail  of  h(t)  for  t  <  0.  The  resulting  causal  impulse  response  h(t),  where 


h(t)  =  h(t)u(t) 


is  physically  realizable  because  it  is  causal  (Fig.  3.30.  If  tj  is  sufficiently  large,  H(t)  will  be  a 
close  approximation  of  /t(r),  and  the  resulting  filter  //(/)  will  be  a  good  approximation  of  an 
ideal  filter.  This  close  realization  of  the  ideal  filter  is  achieved  because  of  the  increased  value 
of  time  delay  tj.  This  means  that  the  price  of  closer  physical  approximation  is  higher  delay  in 
the  output;  this  is  often  true  of  noncausal  systems.  Of  course,  theoretically  a  delay  ^  =  oo  is 
needed  to  realize  the  ideal  characteristics.  But  a  glance  at  Fig.  3.29b  shows  that  a  delay  tj  of 
three  or  four  times  1  /2 B  will  make  h(t)  a  reasonably  close  version  of  h(t  —  td).  For  instance, 
audio  filters  are  required  to  handle  frequencies  of  up  to  20  kHz  (the  highest  frequency  the 
human  ear  can  hear).  In  this  case  a  tj  of  about  10-4  (0. 1  ms)  would  be  a  reasonable  choice. 
The  truncation  operation  [cutting  the  tail  of  h(t)  to  make  it  causal],  however,  creates  some 
unsuspected  problems  of  spectral  spread  and  leakage,  and  can  be  partly  corrected  by  truncating 
h(t)  gradually  (rather  than  abruptly)  using  a  tapered  window  function.5 

In  practice,  we  can  realize  a  variety  of  filter  characteristics  to  approach  ideal  characteris¬ 
tics.  Practical  (realizable)  filter  characteristics  are  gradual,  without  jump  discontinuities  in  the 
amplitude  response  \H(f)\.  For  example,  Butterworth  filters,  which  are  used  extensively  in 


*  \H(f)\  is  assumed  to  be  square  integrable.  that  is. 


\H(f)\2df 


is  assumed  to  be  finite. 
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Figure  3.32 

Butterworth  filter 
characteristics. 


various  applications  including  practical  communication  circuits,  have  an  amplitude  response 


\H(f)\  = 


Vi+a/s)2" 


These  characteristics  are  shown  in  Fig.  3.32  for  several  values  of  n  (the  order  of  the  filter). 
Note  that  the  amplitude  response  approaches  an  ideal  low-pass  behavior  as  n  —  oo 

The  half-power  bandwidth  of  a  filter  is  defined  as  the  bandwidth  over  which  the  amplitude 
response  \H(f)\  remains  constant  within  variations  of  3  dB  (or  a  ratio  of  1/^2,  i.e..  0.707). 
Figure  3.32  shows  that  for  all  n,  the  Butterworth  filter  (half-power)  bandwidth  is  B  Hz.  The 
la  I -power  bandwidth  of  a  low-pass  filter  is  also  called  the  cutoff  frequency  Figure  3  33 
shows  \H(f)\,  and  h(t)  for  the  case  of  n=4.  ‘  8 

It  should  be  remembered  that  the  magnitude  \H(f)\  and  the  phase  dh(f)  of  a  system  are 

^ Lff  ,'S kT  Can"at  ChOOSe  |//(/)l  a"d  9h(f  )  indePendently  as  we  please.  A 

t  to  perfect  °\mn I  \etWeen  K  Cal  magnitude  and  ideal  phase  characteristics.  The  more  we 
try  to  perfect  \H(f)\.  the  more  9h(f)  deviates  from  the  ideal,  and  vice  versa  As  n  ^  oo  the 
amplitude  response  approaches  the  ideal  (low  nas«t  h.,t  .i  .•  J'  AS  ” 

badly  distoned  in  .he^cini.y  ,he  'S 

3.5.1  Digital  Filters 

Analog  signals  can  also  be  processed  by  digital  means  tA/n  .  .  . 

sampling,  quantizing,  and  coding.  The  reLting  digTal  s  IT  Thls  ,nvolv*S 

special-purpose  digital  computer  designed  to  convert  the  Timm  PTOCeSSed  by  3  Sm 

sequence.  The  output  sequence  is  converted  back  into  the  ,  PUt  Sequfnce  lnto  a  desired  outPut 
digital  computer  can  used  a  special  algorithm  to  achieve  a  lTv/"  8  T*'' ^  prOCeSS'ng 
pass,  bandpass,  or  high-pass  filtering)  Figure  3  M  n  8  S,gnal  °Perat,on  low‘ 
digital  filter  consisting  of  an  A/D  converter,  a  computational  d.  ^  t1^3"1  °f  3  typiCa‘ 

or  a  digital  signal  processor  (DSP),  and  a  D/A  converter.  *  ’  Wh’Ch  C3"  be  3  comPuter 

Digital  processing  of  analog  signals  has  several  advantages  A  email  •  • 

^'C,CM  f~  «■  -  -  «  ofdS^TiXS.'rrn 
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Figure  3.33 

Comparison  of 
Butterworth  filter 
(/i  =  4)  and  an 
ideal  filter. 


Figure  3.34 

Basic  diagram  of 
a  digital  filter  in 
practical  applica¬ 
tions. 


considerably  lower  than  that  of  its  analog  counterpart.  The  accuracy  of  a  digital  filter  is  depen¬ 
dent  only  on  the  computer  word  length,  the  quantizing  interval,  and  the  sampling  rate  (aliasing 
error).  Digital  filters  employ  simple  elements,  such  as  adders,  multipliers,  shifters,  and  delay 
elements,  rather  than  RLC  components  and  operational  amplifiers.  As  a  result,  they  are  gen¬ 
erally  unaffected  by  such  factors  as  component  accuracy,  temperature  stability,  and  long-term 
drift  that  afflict  analog  filter  circuits.  Also,  many  of  the  circuit  restrictions  imposed  by  physical 
limitations  of  analog  devices  can  be  removed,  or  at  least  circumvented,  in  a  digital  processor. 
Moreover,  filters  of  a  high  order  can  be  realized  easily.  Finally,  digital  filters  can  be  modified 
simply  by  changing  the  algorithm  of  the  computer,  in  contrast  to  an  analog  system,  which  may 
have  to  be  physically  rebuilt. 

The  subject  of  digital  filtering  is  somewhat  beyond  our  scope  in  this  book.  Several  excellent 
books  are  available  on  the  subject.3 


3-6  SIGNAL  DISTORTION  OVER  A 
COMMUNICATION  CHANNEL 


Various  channel  imperfections  will  distort  the  signals  transmitted  over  a  channel.  The  nature 
of  signal  distortion  will  now  be  studied. 


134 


ANALYSIS  AND  TRANSMISSION  OF  SIGNALS 


3.6.1  Linear  Distortion 

We  shall  first  consider  linear  time-invariant  channels.  Signal  distortion  can  be  caused  over  such 
a  channel  by  nonideal  characteristics  of  magnitude  distortion,  phase  distortion,  or  both.  We 
can  identify  the  effects  these  nonidealities  will  have  on  a  pulse  g(t)  transmitted  through  such  a 
channel.  Let  the  pulse  exist  over  the  interval  ( a ,  b)  and  be  zero  outside  this  interval.  We  recall 
the  discussion  in  Sec.  3. 1  about  the  marvelous  balance  of  the  Fourier  spectrum.  The  components 
of  the  Fourier  spectrum  of  the  pulse  have  such  a  perfect  and  delicate  balance  of  magnitudes  and 
phases  that  they  add  up  precisely  to  the  pulse  g(t)  over  the  interval  ( a ,  b )  and  to  zero  outside 
this  interval.  The  transmission  of  g(t)  through  an  ideal  channel  that  satisfies  the  conditions 
of  distortionless  transmission  also  leaves  this  balance  undisturbed,  because  a  distortionless 
channel  multiplies  each  component  by  the  same  factor  and  delays  each  component  by  the 
same  amount  of  time.  Now,  if  the  amplitude  response  of  the  channel  is  not  ideal  [i.e.,  \H(f)\ 
is  not  equal  to  a  constant],  this  delicate  balance  will  be  disturbed,  and  the  sum  of  all  the 
components  cannot  be  zero  outside  the  interval  ( a ,  b).  In  short,  the  pulse  will  spread  out  (see 
Example  3.17).  The  same  thing  happens  if  the  channel  phase  characteristic  is  not  ideal,  that 
is*  @h(f)  7^  27t ftd.  Thus,  spreading,  or  dispersion,  of  the  pulse  will  occur  if  the  amplitude 

response,  the  phase  response,  or  both,  are  nonideal. 

Linear  channel  distortion  (dispersion  in  time)  is  damaging  particularly  to  digital  commu¬ 
nication  systems.  It  introduces  what  is  known  as  the  intersymbol  interferences  (ISI).  In  other 
words,  a  digital  symbol,  when  transmitted  over  a  dispersive  channel,  tends  to  spread  wider 
than  its  allotted  time.  Therefore,  adjacent  symbols  will  interfere  with  one  another,  thereby 
increasing  the  probability  of  detection  error  at  the  receiver. 


Example  3. 1  7  A  low-pass  filter  (Fig.  3.35a)  transfer  function  //(/)  is  given  by 

W)  = 


(1  +  k  cos  2n fT)e~j27lft<t  \f\<B 
0  If  I  >  B 


(3.62) 


ou.pXr  g(°  band'Umited  l°  B  HZ  (Fig‘  335b)  is  *****  ^  the  input  of  this  filter.  Find  the 


« nn  rr an,i  "onideai  n,a8"i,“de  •*•*****■  b“«“*  *<» 


Y(f)  =  G(f)H  (/) 

=  G(/)  •  n  (i  +£cos  2 7tJT)e~j2TCfid 
=  G(f)e  J-rt'd  +  k  [G(/)  cos  2jtjT] 


(3.63) 


Note  that  in  the  derivation  of  Eq.  (3  63)  because  •  .  ... 

C.(  f\  n  (L\  ^  ^  b  cause  S(t)  ts  band-limited  to  B  Hz,  we  have 

w ;  hav"1  ~  G(f)  By  USing  the  time'shiftin8  property , 


'  and  Eqs.  (3.32a)  and  (3.34), 


y(0  ~  8it  ~ td)  +  2lg(t  ~t‘>~T)  +  8(t  -td  +  T)] 


(3.64) 


. 
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Figure  3.35 

Pulse  is 

dispersed  when 
it  passes  through 
a  system  that  is 
not  distortionless. 


The  output  is  actually  g(t)  +  {k/2)[g(t  -T)  +  g(t  +  7)]  delayed  by  td.  It  consists 
of  g(t)  and  its  echoes  shifted  by  ±td.  The  dispersion  of  the  pulse  caused  by  its  echoes 
is  evident  from  Fig.  3.35c.  Ideal  amplitude  but  nonideal  phase  response  of  //(/)  has  a 
similar  effect  (see  Prob.  3.6-1). 


3.6.2  Distortion  Caused  by  Channel  Nonlinearities 

Until  now  we  have  considered  the  channel  to  be  linear.  This  approximation  is  valid  only 
for  small  signals.  For  large  signal  amplitudes,  nonlinearities  cannot  be  ignored.  A  general 
discussion  of  nonlinear  systems  is  beyond  our  scope.  Here  we  shall  consider  the  simple  case 
of  a  memoryless  nonlinear  channel  in  which  the  input  g  and  the  output  y  are  related  by  some 
(memoryless)  nonlinear  equation. 


y  -fig) 

The  right-hand  side  of  this  equation  can  be  expanded  in  a  Maclaurin  series  as 

y(t)  =  oo  +  a\g{t)  +  a2g2(t)  +  fl3g3(0  H - h  akgk{t)  H - 

Recall  the  result  in  Sec.  3.3.6  (convolution)  that  if  the  bandwidth  of  g(t)  is  B  Hz,  then  the 
bandwidth  of  gk  ( t )  is  kB  Hz.  Hence,  the  bandwidth  of  y(/)  is  greater  than  kB  Hz.  Consequently, 
the  output  spectrum  spreads  well  beyond  the  input  spectrum,  and  the  output  signal  contains 
new  frequency  components  not  present  in  the  input  signal.  In  broadcast  communication,  we 
need  to  amplify  signals  at  very  high  power  levels,  where  high-efficiency  (class  C)  amplifiers 
are  desirable.  Unfortunately,  these  amplifiers  are  nonlinear,  and  their  use  to  amplify  signals 
causes  distortion.  This  is  one  of  the  serious  problems  associated  with  AM  signals.  However, 
FM  signals  are  not  affected  by  such  nonlinear  distortion,  as  will  be  shown  in  Chapter  5.  If 
a  signal  is  transmitted  over  a  nonlinear  channel,  the  nonlinearity  not  only  distorts  the  signal, 
but  also  causes  interference  with  other  signals  in  the  channel  because  of  its  spectral  dispersion 
(spreading). 
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For  digital  communication  systems,  the  nonlinear  distortion  effect  is  in  contrast  to  the 
time  dispersion  effect  due  to  linear  distortion.  Linear  distortion  causes  interference  among 
signals  within  the  same  channel,  whereas  spectral  dispersion  due  to  nonlinear  distortion  causes 
interference  among  signals  using  different  frequency  channels. 


Example  3.18  The  input  x(t)  and  the  output  y(t)  of  a  certain  nonlinear  channel  are  related  as 

y(t)=x(t)  +  0.000  158jc2(0 

Find  the  output  signal  y(t )  and  its  spectrum  Y(f)  if  the  input  signal  is  x (t)  =  2000 
sine  (20007rf).  Verify  that  the  bandwidth  of  the  output  signal  is  twice  that  of  the  input  sig¬ 
nal.  This  is  the  result  of  signal  squaring.  Can  the  signal  x(t)  be  recovered  (without  distortion) 
from  the  output  y(t)l 

Since 


x(t)  =  2000  sine  (20007rf)  X(f)  =  U 


We  have 

v(/)  =  x(t)  +  0.000 158.r2(r)  =  2000  sine  (2000^)  +  0.316  •  2000  sine2  (  2000717) 


Y(f)  =  n  (J—)+03l6*(J-\ 

\2000/  \  4000  / 


Obser\  e  that  0.3 1 6  2000  sine  (2000;r/)  is  the  unwanted  (distortion)  term  in  the  received 
signal.  Figure  3.36a  shows  the  input  (desired)  signal  spectrum  X(f)\  Fig.  3.36b  shows 
the  spectrum  of  the  undesired  (distortion)  term;  and  Fig.  3.36c  shows  the  received  signal 
spectrum  Y (/).  We  make  the  following  observations: 

1  •  ^  bandwidth  of  the  received  signal  y(t)  is  twice  that  of  the  input  signal  x(t) 
(because  of  signal  squaring). 

632sL2??r^nV™  S'8nal  COnfla'nS  thC  'nput  signal  P|US  a"  unwanted  signal 
,  y  '  (  hc  sPectra  these  two  signals  are  shown  in  Fig.  3.36a  and  b.  Figure 

sienalanTth  ■  !£»’  l *  sp?ctn“n  ot  the  received  signal.  Note  that  the  spectra  of  the  desired 
signal  and  the  distortion  s.gnal  overlap,  and  it  is  impossible  to  recover  the  signal  x(t)  from 
the  received  signal  y(f)  without  some  distortion.  8 

filter  having  a  bandwkhh  ol  KKIOH^Th  PaSSmg  ***  received  signal  throu8h  a  low-pass 
Fic  3  36d  Observe  th  tu  c  C  spectrum  of  the  output  of  this  filter  is  shown  in 

™  e  0U'P'U  °f  ,hiS  filKr  iS  *>  *->  with 

.  J  “C^:rence;i,h  o,her  sig”is  if  ,he  '"pui 

channel.  This  means  that  several  siemk  ,  8  W'th  SCVeral  °ther  signals  °n  th'S 

transmitted  simultaneously  on  the  sLe  channef  SDre°d°VeriaPPing  freqUe"Cy  bands 
original  band  of  1000  Hz  will  interfere  with  th  Sp  dmg  the  spectrum  X  (/)  outside  its 

Thus,  in  addition  to  the  distortion  of  x(h  L  h  ^  the  band  °f  1000  t0  2000  HZ' 
band.  ;  ’  we  have  an  tnterference  with  the  neighboring 
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5.  It  x(t)  were  a  digital  signal  in  the  formal  of  a  pulse  train,  each  pulse  would  be 
distorted,  but  there  would  be  no  interference  with  the  neighboring  pulses.  Moreover  even 
with  distorted  pulses,  data  can  be  received  without  loss  because  digital  communication  can 
withstand  considerable  pulse  distortion  without  loss  of  information.  Thus,  if  this  channel 
were  used  to  transmit  a  time-division-multiplexed  signal  consisting  of  two  interleaved 
pulse  trains,  the  data  in  the  two  trains  would  be  recovered  at  the  receiver. 


Figure  3.36 

Signal  distortion 
caused  by 
nonlinear 
operation: 

(a)  desired 
(input)  signal 
spectrum; 

(b)  spectrum  of 
the  unwanted 
signal  (distortion) 
in  the  received 
signal; 

(c)  spectrum  of 
the  received 
signal; 

(d)  spectrum  of 
the  received 
signal  after 
low-pass 
filtering. 


1 

X(f) 

1000 

0 

1000 

(a) 


Distortion  term  spectrum 


2000 


(b) 


-2000  -1000 


1000  2000 


(c) 


(d) 


3.6.3  Distortion  Caused  by  Multipath  Effects 

Multipath  transmission  occurs  when  a  transmitted  signal  arrives  at  the  receiver  by  two  or 
more  paths  of  different  delays.  For  example,  if  a  signal  is  transmitted  over  a  cable  that  has 
impedance  irregularities  (mismatching)  along  the  path,  the  signal  will  arrive  at  the  receiver 
in  the  form  of  a  direct  wave  plus  various  reflections  with  various  delays.  In  radio  links,  the 
signal  can  be  received  by  direct  path  between  the  transmitting  and  the  receiving  antennas  and 
also  by  reflections  from  other  objects,  such  as  hills  and  buildings.  In  long-distance  radio  links 
that  use  the  ionosphere,  similar  effects  occur  because  of  one-hop  and  multihop  paths.  In  each 
of  these  cases,  the  transmission  channel  can  be  represented  as  several  channels  in  parallel, 
each  with  a  different  relative  attenuation  and  a  different  time  delay.  Let  us  consider  the  case 
of  only  two  paths:  one  with  a  unity  gain  and  a  delay  and  the  other  with  a  gain  a  and  a 
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Figure  3.37 

Multipath 

transmission. 


Received 

signal 


I  H(f)\ 


(b) 


delay  tj  +  A t,  as  shown  in  Fig.  3.37a.  The  transfer  functions  of  the  two  paths  are  given  by 
g-jlnftd  an(j  ae-j2nf(td+At)'  reSpeCtiVely.  The  overall  transfer  function  of  such  a  channel  is 
//(/),  given  by 


H(f)  =  e~P*fti  +  ae~^f  b</+^0 

=  e~j2nfid  ( 1  +  ae~J2nfAl)  (3.65a) 

=  e~j2jrfid  ( 1  +  a  cos  Inf  At  -  ja  sin  2: rf  Ar) 


1  +  or2  +  2crcos  2nf  At  exp 


°sinto/a'  ) 
V  1  +acos  2nf  At ) 

- - - - - - - - 

W) 


(3.65b) 


Both  the  magnitude  and  the  phase  characteristics  of  H(f)  are  periodic  in/  with  a  period  of 
1  /  A t  <F«g-  3  37b).  The  multipath  channel,  therefore,  can  exhibit  nonidealities  in  the  magnitude 
and  the  phase  characteristics  of  the  channel  and  can  cause  linear  distortion  (pulse  dispersion), 
as  discussed  earlier. 

If.  for  instance,  the  gains  of  the  two  paths  are  very  close,  that  is.  o  %  I  then  the  signals 
received  Iron,  the  two  paths  may  have  opposite  phase  (tr  rad  apart)  at  certain  frequencies. 
This  means  that  at  those  frequencies  where  the  two  paths  happen  to  result  in  opposite  phase, 
the  signals  from  the  two  paths  will  almost  cancel  each  other.  Equation  (3,65b)  shows  that  at 
frequences  where/ =„/<2A,  („  odd),  cos  2u/A,  -  -1,  and  |H,/)|  *  0  when  „  *  1- 
These  frequences  are  the  multipath  null  frequencies.  At  frequencies/  _  „/(2At)  (n  even). 

,ere  “"T  '? to  enh™“  said,  such  channels  cause  frequency 

select,,,  fading  ot  transmuted  signals.  Such  disunion  can  be  parily  corrected  by  using  the 


3.7 


CO  CO 
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tapped  delay  line  equalizer,  as  shown  in  Prob.  3.6-2.  These  equalizers  are  useful  in  several 
applications  in  communications.  Their  design  issues  are  addressed  later  (Chapters  7  and  13). 

3.6.4  Fading  Channels 

Thus  tar.  the  channel  characteristics  have  been  assumed  to  be  constant  over  time.  In  practice,  we 
encounter  channels  whose  transmission  characteristics  vary  with  time.  These  include  troposcat- 
ter  channels  and  channels  using  the  ionosphere  for  radio  reflection  to  achieve  long-distance 
communication.  The  time  variations  of  the  channel  properties  arise  because  of  semiperiodic 
and  random  changes  in  the  propagation  characteristics  of  the  medium.  The  reflection  properties 
of  the  ionosphere,  for  example,  are  related  to  meteorological  conditions  that  change  seasonally, 
daily,  and  even  from  hour  to  hour,  much  like  the  weather.  Periods  of  sudden  storms  also  occur. 
Hence,  the  effective  channel  transfer  function  varies  semiperiodically  and  randomly,  causing 
random  attenuation  of  the  signal.  This  phenomenon  is  known  as  fading.  One  way  to  reduce 
the  effects  slow  fading  is  to  use  automatic  gain  control  (AGC).* 

Fading  may  be  strongly  frequency  dependent  where  different  frequency  components 
are  affected  unequally.  Such  fading,  known  as  frequency-selective  fading,  can  cause  serious 
problems  in  communication.  Multipath  propagation  can  cause  frequency-selective  fading. 


3.7  SIGNAL  ENERGY  AND  ENERGY 
SPECTRAL  DENSITY 


The  energy  Eg  of  a  signal  g(t)  is  defined  as  the  area  under  |g(/)|2.  We  can  also  determine  the 
signal  energy  from  its  Fourier  transform  G(f)  through  Parseval’s  theorem. 

3.7.1  Parseval's  Theorem 

Signal  energy  can  be  related  to  the  signal  spectrum  G(f)  by  substituting  Eq.  (3.9b)  in  Eq.  (2.2): 


Eg=  g(t)g*(t)dt  =  I  g(t)\  G*(f)e-j27lfidf\dt 
J-o o  J- 00  U -oc  J 


Here,  we  used  the  fact  that  g*(t),  being  the  conjugate  of  g(t),  can  be  expressed  as  the  conjugate 
of  the  right-hand  side  of  Eq.  (3.9b).  Now,  interchanging  the  order  of  integration  yields 


(3.66) 


This  is  the  well-known  statement  of  Parseval’s  theorem.  A  similar  result  was  obtained  for  a 
periodic  signal  and  its  Fourier  series  in  Eq.  (2. 1 02).  This  result  allows  us  to  determine  the  signal 
energy  from  either  the  time  domain  specification  g(t)  or  the  frequency  domain  specification 
G(f )  of  the  same  signal. 


*  AGC  will  also  compensate  slow  variations  of  the  original  signal. 
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Example  3.19  Verify  Parseval’s  theorem  for  the  signal  g(r)  =  e  alu(t)  ( a  >  0). 
We  have 


Figure  3.38 

Interpretation  of 
the  energy 
spectral  density 
of  a  signal. 


/OO  pOO 

g2(t)dt=  e~2",dt=±- 
-00  JO 


la 


(3.67) 


We  now  determine  Eg  from  the  signal  spectrum  G(/)  given  by 

1 


j2nf  +  a 


and  from  Eq.  (3.66), 

/OO  /.£ 

\G(f)\2df  = 

■OO  J- 

which  verifies  Parseval’s  theorem. 


1  ,,  1  Inf  00 


1 

la 


3.7.2  Energy  Spectral  Density  (ESD) 

SIT conSbuw  to  trKd r°  ,he  “f  *  *<')  -  <■«  »f 

if  L  SPeC”'  COmp°nenB  »f  «S"al  8(0.  The  con.ribu.ion  of 
consider  a  signal  git )  annlird  o'th’  ■  pi?porllonal  to  IG(/)|2.  To  elaborale  this  further. 
H(/l  is  shown  in  Fig.  3.38a. This  fitosupp^r^Sen Wh“* 

3'3sb>  “  *  &  oTOn.rn.stu^ 

13  WJ  ,mj ).  and  the  energy  of  the  output  y(r),  is 


£r  =  /  IG(/)H(/)|! 

J  —  oc 


df 


(3.68) 


(a) 


(b) 


1 

1 

//(/). 

-/o 

/o  /— ► 

^^HG(/)I2 

- - 

1C(/0)|2 

~/o  C 

1 

1  \ 

>  /-^ 
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Because  H  (/)—  1  over  the  passband  A/,  and  zero  everywhere  else,  the  integral  on  the  right-hand 
side  is  the  sum  of  the  two  shaded  areas  in  Fig.  3.38b.  and  we  have  (for  A/  — >  0) 


Ey  =  2  \G(fo)\2  df 


Thus,  2|G(/o)|2  df  is  the  energy  contributed  by  the  spectral  components  within  the  two  narrow 


bands,  each  of  width  A f  Hz,  centered  at  ±/0.  Therefore,  we  can  interpret  |G(/)|2  as  the  energy 
per  unit  bandwidth  (in  hertz)  of  the  spectral  components  of  g(t )  centered  at  frequency/.  In 
other  words,  |G(/)|2  is  the  energy  spectral  density  (per  unit  bandwidth  in  hertz)  of  g(r). 
Actually,  the  energy  contributed  per  unit  bandwidth  is  2|G(/)|2  because  the  positive-  and  the 
negative-frequency  components  combine  to  form  the  components  in  the  band  A/.  However, 
for  the  sake  of  convenience  we  consider  the  positive-  and  negative-frequency  components 
being  independent.  (Some  authors  do  define  2|G(/)|2  as  the  energy  spectral  density.)  The 
energy  spectral  density  (ESD)  %,(/)  is  thus  defined  as 


%(f)  =  \G(f)\2 


(3.69) 


and  Eq.  (3.66)  can  be  expressed  as 


(3.70a) 


From  the  results  in  Example  3.19,  the  ESD  of  the  signal  g(t)  =  e  a'u(t )  is 


(3.70b) 


3.7.3  Essential  Bandwidth  of  a  Signal 


The  spectra  of  most  signals  extend  to  infinity.  However,  because  the  energy  of  a  practical  signal 
is  finite,  the  signal  spectrum  must  approach  0  as/  — >  oo.  Most  of  the  signal  energy  is  contained 
within  a  certain  band  of  B  Hz,  and  the  energy  content  of  the  components  of  frequencies  greater 
than  B  Hz  is  negligible.  We  can  therefore  suppress  the  signal  spectrum  beyond  B  Hz  with  little 
effect  on  the  signal  shape  and  energy.  The  bandwidth  B  is  called  the  essential  bandwidth  of  the 
signal.  The  criterion  for  selecting  B  depends  on  the  error  tolerance  in  a  particular  application. 
We  may,  for  instance,  select  B  to  be  the  bandwidth  that  contains  95%  of  the  signal  energy.* 
The  energy  level  may  be  higher  or  lower  than  95%,  depending  on  the  precision  needed.  Using 
such  a  criterion,  we  can  determine  the  essential  bandwidth  of  a  signal.  Suppression  of  all  the 
spectral  components  of  g(t )  beyond  the  essential  bandwidth  results  in  a  signal  g(t)y  which  is  a 
close  approximation  of  g{t).  If  we  use  the  95%  criterion  for  the  essential  bandwidth,  the  energy 
of  the  error  (the  difference)  g(t)  -  g(t)  is  5%  of  Eg.  The  following  example  demonstrates  the 
bandwidth  estimation  procedure. 


*  Essential  bandwidth  for  a  low-pass  signal  may  also  be  defined  as  a  frequency  at  which  the  value  of  the  amplitude 
spectrum  is  a  small  fraction  (about  5-10%)  of  its  peak  value.  In  Example  3.19,  the  peak  of  |G(/)|  is  1/a,  and  it 
occurs  at/  =  0. 
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Example  3.20  Estimate  the  essential  bandwidth  W  (in  rad/s)  of  the  signal  e  a,u(t)  if  the  essential  band 
required  to  contain  95%  of  the  signal  energy. 

In  this  case. 


G(/)  = 


1 


jlnf  +  a 


and  the  ESD  is 


|G(/)2|  = 


1 


(2jt/)2  +  a2 


ES?  !?uhT  ‘n  F’8'  3'39'  Moreover’ the  s*gnal  energy  Eg  is  the  area  under  this 
ESD  which  has  already  been  found  to  be  1/2 a.  Let  W  rad/s  be  the  essential  bandwidth, 

wh,ch  contains  95%  of  the  total  signal  energy  Eg.  This  means  the  shaded  area  in  Fig.  3.39 
is  0.95/2a,  that  ts,  & 


0.95  rW/2ir 

20  J-W/27T 


df 


w/2n  (27 r/)2  +  a2 


-i  27 r/ 
=  - —  tan  — 
27 xa  a 


w/2n  1  IV 

=  —  tan  1  — 
-w/2jt  ™  a 


or 


0.95tt  w 

——  =  tan  1  — 
1  a 


W  —  12.7a  rad/s 


In  terms  of  hertz,  the  essential  bandwidth  equals 


to 


Q  W 

8=^  =  2.02» 


Hz 


aos2T:s'  s,2(,’:r b-  « <*> « >«  * .  »* 

components  (in  the  band  from  2  02  y  „  u  S  energy:  all  the  remaining  spectral 
energy.*  z  to  00 )  contribute  only  5%  of  the  signal 
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Figure  3.39 

Estimating  the 
essential 
bandwidth  of  a 
signal. 


U) 


f 


Example  3.2  1  Estimate  the  essential  bandwidth  of  a  rectangular  pulse  g(t)  =  n  (t/T)  (Fig.  3.40a),  where 
the  essential  bandwidth  must  contain  at  least  90%  of  the  pulse  energy. 


For  this  pulse,  the  energy  Eg  is 


"7/ 2 


/oo  r  i 

g\t)d,=  / 

-oo  J-T/2 


dt  =  T 


Also  because 


n  <=>  T  sine  (nfT) 


the  ESD  for  this  pulse  is 


^(/)  =  |G(/)|2  =  r2sinc2  (jtfT) 


This  ESD  is  shown  in  Fig.  3.40b  as  a  function  of  coT  as  well  as /T,  where /  is  the  frequency 
in  hertz.  The  energy  Eb  within  the  band  from  0  to  B  Hz  is  given  by 


CB 

Eb=  T2  sine2  (nJT)  df 
J-B 


Setting  2 ttJT  =  x  in  this  integral  so  that  df  =  dx  /(2nT),  we  obtain 


t  r2nBT  .  2 

E‘=nl  ^  (i)  * 


Also  because  Eg  =  T,  we  have 


Eb_ 

% 


1  f 2nBT  .  2,x\ 

n  Jo  SinC  V2/ 


dx 


The  integral  on  the  right-hand  side  is  numerieally  computed,  and  the  plot  of  EB/Eg  vs. 
BT  is  shown  in  Fig.  3.40c.  Note  that  90.28%  of  the  total  energy  of  the  pulse  g(r)  is 
contained  within  the  band  B  =  \/T  Hz.  Therefore,  with  the  90%  criterion,  the  bandwidth 
of  a  rectangular  pulse  of  width  T  seconds  is  1/7  Hz.  A  similar  result  was  obtained  from 
Example  3.2. 
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Figure  3.40 

(a)  Rectangular 
function; 

(b)  its  energy 
spectral  density; 

(c)  energy  vs. 
WT. 
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3.7.4  Energy  of  Modulated  Signals 


nave  >ccil 


- - —  atuxw  me  M^nai  spectrum  O  /  )  to  the  left  ar 

now  show  that  a  similar  thing  happens  ,he  BSD  of  ,he  modulated  signll 

signaMoi's  *  “  !i8"al  ba,,d-|imittd  “  B  Hr.  The  amplitude-modal 


(p{t)  =  g(t)  cos  Infot 

and  the  spectrum  (Fourier  transform)  of  <p(t)  is 

fl/)  =  j[C(/+/„)  +  G(/_/o)] 
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Figure  3.41 

Energy  spectra 
densities  of 

(a)  modulating 
and 

(b)  modulated 
signals. 


(a) 


**(/) 


Kl  4 


0 


~/o 


/o 


(b) 


25 


The  ESD  of  the  modulated  signal  (p(t)  is  |  <!>(/)  |2,  that  is, 


%(f)  =  ^  I  G(/  +/0)  +  G(/  -/0)|2 


If /o  >  then  G(/  +  /o)  and  G(/  — /o)  are  nonoverlapping  (see  Fig.  3.41),  and 


=  4'M/+/o)  +  4*s  (/-/o) 


(3.71) 


The  ESDs  of  both  g(t)  and  the  modulated  signal  (p(t )  are  shown  in  Fig.  3.41 .  It  is  clear  that 
modulation  shifts  the  ESD  of  g(t)  by  ±fo.  Observe  that  the  area  under  ^(/)  is  half  the  area 
under  »l >g(f).  Because  the  energy  of  a  signal  is  proportional  to  the  area  under  its  ESD.  it  follows 
that  the  energy  of  <p(t)  is  half  the  energy  of  g(t),  that  is. 


E<p  —  2^g  fo  >  B 


(3.72) 


It  may  seem  surprising  that  a  signal  cp(t),  which  appears  so  energetic  in  comparison  to  g{t ), 
should  have  only  half  the  energy  of  g(t).  Appearances  are  deceiving,  as  usual.  The  energy  of 
a  signal  is  proportional  to  the  square  of  its  amplitude,  and  higher  amplitudes  contribute  more 
energy.  Signal  g(t)  remains  at  higher  amplitude  levels  most  of  the  time.  On  the  other  hand, 
<p(t),  because  of  the  factor  cos  Infyt,  dips  to  zero  amplitude  levels  many  times,  which  reduces 
its  energy. 


3.7.5  Time  Autocorrelation  Function  and 


Energy  Spectral 


In  Chapter  2  we  showed  that  a  good  measure  of  comparing  two  signals  g(t)  and  z(t)  is  the 
cross-correlation  function  x//gz( r)  defined  in  Eq.  (2.59).  We  also  defined  the  correlation  of  a 
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signal  g(t)  with  itself  (the  autocorrelation  function  \frg( r)]  in  Eq.  (2.60).  For  a  real  signal  g(t), 
the  autocorrelation  function  ^(r)  is  given  by* 


/OO 

g(r)g(r  +  r)  dt 

-OO 


(3.73a) 


Setting  x  =  t  +  r  in  Eq.  (3.73a)  yields 


iMr) 


-f 


g(x)g(x-r)dx 


In  this  equation,  x  is  a  dummy  variable  and  could  be  replaced  by  t.  Thus, 


iMf) 


/OO 

g(r)g(r  ±T)dt 

-OO 


(3.73b) 


This  shows  that  tor  a  real  g(t),  the  autocorrelation  function  is  an  even  function  of  r,  that  is, 

M  T)  =  fg(-r)  (3.73c) 

There  is,  in  tact,  a  very  important  relationship  between  the  autocorrelation  of  a  signal  and 
its  ESD.  Specifically,  the  autocorrelation  function  of  a  signal  g(r)  and  its  ESD  4\,(/)  form  a 
Fourier  transform  pair,  that  is, 


Thus, 


iMr)  <=>  »,(/) 


(3.74a) 


Example 


/OO 

tg(T)e~j2nf  *  dr 

-OO 

,  r  oo 

^(r)  =  F~x  { ¥,(/)}  =  /  ^(/)eW T df 

J -oo 


(3.74b) 

(3.74c) 


No“ 'hM  Foune'  m"sfr  °f  Eq'  <3'74al  is  P'*0™*  with  respect  to  r  in  place  of  I. 
tion  LZTtT,  Ai',1,  =  IC(/)P  iS,he  ^transform  oftheaotocorrela- 

rJSo  N«1  iS  Pr0VCd  here  for  real  If*,  it  is  valid  for  complex 

,raS„sfo™  is  r  fU"C',0n  iS  *  fUnC,i0n  °f  '•  “  '•  He"“- its  R”rKr 


tgiT)] 


Le  1  TfT  [£00g(,)g(r + z)dt] dx 

r  oo 


~  Lj(')[Lju+,)e~i2’,'dr\ 


dt 


*  For  a  complex  signal  g(t),  we  define 

/oo 

-oo 


8U)g*U -T)dt 


■r 


8  (l)g(t  +  x)dt 
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The  inner  integral  is  the  Fourier  transform  of  g( r  +  /),  which  is  g( r)  left-shifted  by  t.  Hence, 
it  is  given  by  the  time-shifting  property  [in  Eq.  (3.32a)],  G(f)ej2nf' .  Therefore, 


FWgi  r)]  = 


G(f)  r  giOeM 

J -00 


dt  =  G(/)G(-/)  =  |G(/)|2 


This  completes  the  proof  that 


fg(r)  ^gif)  =  |G(/)|2  (3.75) 

A  careful  observation  of  the  operation  of  correlation  shows  a  close  connection  to  con¬ 
volution.  Indeed,  the  autocorrelation  function  is  the  convolution  of  g(r)  with  g(— r) 

because 


/OO  rOO 

g(x)g[-(T -x)]dx  =  /  g(x)g(x  -  T)dx  =  \J/g(r) 

-00  J -00 

Application  of  the  time  convolution  property  [Eq.  (3.45)]  to  this  equation  also  yields  Eq.  (3.75). 


Example  3.22  Find  the  time  autocorrelation  function  of  the  signal  g(t)  =  e  a'u(t),  and  from  it  determine  the 
ESD  of  g(t). 


In  this  case. 


g(t)  =  e  a,u(t)  and  g(t  —  r)  =  e  a(t  x)u(t  —  r) 


Recall  that  g(t  —  r)  is  g(t)  right-shifted  by  r,  as  shown  in  Fig.  3.42a  (for  positive  r).  The 
autocorrelation  function  ^,(r)  is  given  by  the  area  under  the  product  g(t)g(t  -  r)  [see 
Eq.  (3.73b)].  Therefore, 


(Mr) 


-f 


g(t)g(t  ~  x)dt 


00  1 

e~2",dt  =  —  e~ax 
2 a 


This  is  valid  for  positive  r.  We  can  perform  a  similar  procedure  for  negative  r.  However, 
we  know  that  for  a  real  g(t),  r//g(r)  is  an  even  function  of  r.  Therefore, 

w  =  ie~a'T' 


Figure  3.42b  shows  the  autocorrelation  function  x//g(,T).  The  ESD  4^ (/)  is  the  Fourier 
transform  of  \j/g( r).  From  Table  3.1  (pair  3),  it  follows  that 


V/)  = 


1 


(2jt/)2  +  a 2 

which  confirms  the  earlier  result  of  Eq.  (3.70b)  in  Sec.  3.7.2. 
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Figure  3.42 

Computation  of 
the  time 
autocorrelation 
function. 


Figure  3.43 

Limiting  process 
in  derivation  of 
PSD. 


ESD  of  the  Input  and  the  Output 

It  x(t )  and  y(t)  are  the  input  and  the  corresponding  output  of  a  linear  time-invariant  (LTI) 
system,  then 


Therefore, 


Y(f)  =  H(f)X(f) 


\Y(f)\2  =  \H(f)\2\X(f)\2 

This  shows  that 


(3.76) 


Tta.  .he  output  signal  ESD  i.  |H(/)|2  times  ihe  input  signal  ESD. 

3.8  SIGNAL  POWER  AND  POWER 
SPECTRAL  DENSITY 


For  a  power  signal,  a  meaningful  measure  of 


its  size  is  its  power  [defined  in  Eq.  (2.4)]  as  the 


IZZrC’,  w  is  °Ver  ,h' inMte  ,im'  The  power  P,  of  a 


Pg  =  lim 

r- oo 


l  rW 

T  I  l 
‘  J-T/2 


SS?,  >7"  and  rela"d  COn“I,'S  «  *  — u  eaderstood 


(3.77) 


by  defining  a  truncated 


gr(t)  = 


U(0 

|0 


1*1  <  T/2 
l»l  >  T/2 
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Figure  3.43 

Limiting  process 
in  derivation  of 
PSD. 


£(0 


Sr(') 


-77  2 


77  2 


The  truncated  signal  is  shown  in  Fig.  3.43.  The  integral  on  the  right-hand  side  of  Eq.  (3.77) 
yields  EgT ,  which  is  the  energy  of  the  truncated  signal  grit)-  Thus, 


(3.78) 


This  equation  describes  the  relationship  between  power  and  energy  of  nonperiodic  signals. 
Understanding  this  relationship  will  be  very  helpful  in  understanding  and  relating  all  the 
power  concepts  to  the  energy  concepts.  Because  the  signal  power  is  just  the  time  average  of 
energy,  all  the  concepts  and  results  of  signal  energy  also  apply  to  signal  power  if  we  modify 
the  concepts  properly  by  taking  their  time  averages. 


3.8.1  Power  Spectral  Density  (PSD) 


If  the  signal  g(t)  is  a  power  signal,  then  its  power  is  finite,  and  the  truncated  signal  gr(t)  is  an 
energy  signal  as  long  as  T  is  finite.  If  gr(t)  <=>■  Gy(/).  then  from  Parseval’s  theorem. 


Hence,  Pg,  the  power  of  g(t),  is  given  by 


(3.79) 


As  T  increases,  the  duration  of  grU)  increases,  and  its  energy  EgT  also  increases  propor¬ 
tionately.  This  means  that  |Gt-(/)|2  also  increases  with  T ,  and  as  T  -►  oo,  |Gr(/)|2  also 
approaches  oo.  However,  |Gt-(/)|2  must  approach  oo  at  the  same  rate  as  T  because  for  a 
power  signal,  the  right-hand  side  of  Eq.  (3.79)  must  converge.  This  convergence  permits  us  to 
interchange  the  order  of  the  limiting  process  and  integration  in  Eq.  (3.79),  and  we  have 


(3.80) 
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We  define  the  power  spectral  density  (PSD)  as 


Consequently,* 


W)=  l™ 

/— ►  OC 


|Gr(/)l2 

T 


(3.81) 


(3.82a) 

(3.82b) 


This  result  is  parallel  to  the  result  [Eq.  (3.70a)]  for  energy  signals.  The  power  is  the  area  under 
the  PSD.  Observe  that  the  PSD  is  the  time  average  of  the  ESD  of  gT(t)  [Eq.  (3.8 1 )]. 

As  is  the  case  with  ESD,  the  PSD  is  a  positive,  real,  and  even  function  off.  If  g(t)  is  a 
voltage  signal,  the  units  of  PSD  are  volts  squared  per  hertz. 


3.8.2  Time  Autocorrelation  Function  of  Power  Signals 

The  (time)  autocorrelation  function  TZ^t)  of  a  real  power  signal  g(t)  is  defined  as+ 


TZg(r)  =  lim 
r-*  oo 


l  rT n 

r  /  g{t)git 
1  J-T/ 2 


-  x)dt 


(3.83a) 


Usmg  the  same  argument  as  that  used  for  energy  signals  [Eqs.  (3.73b)  and  (3.73c)],  we  can 
show  that  Kg(x)  is  an  even  function  of  r.  This  means  for  a  real  g(t) 


ftg(r)  =  lim 
r-oo 


1  [T! 2 

r  /  gU)g(t  +  T)dt 
1  J-T/2 


(3.83b) 


and 


'W  =  Kg(- r) 


(3.84) 


=  lim 


ir 

r]. 


griOgrit  +  z)dt  =  lim  ^gr(r) 
00  r-voo  T 


(3.85) 


c<m^nent^  Thelrnpi^f^fft  Sd  no?  h=  2  f  W)df  whe"  W>  contains , 
+  For  a  complex  *</),  we  define  P  should  not  **  m“'«>plied  by  the  factor  2. 


= 


..  i  rTn 

r^,  T  y_r/2s(,)g*('  ~x)dt=  lim 

/ -►oo 


1  fT/2 

T  J_T/2g*Ws(t  +  *)dt 
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TABLE  3.3 


t g  (*)  =  f”oo  g(')g(t  +  T)  dt 


i  rT'2 

ng(z)  =  lim  -  / 

T-*oo  T  J-T/2 


*g(f)  =  \G(f)\2 

^g(T)  <=>  »1 >g{f) 

Eg  =  f-oo  *g(f)4 


Kg( r)  «=>  Sg(f) 
Pg=f-00Sg(f)df 


Recall  from  Wiener-Khintchine  theorem  that  tygTix)  <=>  |Gy(/)|2.  Hence,  the  Fourier 
transform  of  the  preceding  equation  yields 


Hg(x)  <=>  lim 


(3.86) 


Although  we  have  proved  these  results  for  a  real  g(t),  Eqs.  (3.81),  (3.82a),  (3.82b),  and  (3.86) 
are  equally  valid  for  a  complex  git). 

The  concept  and  relationships  for  signal  power  are  parallel  to  those  for  signal  energy.  This 
is  brought  out  in  Table  3.3. 

Signal  Power  Is  Its  Mean  Square  Value 

A  glance  at  Eq.  (3.77)  shows  that  the  signal  power  is  the  time  average  or  mean  of  its  squared 
value.  In  other  words  Pg  is  the  mean  square  value  of  git).  We  must  remember,  however,  that 
this  is  a  time  mean,  not  a  statistical  mean  (to  be  discussed  in  later  chapters).  Statistical  means 
are  denoted  by  overbars.  Thus,  the  (statistical)  mean  square  of  a  variable  x  is  denoted  by  x2. 
To  distinguish  from  this  kind  of  mean,  we  shall  use  a  wavy  overbar  to  denote  a  time  average. 


Thus,  the  time  mean  square  value  of  git)  will  be  denoted  by  g2it).  The  time  averages  are 


conventionally  denoted  by  angle  brackets,  as  follows:  <g2(t)>.  We  shall,  however,  use  the 
wavy  overbar  notation  because  it  is  much  easier  to  associate  means  with  a  bar  on  top  rather 
than  the  brackets.  Using  this  notation,  we  see  that 


(3.87a) 


Note  that  the  rms  value  of  a  signal  is  the  square  root  of  its  mean  square  value.  Therefore, 


[g(0]rms  =  ^ 


(3.87b) 


From  Eq.  (3.83),  it  is  clear  that  for  a  real  signal  g(t),  the  time  autocorrelation  function 
Ilgiz)  is  the  time  mean  of  g(t)g(t  ±  r).  Thus, 


Ilgiz)  =  g(t)g(t  ±  r) 


(3.88) 


This  discussion  also  explains  why  we  have  been  using  the  term  time  autocorrelation  rather  than 
just  autocorrelation.  This  is  to  distinguish  clearly  the  present  autocorrelation  function  (a  time 
average)  from  the  statistical  autocorrelation  function  (a  statistical  average),  to  be  introduced 
in  Chapter  9  in  the  context  of  probability  theory  and  random  processes. 
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Example  3.23 


Interpretation  of  Power  Spectral  Density 

Because  the  PSD  is  the  time  average  of  the  ESD  of  g(t ),  we  can  argue  along  the  lines  used  in 
the  interpretation  of  ESD.  We  can  readily  show  that  the  PSD  Sg(f)  represents  the  power  per 
unit  bandwidth  (in  hertz)  of  the  spectral  components  at  the  frequency/.  The  amount  of  power 
contributed  by  the  spectral  components  within  the  band/i  to  fi  is  given  by 


A  P9 


(3.89) 


Autocorrelation  Method:  A  Powerful  Tool 

For  a  signal  g(t ),  the  ESD,  which  is  equal  to  |G(/)|2,  can  also  be  found  by  taking  the  Fourier 
transform  of  its  autocorrelation  function.  If  the  Fourier  transform  of  a  signal  is  enough  to  deter¬ 
mine  its  ESD,  then  why  do  we  needlessly  complicate  our  lives  by  talking  about  autocorrelation 
functions?  The  reason  for  following  this  alternate  route  is  to  lay  a  foundation  for  dealing  with 
power  signals  and  random  signals.  The  Fourier  transform  of  a  power  signal  generally  does  not 
exist.  Moreover,  the  luxury  of  finding  the  Fourier  transform  is  available  only  for  deterministic 
signals,  which  can  be  described  as  functions  of  time.  The  random  message  signals  that  occur 
in  communication  problems  (e.g.,  random  binary  pulse  train)  cannot  be  described  as  functions 
ot  time,  and  it  is  impossible  to  find  their  Fourier  transforms.  However,  the  autocorrelation 
function  for  such  signals  can  be  determined  from  their  statistical  information.  This  allows  us 
to  determine  the  PSD  (the  spectral  information)  of  such  a  signal.  Indeed,  we  may  consider 
the  autocorrelation  approach  as  the  generalization  of  Fourier  techniques  to  power  signals  and 
random  signals.  The  following  example  of  a  random  binary  pulse  train  dramatically  illustrates 


Figure  3.44 

Autocorrelation 
function  and 
power  spectral 
density  function 
of  a  random 
binary  pulse 
train. 


Figure  3 l.44u  shows  a  random  binary  pulse  main  sir).  The  pulse  width  is  Tb/ 2,  and  one  binary 

S“°'KiS  A  binary  ‘  iS  'ranSmi,“d  ^  lh'  Positive  pulse,  and  a 
dl  We  hT,  y  n”Sal,Ve  PU',e  ThC  ,W°  Symb“ls  are  equally  likely  and  oceur  ran- 
Ss^nT  '  aWOCOITe““"  Of***  -he  PSD.  and  Ihe  essential  bandwidth 

?  ^nrSomt^D"  fl,nC,i°n  °f  'ime  ^  precise  waveform, 

tt  ave^thesmtis^T  r  *  ho«w-  •«»  ««  signal  s  behavior  in  terms  of 


=  lim 


?r 

1  J-T/2 


8(t)g(t-r)dt 


lines.  To  determined  integrand  0^ bright  'S/(0  delayed  by  r’  by  dashed 

multiply  g(t)  with  g(t  -  ri "finH  .h«  8  3nd  Slde  of  tbe  foregoing  equation,  we 

by  the  averaging  interval  T  Ut  hl T  T?  Pr°duct  ~  *>•  and  divide  it 

r  =  M*.  and  as  “  ^  Tht  '  ^  duri"S  this  Nerval  7  so  that 


^(r)  =  lim 

W-oo 


1  rNTb/ 2 

NTb  J~NTb/ 2 


8(t)g(t-T)dt 
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Figure  3.44 

Autocorrelation 
function  and 
power  spectral 
density  function 
of  a  random 
binary  pulse 
train. 
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■T„I2 


(a) 


|J- 


g(t)  g(t-x) 

\ 


I 

I 


(b) 


FT 


Let  us  first  consider  the  case  of  r  <  Th/2.  In  this  case  there  is  an  overlap  (shaded 
region)  between  each  pulse  of  g(t)  and  that  of  g(t  -  r).  The  area  under  the  product 
g(t)g(t  -  r)  is  Tb/ 2  —  r  for  each  pulse.  Since  there  are  N  pulses  during  the  averaging 
interval,  then  the  total  area  under  g(t)g(t  -  r)  is  N (7*/2  —  r),  and 


-K"l) 


n  T» 

0  <  r  <  — 

2 


r  < 


Tb 
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Because  7^(r)  is  an  even  function  of  r, 

=  1*1  <y  (3.90a) 

as  shown  in  Fig.  3.44c. 

As  we  increase  r  beyond  Tb/2,  there  will  be  overlap  between  each  pulse  and  its 
immediate  neighbor.  The  two  overlapping  pulses  are  equally  likely  to  be  of  the  same 
polarity  or  of  opposite  polarity.  Their  product  is  equally  likely  to  be  1  or  - 1  over  the 
overlapping  interval.  On  the  average,  half  the  pulse  products  will  be  1  (positive-positive 
or  negative-negative  pulse  combinations),  and  the  remaining  half  pulse  products  will  be 
—  1  (positive-negative  or  negative-positive  combinations).  Consequently,  the  area  under 
g(t)g(t  -  r)  will  be  zero  when  averaged  over  an  infinitely  large  time  (T  ->  oo),  and 

*»(*>  =  0  |r|>y  (3.90b) 


The  two  parts  of  Eq.  (3.90)  show  that  the  autocorrelation  function  in  this  case  is  the 
triangular  function  jA(t/Th)  shown  in  Fig.  3.44c.  The  PSD  is  the  Fourier  transform  of 
\A(t/Tb),  which  is  found  in  Example  3.15  (or  Table  3.1,  pair  19)  as 


W)  =  ^si„c2(^) 


(3.91) 


SqUarei0f)theLSinC  function'  as  sh™n  in  Fig.  3.44d.  From  the  result  in 
the  band  from  U  31  ^ °^t*le  area  °f  this  spectrum  is  contained  within 

akenas^/r  »(b'Z  °  l°  2/Tb  Hz  Thus’ the  essential  bandwidth  may  be 

how  the  am^n  iTT  "  °%  P°Wer  criterion)' This  example  illustrates  dramatically 

iomt  lTwh  Ct'°n  ^  bC  USCd  '°  °btain  ,he  spiral  information  of  a 

uTable  conventional  means  of  obtaining  the  Fourier  spectrum  are  not 


3.8.3  Input  and  Output  Power  Spectral  Densities 

S'™:  tionf  p  br en  the  input  and  output 

argument  used  for  ESD  [Eq.  (3  76)1  we  can  rea  II  u™ Y  l°  ^  of  ESDs-  Following  the 
and  oolpu,  signals  of  an  ti?  ZZ&gfcZ* [>  Md  *>  “  *  ^ 


The  transfer  function  of  an  ideal  differentiator  is  H(f\ 

demodulator  output  is  «„(/),  then  from  Eq.  (3.92,  ^  ~  jlnf '  If  the  noise  at  the 

SnAf)  =  \H(f)\2Sni(f)  =  [j2nf\2K 
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The  output  PSD  Sno(f )  is  parabolic,  as  shown  in  Fig.  3.45c.  The  output  noise  power  N0 
is  the  area  under  the  output  PSD.  Therefore, 


/B  pB 

K(2nf)2df  =  2K  (2n f)2df  = 

-B  JO 


Stt2B*K 


Figure  3.45 

Power  spectral 
densities  at  the 
input  and  the 
output  of  an 
ideal 

differentiator. 
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3.8.4  PSD  of  Modulated  Signals 

Following  the  argument  in  deriving  Eqs.  (3.71)  and  (3.72)  for  energy  signals,  we  can  derive 
similar  results  for  power  signals  by  taking  the  time  averages.  We  can  show  that  for  a  power 
signal  g(t),  if 

<p(t)  =  g(t)  cos  2 nf0t 

then  if/o  >  B.  the  PSD  S^(/)  of  the  modulated  signal  <p(t)  is  given  by 

W)  =  \  [W  +/o)  +  W  -/o)]  (3-93) 

The  detailed  derivation  is  provided  in  Sec.  7.8.2.  Thus,  modulation  shifts  the  PSD  of  g(f)  by 
±/o.  The  power  of  <p(t)  is  half  the  power  of  g(t),  that  is, 

Pv  =  \ Pg  fo>B  (3.94) 

3.9  NUMERICAL  COMPUTATION  OF  FOURIER 
TRANSFORM:  THE  DFT 

To  compute  G(/),  the  Fourier  transform  of  g(t),  numerically,  we  have  to  use  the  samples 
of  g(t).  Moreover,  we  can  determine  G(f)  only  at  some  finite  number  of  frequencies.  Thus, 
we  can  compute  samples  of  G(/)  only.  For  this  reason,  we  shall  now  find  the  relationships 
between  samples  of  g(t)  and  samples  of  G(/). 
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Figure  3.46  Relation  between  the  samples  of  g(t)  and  of  G(/). 


In  numerical  computations,  the  data  must  be  finite.  This  means  that  the  number  of  samples 
of  g(t)  and  G(/)  must  be  finite.  In  other  words,  we  must  deal  with  time-limited  signals.  If  the 
signal  is  not  time-limited,  then  we  need  to  truncate  it  to  make  its  duration  finite.  The  same  is 
true  ot  G(/).  To  begin,  let  us  consider  a  signal  g(t)  of  duration  r  seconds,  starting  at  t  =  0, 
as  shown  in  Fig.  3.46a.  However,  for  reasons  that  will  become  clear  as  we  go  along,  we  shall 
consider  the  duration  of  g(t)  to  be  To,  where  To  >  r,  which  makes  g(t)  =  0  in  the  interval 
T  <  t  as  shown  in  Fig.  3.46a.  Clearly,  this  makes  no  difference  in  the  computation  of 
G(/).  Let  us  take  samples  of  g(t)  at  uniform  intervals  of  Ts  seconds.  There  are  a  total  of  No 
samples,  where 


Now,* 


(3.95) 


G(f)  = 


rTo 

/  g(t)e~j2jTft  dt 
Jo 


Aib-l 

=  ft,  £  *( WS)e-i2nfkT‘  Ts 
'  *= o 


(3.96) 


Let  us  consider  the  samples  of  G(/)  at  uniform  intervals  of /0 
Gq  =  G(qfo ).  then  from  Eq.  (3.96),  we  obtain 


If  Gq  is  the  <yth  sample,  that  is, 


Afo-l 

Gq=Y,  Tsg(kT5)e-j^foTk 
k=0 


N0-\ 

=  ^2  gke~jqQ ok 
k=  0 


(3.97) 


*  The  upper  limit  on  the  summation  in  Eq.  (3.96)  is  A/n  -  1  \r  ^  u 

(A^o  -  1  )ls  and  covers  the  area  under  the  summand  up  to  N()TS  =  CCaUSC  the  ,ast  term  in  the  sum  starts  at 
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where 


8k  =  Tsg(kTs)9  Gq  =  G(qfo),  =  2  nf0Ts 


(3.98) 


Thus,  Eq.  (3.97)  relates  the  samples  of  g(t)  to  the  samples  of  G(  f).  In  this  derivation,  we  have 
assumed  that  Ts  — ►  0.  In  practice,  it  is  not  possible  to  make  Ts  — >  0  because  it  will  increase 
the  data  enormously.  We  strive  to  make  Ts  as  small  as  is  practicable.  This  will  result  in  some 
computational  error. 

We  make  an  interesting  observation  from  Eq.  (3.97).  The  samples  Gq  are  periodic  with  a 
period  of  2tz/SIo  samples.  This  follows  from  Eq.  (3.97),  which  shows  that  G(q+2n/a0)  =  Gq. 
Thus,  the  number  of  samples  Gq  that  can  be  independent  is  only  27t/£V  Equation  (3.97) 
shows  that  Gq  is  determined  by  No  independent  values  g*.  Hence,  for  unique  inverses  of  these 
equations,  there  can  be  only  No  independent  sample  values  Gq.  This  means  that 


In  other  words,  we  have 


and  fo  =  — 


(3.100) 


Thus,  the  spectral  sampling  interval  fo  Hz  can  be  adjusted  by  a  proper  choice  of  7b:  the  larger 
the  To,  the  smaller  the fy.  The  wisdom  of  selecting  7o  >  r  is  now  clear.  When  7o  is  greater 
than  r,  we  shall  have  several  zero-valued  samples  g*  in  the  interval  from  r  to  7b.  Thus,  by 
increasing  the  number  of  zero-valued  samples  of  g*,  we  reduce/o  [more  closely  spaced  samples 
of  G(/)],  yielding  more  details  of  G(/).  This  process  of  reducing/o  by  the  inclusion  of  zero¬ 
valued  samples  g*  is  known  as  zero  padding.  Also,  for  a  given  sampling  interval  Tv,  larger 
To  implies  larger  No.  Thus,  by  selecting  a  suitably  large  value  of  No,  we  can  obtain  samples 
of  G(/)  as  close  as  possible. 

To  find  the  inverse  relationship,  we  multiply  both  sides  of  Eq.  (3.97)  by  e^mn°q  and  sum 
over  q  as 


q—Q  </=0  k=  0 


Upon  interchanging  the  order  of  summation  on  the  right-hand  side,  we  have 


N0-\  N0-\  Nq-1 


E  ej(m-k)Q.  0q 


ej{m-k)Qoq 


(3.101) 


To  find  the  inner  sum  on  the  right-hand  side,  we  shall  now  show  that 


r  1 


No  n  =  0,  ±No,  ±2No,  . . . 
0  otherwise 


(3.102) 
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To  show  this,  recall  that  QqNo  =  27r  and  e^°k  =  1  for  n  =  0,  ±No,  ±2No, . . . ,  so  that 


Afo-l  No~\ 

Y,ejnQ °k=  E  l=N°  n  =  0,±N0,±2N0,... 

k= 0  k= 0 


To  compute  the  sum  for  other  values  of  n,  we  note  that  the  sum  on  the  left-hand  side  of 
Eq.  (3.102)  is  a  geometric  series  with  common  ratio  a  =  Therefore,  its  partial  sum  of 
the  first  No  terms  is 


Afo-l 

E 


ejnQ0k  _ 


*=o 


ejnn0N0  _  j 
ejn& 0  —  1 


=  0 


where  =  e^nn  =  1 . 

This  proves  Eq.  (3.102).  It  now  follows  that  the  inner  sum  on  the  right-hand  side  of  Eq. 
(3.101 )  is  zero  for  k  j=-  m,  and  the  sum  is  No  when  k  =  m.  Therefore,  the  outer  sum  will  have 
only  one  nonzero  term  when  k  =  m,  which  leads  to  N0gk  =  No gm-  Therefore, 


tfo-l 


8m  -  No  £  G<> 


Jm&oq 


q=0 


_  2  7T 

S2°  =  — 
Nq 


(3.103) 


Equation  (3  103)  reveals  the  interesting  fact  that  g{m+No)  =  gm.  This  means  that  the 
sequence  g*  is  a  so  periodic  with  a  period  of  Nq  samples  (representing  the  time  duration 
NoTs  =  To  seconds).  Moreover,  Gq  is  also  periodic  with  a  period  of  N0  samples,  representing 
a  frequency  interval  A^o  =  1  ,TS  =  fs  Hz.  But  1  /Ts  is  the  number  of  samples  of  *(/>  per 
second.  Thus,  1  Ts  -  fs  is  the  sampling  frequency  (in  hertz)  ofg(r).  This  means  Gq  is  N0- 

penodic,  relating  every/,  Hz.  Let  us  summarize  the  results  derived  so  far.  We  have  proved 
the  discrete  Fourier  transform  (DFT)  pair 


Afo-l 

G<?  =  ]£  gke~jqa°k 
k=  0 


8k  = 


Afo-l 


No  S  G<? 


,jkSl0q 


9=0 


where 


(3.104a) 

(3.104b) 


Jo  \ - a  / 


~  f  2n 
2nfo  =  — 
To 


N0  =  ~  = 


To  fs 


fo 


ot  In 
2nfs  =  ~ 

Ts 

Vo  =  2nf0Ts  =  — 
No 


(3.105) 


repeating  with  period  To  second/a^d  c  W‘th  a  period  of  ^o  samples.  This  results  in  gk 

frequency).  The^samp. iTmeta,  of  ^  ^  ^  <‘he 

fc  ot  gk  ,s  T,  seconds  and  the  sampling  interval  of  Gq  is 
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/o  =  1  /To  Hz.  This  is  shown  in  Fig.  3.46c  and  d.  For  convenience,  we  have  used  the  frequency 
variable /  (in  hertz)  rather  than  co  (in  radians  per  second). 

We  have  assumed  g(t)  to  be  time-limited  to  r  seconds.  This  makes  G(f)  non-band- 
limited.*  Hence,  the  periodic  repetition  of  the  spectra  Gq ,  as  shown  in  Fig.  3.46d,  will  cause 
overlapping  of  spectral  components,  resulting  in  error.  The  nature  of  this  error,  known  as 
aliasing  error,  is  explained  in  more  detail  in  Chapter  6.  The  spectrum  Gq  repeats  every  fs 
Hz.  The  aliasing  error  is  reduced  by  increasing /y,  the  repetition  frequency  (see  Fig.  3.46d). 
To  summarize,  the  computation  of  Gq  by  means  of  DFT  entails  aliasing  error  when  g(t)  is 
time-limited.  This  error  can  be  made  as  small  as  desired  by  increasing  the  sampling  frequency 
fs  =  \/Ts  (or  reducing  the  sampling  interval  Ts).  The  aliasing  error  is  the  direct  result  of  the 
nonfulfillment  of  the  requirement  Ts  — ►  0  in  Eq.  (3.96). 

When  g(r)  is  not  time-limited,  we  need  to  truncate  it  to  make  it  time-limited.  This  will 
cause  further  error  in  Gq.  This  error  can  be  reduced  as  much  as  desired  by  appropriately 
increasing  the  truncating  interval  7o. f 

In  computing  the  inverse  Fourier  transform  [by  using  the  inverse  DFT  in  Eq.  (3.104b)], 
we  have  similar  problems.  If  G(f)  is  band-limited,  g(t)  is  not  time-limited,  and  the  periodic 
repetition  of  samples  gk  will  overlap  (aliasing  in  the  time  domain).  We  can  reduce  the  aliasing 
error  by  increasing  7o,  the  period  of  gk  (in  seconds).  This  is  equivalent  to  reducing  the  frequency 
sampling  interval  /o  =  1  /7o  of  G(f).  Moreover,  if  G(/)  is  not  band-limited,  we  need  to 
truncate  it.  This  will  cause  an  additional  error  in  the  computation  of  g By  increasing  the 
truncation  bandwidth,  we  can  reduce  this  error.  In  practice,  (tapered)  window  functions  are 
often  used  for  truncation5  to  reduce  the  severity  of  some  problems  caused  by  straight  truncation 
(also  known  as  rectangular  windowing). 

Because  Gq  is  No-periodic,  we  need  to  determine  the  values  of  Gq  over  any  one  period. 
It  is  customary  to  determine  Gq  over  the  range  (0,  No  —  1)  rather  than  over  the  range 
(-No/2,  No/2  —  1).  The  identical  remark  applies  to  g *. 


3.9.1  Choice  of  Ts,  To,  and  No 

In  DFT  computation,  we  first  need  to  select  suitable  values  for  No,  Ts,  and  To.  For  this  purpose 
we  should  first  decide  on  B,  the  essential  bandwidth  of  g(t).  From  Fig.  3.46d,  it  is  clear  that  the 
spectral  overlapping  (aliasing)  occurs  at  the  frequency  fs/2  Hz.  This  spectral  overlapping  may 
also  be  viewed  as  the  spectrum  beyond  fs/2  folding  back  at  fs/2.  Hence,  this  frequency  is  also 
called  the  folding  frequency.  If  the  folding  frequency  is  chosen  such  that  the  spectrum  G(/)  is 
negligible  beyond  the  folding  frequency,  aliasing  (the  spectral  overlapping)  is  not  significant. 
Hence,  the  folding  frequency  should  at  least  be  equal  to  the  highest  significant  frequency,  that 
is  ,  the  frequency  beyond  which  G(f)  is  negligible.  We  shall  call  this  frequency  the  essential 
bandwidth  B  (in  hertz).  If  g(r)  is  band-limited,  then  clearly,  its  bandwidth  is  identical  to  the 
essential  bandwidth.  Thus, 


-  >  B  Hz 
2  “ 


(3.106a) 


*  We  can  show  that  a  signal  cannot  be  simultaneously  time-limited  and  band-limited.  If  it  is  one,  it  cannot  be  the 
other,  and  vice  versa.3 

+  The  DFT  relationships  represent  a  transform  in  their  own  right,  and  they  are  exact.  If,  however,  we  identify  gk  and 
Gq  as  the  samples  of  a  signal  g(t)  and  its  Fourier  transform  G(/),  respectively,  then  the  DFT  relationships  are 
approximations  because  of  the  aliasing  and  truncating  effects. 
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Moreover,  the  sampling  interval  Ts  =  I  //*  [Eq.  (3.105)].  Hence, 

1 

Ts<  — 

~  2  B 

Once  we  pick  B ,  we  can  choose  Ts  according  to  Eq.  (3.106b).  Also, 


(3.106b) 


fo  = 

Jo 


(3.107) 


where /o  is  the  frequency  resolution  [separation  between  samples  of  G(/)].  Hence,  if/o  is 
given,  we  can  pick  To  according  to  Eq.  (3.107).  Knowing  Tq  and  Ts,  we  determine  No  from 


„  T0 
No  =  — 


(3.108) 


In  general,  if  the  signal  is  time-limited,  G(f)  is  not  band-limited,  and  there  is  aliasing  in 
the  computation  of  Gq.  To  reduce  the  aliasing  effect,  we  need  to  increase  the  folding  frequency, 
that  is,  reduce  Ts  (the  sampling  interval)  as  much  as  is  practicable.  If  the  signal  is  band-limited, 
gU)  is  not  time-limited,  and  there  is  aliasing  (overlapping)  in  the  computation  of  gk .  To  reduce 
lhis  abasing,  we  need  to  increase  T„.  the  period  of*,.  This  results  in  reducing  the  frequency 
samp  ing  interva  /o  (m  hertz).  In  either  case  (reducing  Ts  in  the  time-limited  case  or  increasing 

A/'  h'  C  J,\/  iTUAu’  *°r  accuracy>  we  need  to  increase  the  number  of  samples 
because  N0  -  7o/ T*.  There  are  also  signals  that  are  neither  time-limited  nor  band-limited. 
In  such  cases,  we  still  need  to  reduce  Ts  and  increase  T0. 

Points  of  Discontinuity 

fvereK  5Z ZTZTT*  “a  Sa?'ing  P0™- ,he  S™P"=  should  be  taken  as  the 

a  a S ofdisc^Tnl  disconlinui'>'  >*cause  the  Fourier  representation 

at  a  point  of  discontinuity  converges  to  the  average  value. 

l|sinf!  the  FFT  Algorithm  in  DFT  Computations 

ant,; h! tT  T  "  * 

Fourier  transform  (FFT).  reduces  the  number  of  %5'  TH'S  algonthm’  known  as  the  fast 
of  to  *  log  iv„.  T„  taffSTJ!? some,hing  on  ,he  orf ' ■ 

multiplications  and  N0  -  1  complex  addition®  V  Eq'  (3  l04a)’  we  recluire  M)  complex 

%  -  1).  we  require  a  total  of“7eompt,  Z  i'J°  Values  -  «• ' . 

For  large  N0.  lhis  can  be  prohibitively  nme-consum!™"5  and,A'oW(l  “  1 1  comPlex  addlMon' 
TheFFTis,  thus,  a  lifesaver  in  signal  nrra-  ne,  even  for  a  very  high  speed  computer. 

if  wo  choose  Nit  to  be  a  power  of  2  althouchdik^  ICa,l°nS' The  FhT  alf°rithm  is  simplified 
FFT  can  be  found  an/book  on  stgZpZLtg.3  nW  D“ils  °f  lte 
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~ . l  uting  r  ourter  Transforms 

In  this  section  of  computer  exercises  let  ..c  •  i 

in  finding  the  Fourier  transform.  We  shall  usTm  il,ustratin8  the  use  of  DFT 

use  MATLAB  to  find  DFT  by  the  FFT  algorithm. 
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In  the  first  example,  the  signal  g(t )  =  e  u(t)  starts  at  /  =  0.  In  the  second  example,  we  use 
g(t)  =  n(r),  which  starts  at  t  =  -T. 


COMPUTER  EXAMPLE  C3.1 

Use  DFT  (implemented  by  the  FFT  algorithm)  to  compute  the  Fourier  transform  of  e~2'u(t).  Plot  the 
resulting  Fourier  spectra. 


We  first  determine  Ts  and  7b.  The  Fourier  transform  of  e~2,u(t)  is  l/(j2nf  +  2).  This 
low-pass  signal  is  not  band-limited.  Let  us  take  its  essential  bandwidth  to  be  that  frequency 
where  |G(/)|  becomes  1%  of  its  peak  value  at/  =  0.  Observe  that 


|G(/)|  = 


1 

y/  (2?r/)2  +  4 


1 


2  nf  »  2 


Also,  the  peak  of  |G(/)|  is  at/  =  0,  where  |G(0)|  =  0.5.  Hence,  the  essential  bandwidth 
B  is  at/  =  B.  where 


|G(/)|  %  — — —  =  0.5  x  0.01  =»  B  =  —  Hz 
2  TIB  71 

and  from  Eq.  (3.106b), 


Ts  <  —  =  0.005 jt  =  0.0157 
“  2  B 

Let  us  round  this  value  down  to  Ts  =  0.015625  second  so  that  we  have  64  samples  per 
second.  The  second  issue  is  to  determine  To.  The  signal  is  not  time-limited.  We  need  to 
truncate  it  at  To  such  that  g(To)  <$C  1.  We  shall  pick  To  =  4  (eight  time  constants  of 
the  signal),  which  yields  Ab  =  To/Ts  =  256.  This  is  a  power  of  2.  Note  that  there  is 
a  great  deal  of  flexibility  in  determining  Ts  and  7o,  depending  on  the  accuracy  desired 
and  the  computational  capacity  available.  We  could  just  as  well  have  picked  7o  =  8  and 
Ts  =  1  /32,  yielding  No  =  256,  although  this  would  have  given  a  slightly  higher  aliasing 
error. 

Because  the  signal  has  a  jump  discontinuity  at  t  =  0,  the  first  sample  (at  t  —  0)  is  0.5, 
the  averages  of  the  values  on  the  two  sides  of  the  discontinuity.  The  MATLAB  program, 
which  uses  the  FFT  algorithm  to  implement  the  DFT,  is  as  follows: 


Ts=l/64;  T0=4;  N0=T0/Ts; 
t=0 : Ts : Ts* (N0-1) ;t=t' ; 
g=Ts*exp ( -2*t) ; 
g ( 1 ) =Ts  *0.5; 

G=f ft (g) ; 

[ Gp , Gm ] =cart2pol (real (G) , imag (G) ) ; 
k=0:N0-l;  k=k' ; 
w=2*pi*k/T0 ; 

subplot (211) , stem (w (1:32) , Gm(l :32)  )  ; 
subplot (212 ) , stem (w( 1 :32),Gp(l:32)) 
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Figure  3.47 

Discrete  Fourier 
transform  of  an 
exponential 
signal  e_2/w(/). 
Notice  that  the 
horizontal  axis  in 
this  case  is  co  (in 
radians  per 
second). 


Because  Gq  is  A^-periodic,  Gq  =  G(q+256)  so  that  G256  =  Go.  Hence,  we  need  to  plot 
Gq  over  the  range  q  =  0  to  255  (not  256).  Moreover,  because  of  this  periodicity,  G-q  = 
G(— <7+256)’  and  the  Gq  over  the  range  of  q  =  -127  to  -1  are  identical  to  the  Gq  over 
the  range  of  q  =  129  to  255.  Thus,  G_i27  =  Gi29,  G_i26  =  G130,  . . . ,  G_i  =  G255. 
In  addition,  because  of  the  property  of  conjugate  symmetry  of  the  Fourier  transform, 
G-q  =  G*,  it  follows  that  Gl29  =  G*27,  G130  =  G*26,  . . . ,  G255  =  G*.  Thus,  the  plots 
beyond  q  =  jVo/2  (128  in  this  case)  are  not  necessary  for  real  signals  (because  they  are 
conjugates  of  Gq  for  q  =  0  to  128. 

The  plot  of  the  Fourier  spectra  in  Fig.  3.47  shows  the  samples  of  magnitude  and  phase 
of  G(/)  at  the  intervals  of  l/7b  =  0.25  Hz  or  a*  =  1.5708  rad/s.  In  Fig.  3.47,  we  have 
shown  only  the  first  28  points  (rather  than  all  128  points)  to  avoid  too  much  crowding  of 
the  data. 

In  this  example,  we  knew  G(/)  beforehand  and  hence  could  make  intelligent  choices 
tor  B  (or  the  sampling  frequency/,).  In  practice,  we  generally  do  not  know  G(/)  before- 
hand.  In  tact,  that  is  the  very  thing  we  are  trying  to  determine.  In  such  a  case,  we  must 
make  an  intelligent  guess  for  B  or/,  from  circumstantial  evidence.  We  should  then  con¬ 
tinue  reducing  the  value  of  Ts  and  recomputing  the  transform  until  the  result  stabilizes 
within  the  desired  number  of  significant  digits. 


Next,  we  compute  the  Fourier  transform  of  g(t)  = 


sn«). 
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by  ,te  ^  ale°rilhm*  “  -P-  -  ,  n,„.  «  * 
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This  rectangular  function  and  its  Fourier  transform  are  shown  in  Fig.  3.48a  and  b.  To 
determine  the  value  of  the  sampling  interval  Ts,  we  must  first  decide  on  the  essential 
bandwidth  B.  From  Fig.  3.48b,  we  see  that  G(f )  decays  rather  slowly  with  /.  Hence, 
the  essential  bandwidth  B  is  rather  large.  For  instance,  at  15.5  Hz  (97.39  rad/s),  G(f)  = 
—0.1643,  which  is  about  2%  of  the  peak  at  G( 0).  Hence,  the  essential  bandwidth  may  be 
taken  as  16  Hz.  However,  we  shall  deliberately  take  B  =  4  for  two  reasons:  (1)  to  show 
the  effect  of  aliasing  and  (2)  because  the  use  of  B  >  4  will  give  an  enormous  number 
of  samples,  which  cannot  be  conveniently  displayed  on  a  book-sized  page  without  losing 
sight  of  the  essentials.  Thus,  we  shall  intentionally  accept  approximation  for  the  sake  of 
clarifying  the  concepts  of  DFT  graphically. 

The  choice  of  B  =  4  results  in  the  sampling  interval  Ts  =  1  /2B  =  1  /8.  Looking 
again  at  the  spectrum  in  Fig.  3.48b,  we  see  that  the  choice  of  the  frequency  resolution 
fo  =  0.25  Hz  is  reasonable.  This  will  give  four  samples  in  each  lobe  of  G(f).  In  this  case 
To  =  l//o  =  4  seconds  and  N0  =  T0/Ts  =  32.  The  duration  of  g(t)  is  only  1  second.  We 
must  repeat  it  every  4  seconds  (7b  =  4),  as  shown  in  Fig.  3.48c,  and  take  samples  every 
eighth  of  a  second.  This  gives  us  32  samples  (No  =  32).  Also, 


gk  =  Tsg(kTs ) 

=  ^(*TJ 


Since  g(t)  =  8  n (/),  the  values  of  gk  are  1,  0,  or  0.5  (at  the  points  of  discontinuity), 
as  shown  in  Fig.  3.48c.  In  this  figure,  gk  is  shown  as  a  function  of  t  as  well  as  k,  for 
convenience. 

In  the  derivation  of  the  DFT,  we  assumed  that  g(t)  begins  at  t  =  0  (Fig.  3.46a),  and 
then  took  No  samples  over  the  interval  (0,  7o).  In  the  present  case,  however,  g(t)  begins  at 
—  j .  This  difficulty  is  easily  resolved  when  we  realize  that  the  DFT  found  by  this  procedure 
is  actually  the  DFT  of  gk  repeating  periodically  every  7o  seconds.  From  Fig.  3.48c,  it  is 
clear  that  repeating  the  segment  of  gk  over  the  interval  from  —2  to  2  seconds  periodically 
is  identical  to  repeating  the  segment  of  gk  over  the  interval  from  0  to  4  seconds.  Hence, 
the  DFT  of  the  samples  taken  from  —2  to  2  seconds  is  the  same  as  that  of  the  samples 
taken  from  0  to  4  seconds.  Therefore,  regardless  of  where  g(t)  starts,  we  can  always  take 
the  samples  of  g(t)  and  its  periodic  extension  over  the  interval  from  0  to  7o.  In  the  present 
example,  the  32  sample  values  are 


gk 


1  0  <  k  <  3  and  29  <  k  <  3 1 

0  5  <  k  <  27 

0.5  k  =  4, 28 


Observe  that  the  last  sample  is  at  t  =  31/8,  not  at  4,  because  the  signal  repetition  starts 
at  t  =  4,  and  the  sample  at  t  =  4  is  the  same  as  the  sample  at  t  =  0.  Now,  No  =  32  and 
£2o  =  2tt/32  =  n/\6.  Therefore  [see  Eq.  (3.104a)], 

!31 

Gq  = 

*=0 

The  MATLAB  program  that  uses  the  FFT  algorithm,  to  implement  this  DFT  equation 
is  given  next.  First  we  write  a  MATLAB  program  to  generate  32  samples  of  g and  then 
we  compute  the  DFT. 
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%  (c32.m) 

B=4 ;  f 0=1/4; 

Ts=l/ (2*B) ;  T0=1 / f 0 ; 
N0=T0/Ts; 


Figure  3.48 

Discrete  Fourier 
transform  of  a 
rectangular 


pulse. 


k=0 :N0 ;  k=k' ; 
for  m=l : length (k) 

if  k(m)>=0  &  k(m)<=3/  gk(m)=l;  end 
if  k(m)==4  Sc  k(m)==28  gk(m)=0.5;  end 
if  k(m)>  =  5  &  k(m)<=27/  gk(m)=0;  end 
if  k  (m)  >=29  &  k(m)<=31,  gk(m)=l;  end 

end 

gk=gk' ; 

Gr=f f t (gk) ; 

subplot (211) , stem(k,gk) 
subplot (212) , stem(k,Gr) 


Figure  3.48d  shows  the  plot  of  GCJ. 


The  samples  Gq  are  separated  by/0  =  \/T0  Hz.  In  this  case  To  =  4.  so  the  frequency 
resolution  fo  is  0.25  Hz,  as  desired.  The  folding  frequency  fs/2  =  B  =  4  Hz  corresponds 
to  q  =  N0/2  =  16.  Because  Gq  is  /V0-periodic  (N0  =  32),  the  values  of  Gq  for  q  =  - 16 

to  q  =  - 1  are  the  same  as  those  for  q  =  1 6  to  q  =  3 1 .  The  DFT  gives  us  the  samples  of 
the  spectrum  G(f). 

For  the  sake  of  comparison.  Fig.  3.48d  also  shows  the  shaded  curve  8  sine  (nf),  which 
is  the  Founertransform  of  8  n  (r).  The  values  of  G,  computed  from  the  DFT  equation  show 
aliasing  error,  which  is  clearly  seen  by  comparing  the  two  suDerimoosed  nlots  The  error 


any  value  q,  we  must  make  sure  that  N0  »  q.  This  observation 


negligible  aliasing  error  at 
is  valid  for  all  signals  with 


jump  discontinuities. 


-it-*; 


Filtering 


-4 
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Figure  3.48 

Discrete  Fourier 
transform  of  a 
rectangular 
pulse. 
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COMPUTER  EXAMPLE  C3.3 

The  signal  g(t )  in  Fig.  3.49a  is  passed  through  an  ideal  low-pass  filter  of  transfer  function  H  (/).  shown 
in  Fig.  3.49b.  Using  DFT,  find  the  filter  output. 


Figure  3.49 

Filtering  g(t) 
through  H(f). 


8 

gC 

(a) 

1 

H(f) 

(b) 

1.5 

0.5  t  -► 

2 

2  /( Hz)-*- 

(c) 


We  have  already  found  the  32-point  DFT  of  *(/)  (see  Fig.  3.48d).  Next  we  multiply  Gq  by 
Hq.  To  compute  Hq,  we  remember  that  in  computing  the  32-point  DFT  of  g(r),  we  have 
used/o  =  0.25.  Because  Gq  is  32-periodic,  Hq  must  also  be  32-periodic  with  samples 
separated  by  0.25  Hz.  This  means  that  Hq  must  be  repeated  every  8  Hz  or  1 6zr  rad/s  (see 
Fig.  3.49c).  This  gives  the  32  samples  of  Hq  over  0  </  <  8  as  follows: 


Hq  = 


1  0  <  q  <  7 

0  9  <  q  <  23 

0.5  9  =  8,24 


and 


25  <  q  <  31 


p[g  and  ,ate  the  inverse  DFT-  The  resulting  output  signal  is  shown  in 

S  ™  g'eesapnntoutofg, ,G „H,,  and  v,. 

T  DFr<Ce»»fs<')  in  Example  C3.2.  The  MATLAB 

■„  tfe  MATI  AR  ,  l  S'  SaVed  as  ln  C32  .m).  We  can  import  G, 

of  H  muhttk  r  h  rema  y  l  ?Tnand  c32-  N“-  generate  32-point  samples 
by  coCl^h  £.  DPf  “  »'  »*»  «"d  » 
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c32  ; 

q=  0:32;  q=q'; 
for  m=l : length (q) 

if  q (m)  >  =  0  &  q (m)  <=7 ,  Hq(m)=l;  end 
if  q(m)>=25  &  q(m)<=31,  Hq(m)=l;  end 
if  q  (m)  >  =  9  &  q(m)<=23/  Hq(m)=0;  end 
if  q (m) ==8  &  q(m)==24,  Hq(m)=0.5;  end 

end 

Hq=Hq' ; 

Yq=Gq. *Hq; 
yk=if f t (Yq) ; 
elf , stem (k, yk) 


TABLE  3.4 


No. 

gk 

Gq 

Hq 

GqHq 

yk 

0 

1 

8,000 

1 

8.000 

.9285 

1 

1 

7,179 

1 

7.179 

1.009 

2 

1 

5,027 

1 

5.027 

1.090 

3 

1 

2,331 

1 

2.331 

.9123 

4 

1 

0.000 

1 

0.000 

.4847 

5 

0.5 

-1.323 

1 

-1.323 

.08884 

6 

0 

-1.497 

1 

-1.497 

-.05698 

7 

0 

-.8616 

1 

-.8616 

-.01383 

8 

0 

0.000 

0.5 

0.000 

.02933 

9 

0 

.5803 

0 

0.000 

.004837 

10 

0 

.6682 

0 

0.000 

-.01966 

11 

0 

.3778 

0 

0.000 

-.002156 

12 

0 

0.000 

0 

0.000 

.01534 

13 

0 

-.2145 

0 

0.000 

.0009828 

14 

0 

-.1989 

0 

0.000 

-.01338 

15 

0 

-.06964 

0 

0.000 

-.0002876 

16 

0 

0.000 

0 

0.000 

.01280 

17 

0 

-.06964 

0 

0.000 

-.0002876 

18 

0 

-.1989 

0 

0.000 

-.01338 

19 

0 

-.2145 

0 

0.000 

.0009828 

20 

0 

0.000 

0 

0.000 

.01534 

21 

0 

.3778 

0 

0.000 

-.002156 

22 

0 

.6682 

0 

0.000 

-.01966 

23 

0 

.5803 

0 

0.000 

.004837 

24 

0 

0.000 

0.5 

0.000 

.03933 

25 

0 

-.8616 

1 

-.8616 

-.01383 

26 

0 

-1.497 

1 

-1.497 

-.05698 

27 

0 

-1.323 

1 

-1.323 

.08884 

28 

0.5 

0.000 

1 

0.000 

.4847 

29 

1 

2.331 

1 

2.331 

.9123 

30 

1 

5.027 

1 

5.027 

1.090 

31 

1 

7.179 

1 

7.179 

1.009 
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PROBLEMS 


3.1-1  Show  that  the  Fourier  transform  of  g(t)  may  be  expressed  as 

[°°  [00 
G(/)  =  /  g(t) COS  2n ft  dt  —  j  I  g(t)  sin  2 nft  dt 

J~°°  J- 00 

Hence,  show  that  if  g(t)  is  an  even  function  of/,  then 

Jr  oo 

'  g(t)  cos  2nft  dt 

0 

and  if  g(t)  is  an  odd  function  of/,  then 


G(f)  = 

Hence,  prove  that: 

If  gU )  is: 

a  real  and  even  function  of  / 
a  real  and  odd  function  of  / 
an  imaginary  and  even  function  of/ 
a  complex  and  even  function  of  / 
a  complex  and  odd  function  of  / 


00 

g(0  sin  2nft  dt 


Then  G( f  )  is: 

a  real  and  even  function  off 
an  imaginary  and  odd  function  off 
an  imaginary  and  even  function  off 
a  complex  and  even  function  off 
a  complex  and  odd  function  off 


3.1-2  (a)  Use  direct  integration  to  find  the  Fourier  transform  G(/)  of  signal 

S(0  =  exp(-2|/ -  3|) 

(W  Vse  din*,  imegmiod  ,o  find  ,1*  Fonrler  transfomi  o( 


3-1-3  II  g(t)  <=>  G(f),  then  show  that  g*(t)  «=>  G*(-f) 

3-1-4  From  definition  (3.9a),  find  Ihe  Fourier  nansfonm  of  the  signals  shown  in  Fig.  P3, 1-4. 
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Figure  P.3.1-4 


£(') 


4 

2 


1  2 


(a) 


3.1-5  From  definition  (3.9b),  find  the  inverse  Fourier  transforms  of  the  spectra  shown  in  Fig.  P3.1-5. 


Figure  P.3.1-5 


G{f) 


-2  -l 


(b) 


3.1-6  From  definition  (3.9b),  find  the  inverse  Fourier  transforms  of  the  spectra  shown  in  Fig.  P3. 1-6. 


3.1-7  Show  that  the  two  signals  in  Fig.  P3.  l-7a  and  b  are  totally  different  in  time  domain,  despite  their 
similarity. 


F'gure  P.3.1-7 
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Hint:  G(f)  =  \G(f)\e>W\  For  Fig.  P3.1-7a,  G(/)  =  1  •  e  [Yl  <  B.  whereas  for 

Fig.  P3. 1 -7b, 


1  e~M2  =  -j  0  </  <  B 
\ejn/2  _  j  0  >f>-B 


! 

I 


3.2-1  Sketch  the  following  functions: 


(a)  n (//3) ;  (b)  A (3a:/ 100) ;  (c)n(f  -  5/4);  (d)  sinc[(2;r/  -  10tt)/5];  (e)  sinc(7rf/5);  (f) 
sine  (tt//5)  n(//10) 

3.2- 2  Use  pairs  7  and  12  (Table  3.1)  to  show  that  u(t)  <=>•  0.5  8(f)  +  X/jlnf. 

Hint:  Add  1  to  sgn  (/),  and  see  what  signal  comes  out. 

3.2- 3  Show  that  sin  (2n f0t  +  0)  •<=>■  ^[8(f  +  fo)e~j0+J°-5n  +  8(f  -  fQ)ej0-j0-S”  ]. 

Hint:  Use  Euler’s  formula  to  express  sin  (Infot  +  0)  in  terms  of  exponentials. 

3.3- 1  Apply  the  duality  property  to  the  appropriate  pair  in  Table  3. 1  to  show  that: 

(a)  0.5[S(f)  +  (j/nt)\  <=>  u(f ) 

(b)  S(t  +  T)  +  8(t  —  T)  <=>  2  cos  2 jtJT 

(c)  S(f  +  7")  —  8(t  —  T)  •$=>  2 /  sin  IrtjT 


Hint:  g(—t)  <=>  G(— /)  and  8(t)  =  8(-t). 

3.3-2  The  Fourier  transform  of  the  triangular  pulse  g(t)  in  Fig.  P3.3-2a  is  given  as 


Use  this  information,  and  the  time-shifting  and  time-scaling  properties,  to  find  the  Fourier 
transforms  of  the  signals  shown  in  Fig.  P3.3-2b,  c,  d,  e,  and  f. 


Figure  P.3.3-2 


(f) 


Figure  P.3.3-3 


Figure  P.3.3-4 
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Hint: Time  inversion  ing(f)  results  in  the  pulse  g,  (/)  in  Fig.  P3.3-2b; consequently gl  (r)  =  g(-t). 
The  pulse  in  Fig.  P3.3-2c  can  be  expressed  as  g(t-  T)  +gl  (t  -  T)  [the  sum  of  g(t)  and  g,(r)  both 
e  ayed  by  T\.  The  pulses  in  Fig.  P3.3-2d  and  e  both  can  be  expressed  as  g(t  —  T)  +  g\(t  +  T) 
[the  sum  of  *(/)  delayed  by  T  and  g,  (t)  advanced  by  T]  for  some  suitable  choice  of  T.  The  pulse 
in  Fig.  P3.3-2f  can  be  obtained  by  time-expanding  g(t)  by  a  factor  of  2  and  then  delaying  the 
resulting  pulse  by  2  seconds  [or  by  first  delaying  g(t)  by  1  second  and  then  time-expanding  by  a 
factor  of  2],  *  3 


3.3-3  Using  only  the  time-shifting  property  and  Table  3.1,  find  the  Fourier  transforms  of  the  signals 
shown  in  Fig.  P3.3-3. 

Hint:  The  signal  in  Fig.  P3.3-3a  is  a  sum  of  two  shifted  rectangular  pulses.  The  signal  in 
Fig. P3.3-3bissin  t[u(t)-u(t-n)]  =  sin  /M(f)-sin  tu(t-n)  =  sin  tu(t)+s'm(t-ji)u(t—jr). 
The  reader  should  verify  that  the  addition  of  these  two  sinusoids  indeed  results  in  the  pulse 
in  Fig.  P3.3-3b.  In  the  same  way,  we  can  express  the  signal  in  Figs.  P3.3-3c  as  cos  t  u(t)  + 
sin  (r  -  n/2)u(t  -  n/2)  (verify  this  by  sketching  these  signals).  The  signal  in  Fig.  P3.3-3d  is 
e~a'lu(t )  -  u(t  -  7)]  =  e~a,u(t)  -  e~aT e~a(,~T)u(t  -  T). 


(b) 


(d) 


3.3-4  Use  the  time-shifting  property  to  show  that  if  g(t)  <=>  G(/),  then 


g(t  +  T)  +  g(t  -  T)  <=>  2G(f)  cos  2njT 


This  is  the  dual  of  Eq.  (3.37).  Use  this  result  and  pairs  17  and  19  in  Table  3. 1  to  find  the  Fourier 
transforms  of  the  signals  shown  in  Fig.  P3.3-4. 


Figore  P.3.3-4 


3.3-5  Prove  the  following  results: 


g(r)sin  2nf0t  <=>  G(f  -  f0 )  -  G(f  +/0)[ 
+  T)  -  g(t  -  T )]  <=>  G(f)  sin  2 njT 

2 1 


Using  the  latter  result  and  Table  3.1,  find  the  Fourier  transform  of  the  signal  in  Fig.  F3.3-5. 
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Figure  P.3.3-5 

1 

2  3  - 

4 

-4  -3  -2  0 

-1 

3.3-6  The  signals  in  Fig.  P3.3-6  are  modulated  signals  with  carrier  cos  lOf.  Find  the  Fourier  transforms 
of  these  signals  using  the  appropriate  properties  of  the  Fourier  transform  and  Table  3.1.  Sketch 
the  amplitude  and  phase  spectra  for  Fig.  P3.3-6a  and  b. 

Hint:  These  functions  can  be  expressed  in  the  form  g(t)  cos  2nfot. 


(a) 


(b) 


(c) 


3.3- 7  A  signal  g(r)  is  band-limited  to  B  Hz.  Show  that  the  signal  gn(t)  is  band-limited  to  nB  Hz. 

Him:  g-(t)  <!=>  [G(f)  *  Gif)],  and  so  on.  Use  the  width  property  of  convolution. 

3.3- 8  Find  the  Fourier  transform  of  the  signal  in  Fig.  P3.3-3a  by  three  different  methods: 

(a)  By  direct  integration  using  the  definition  (3.9a). 

(b)  Using  only  pair  17  in  Table  3.1  and  the  time-shifting  property. 

<C>  ^  ,ime‘different'ati0n  and  time--shifting  properties,  along  with  the  fact  that 


Hint:  I  -  cos  2x  =  2  sin2  *. 


ulation  Show  thatth^"8  '  *^from  the  m°dulated  signal  g(t)  cos  2 nfyt  is  called  demod 
Tu  T*.  **  ca"  by  multiplying  i,  with  2  co.  ** 

“IS  8  <  t  '  "8  '  l0W'P,S5  """  of  «  H*  lL  Ldwidlh  of  *«>] 


Hint:  2  cos2  2nfot  =  1  +  cos  4nfQt. 
at  2/0  and  will  be  suppressed  by  a  low 


Recognize  that  the  spectrum  of  g(t)  cos  4n fat  is  centered 
-pass  filter  of  bandwidth  B  Hz. 


and  output  signal y(f)  areHnelr.^16™1"6  Wh'Ch  °f  ^  followin§  systems  with  input  signal  x(t) 


(a)  y(t )  ~  ax(t)  +  b ,  where  constants  a,  b  ^  0 

(b)  y(t)  =  a  cos  *(/),  where  A  is  constant 


3.4-; 


Figure  P.3.4-2 


*■ 


%2 


3.4-3 


3.5-1 
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(c)  y(t)  —  c\x(t  —  t\ )  +  C2x(t  —  t2),  where  constants  c\ ,  C2  ^  0 

(d)  y(t)  =  eM'xft) 

(e)  v(0  =  A(cos  coQt)x(t) 

(f)  v(r)  =  A(sin  (OQt)x(t) 

(g)  >’(')  =  A  (cos  coot)x(r  -  t0)  -  £(sin  coot)x(t) 

(h)  y(t)  =  Ap(t  —  x)x(x)dx,  where  A  is  constant 

(i)  y(t)  =  R e{x(/)} 

( j)  =  Im{x(f)} 

3.4-2  Signals  £i(/)  =  1 04 n ( 1 04/ )  and  g2(t)  =  <5(0  are  applied  at  the  inputs  of  the  ideal  low-pass 
filters  H\  (/)  =  n(//20,000)  and  H2(f)  =  n(//10,000)  (Fig.  P3.4-2).  The  outputs yx(t)  and 
y2(t)  of  these  filters  are  multiplied  to  obtain  the  signal  y(t)  =  y\  (t)y2(t). 

(a)  Sketch  G\  (/)  and  G2(f). 

(b)  Sketch  H\(f)  and  H2(f). 

(c)  Sketch  Yx(f)  and  Y2(f). 

(d)  Find  the  bandwidths  of  y\ (t), y2(t),  andy(/). 


3.4-3  For  a  linear  system  with  transfer  function 


//(/)  = 


1 


1  +j2nf 

find  the  output  signal  y(f)  of  this  system  when  the  input  signal  is  given  by 


x(t)  =  1  +  28  (t  -  to)  -  cos  (coot)  +  y^A/g  a,(/  U)u(t  -  /,)  a\  >  0 

i=l 

3.5-1  For  systems  with  the  following  impulse  responses,  which  system  is  causal? 


(a)  h(t)  =  e  atu(t ),  a  >  0 

(b)  h(t)  =  e~a^,  a  >  0 

(c)  h(t)  =  e~a(t-to)u(t  -  t0),  a>0,t0>0 

(d)  h(t)  =  sine  (at),  a  >  0 

(e)  h(t)  =  sine  [a(t  -  fo)]<  a  >  0 
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3.5-2  Consider  a  filter  with  the  transfer  function 

_  e-{2xk\f\+jbifto) 

Use  the  time-domain  criterion  [noncausal  /i(f)]  and  the  frequency  domain  (Paley-Wiener)  cri¬ 
terion  to  show  that  this  filter  is  physically  unrealizable.  Can  this  filter  be  made  approximately 
realizable  by  choosing  a  sufficiently  large  /o?  Use  your  own  (reasonable)  criterion  of  approximate 
realizability  to  determine  fy- 


3.5-3  Show  that  a  filter  with  transfer  function 


//(/)  = 


2(p)  c-j27ifto 
(Inf)2  +  fi 


is  unrealizable.  Can  this  filter  be  made  approximately  realizable  by  choosing  a  sufficiently  large 
'0?  TJse  your  own  (reasonable)  criterion  of  approximate  realizability  to  determine  tQ. 


Hint:  Show  that  the  impulse  response  is  noncausal. 

3.5- 4  A  bandpass  signal  g(/)  of  bandwidth  B  =  2000  Hz  centered  at/  =  10^  Hz  is  passed  through  the 

RC  filter  in  Example  3. 1 6  (Fig.  3.28a)  with  RC  =  1 0“^ .  If  over  the  passband,  the  variation  of  less 
than  2c/c  in  amplitude  response  and  less  than  1%  in  time  delay  is  considered  to  be  distortionless 
transmission,  would  g(t)  be  transmitted  without  distortion?  Find  the  approximate  expression  for 
the  output  signal. 

3.6- 1  A  certain  channel  has  ideal  amplitude,  but  nonideal  phase  response  (Fig.  P3.6-1),  given  by 


\H(f)\  =  1 

&h(f)  =  -2nftQ  -  k  sin  InfT  k  <&  1 


(a)  Show  that  y(t),  the  channel  response  to  an  input  pulse  g(t)  band-limited  to  B  Hz,  is 

,v(/)  =  g(t  - 10)  +  t[g(,  — —  T)  —  g(t  -  t0  +  T)] 

Him:  Usee"'*  *  W  «  i -jk  sin  2nfT. 

(b)  Discuss  how  this  channel  will  affect  time-  and  frequency  division  multiplexing  systems  from 
the  viewpoint  of  interference  among  the  multiplexed  signals. 


Figure  P.3.6-2 


2  wr  Sh!™ri  ir!r«  t'y  n'ul''pJth 'ransmisilon  cari  be  partly  corrected  by  a  tapped  delay  eqoab 
rer.  bhow  that  ita  c  I ,  the  disronton  m  the  mulripalh  system  in  Frg.  3.37a  can  be  appdrxinralely 
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corrected  it  the  received  signal  in  Fig.  3.37a  is  passed  through  the  tapped  delay  equalizer  shown 
in  Fig.  P3.6-2. 

Hint.’  From  Eq.  (3.65a),  it  is  clear  that  the  equalizer  filter  transfer  function  should  be  //eq(/)  = 

1/(1  +  a  e  f  ).  Use  the  fact  that  1/(1  —  at)  =  1  4-  x  -p +  x^  +  •  •  •  if  x  <&  1  to  show  what 
the  tap  parameters  aj  should  be  to  make  the  resulting  transfer  function 


# (/)tfeq(/)  *  e-JWo 


Figure  P.3.6-2 


Input 


At  At 


At  At 


•  •  •  • 


Output 


3.6-3  The  input  .r(/)  and  the  output  y(t)  of  a  certain  nonlinear  channel  are  related  as 


y(/)  =  jc(/)  4-  0.001jc2(0 


(a)  Find  the  output  signal  y(t)  and  its  spectrum  Y(f)  if  the  input  signal  is  x (t)  = 

•^sinc  (1000/). 

(b)  Verify  that  the  bandwidth  of  the  output  signal  is  twice  that  of  the  input  signal.  This  is  the 
result  of  signal  squaring. 

(c)  Can  signal  r(f)  be  recovered  (without  distortion)  from  the  output  y(/)?  Explain. 

(d)  The  input  signal  x(t)  is  frequency-division-multiplexed  along  with  several  other  signals  on  this 
channel.  This  means  that  several  signals  that  occupy  disjoint  frequency  bands  are  transmitted 
simultaneously  on  the  channel.  Explain  what  effect  the  signal-squaring  property  of  the  channel 
would  have  on  the  recovery  of  the  individual  signals  at  the  output. 

(e)  If  a  TDM  signal  consisting  of  two  interleaved  pulse  trains  is  applied  at  the  input,  can  the  two 
trains  be  recovered  at  the  output  (i)  without  distortion  and  (ii)  without  interference? 

3.7-1  Use  ParsevaFs  theorem  to  solve  the  following  integral: 


3.7-2  Generalize  ParsevaFs  theorem  to  show  that  for  real,  Fourier  transformable  signals  g  j  (t)  and  g2(0* 


3.7-3  Show  that 
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Figure  P.3.7-5 


Hint:  Recognize  that 


jnfk/B 


sine  (litBt  -  kit)  =  sine  ^2jtB  ^  j  <=>  ^  n  (  ^  ) 

Use  this  fact  and  the  result  in  Prob.  3.7-2  to  show  that 

/oo  j  rB  ^ 

sine  (InBt  —  nut)  sine  (2 nBt  —  nn)dt  =  — «  I  ejU»-m)/2B]2nt 
-00  4 B-  J-B 

The  desired  result  follows  from  this  integral. 

3.7-4  For  the  signal 


g(0  = 


la 


determine  in  hertz  the  essential  bandwidth  B  of  g(t)  such  that  the  energy  contained  in  the  spectral 
components  of  g(t)  of  frequencies  below  B  Hz  is  99%  of  the  signal  energy  Eg . 

Hint:  Determine  G(f)  by  applying  the  duality  property  [Eq.  (3.26)]  to  pair  3  of  Table  3. 1 . 

3.7-5  A  low-pass  signal  g(t)  is  applied  to  a  squaring  device.  The  squarer  output  g  2  (t)  is  applied  to  a 
unity-gain  ideal  low-pass  filter  of  bandwidth  A/  Hz  (Fig.  P3.7-5).  Show  that  if  A/  is  very  small 
(A/  0).  the  filter  output  is  a  dc  signal  of  amplitude  2EgAf,  where  Eg  is  the  energy  of  g(t). 

Hint:  The  output  y(t)  is  a  dc  signal  because  its  spectrum  Y(  f)  is  concentrated  at  /  =  0  from 
-A/  to  A/  with  A/  ->  0  (impulse  at  the  origin).  If  g2(t)  <=>  A(f),  and  y(r)  <=>  Y(f ),  then 
Y (/)  %  (2A(0)A/]<S(/).  Now,  show  that  Eg  =  A(0). 


g(t)  ^ 

(  )2 

^  g2<0 

Low-pass 

y(6=  2£A/ 

filter 

Figure  P.3.8-2 


Figure  P.3.8-4 


3"  7  "  function  ot  g(t)  =  C  cos  (2 nf0t  +  B0)  is  given  by  Hg(t)  = 

(t  /2)  cos  and  the  corresponding  PSD  is  Sg(f)  =  (C2/4)[8(f  -  fn)  +  8(f  +  M- 
Hence,  show  that  for  a  signal  v(f)  given  by  7  [  7  j0  +  7  J0 


oo 


y(t)  ~  C0  +  53  C"  cos  (nlnfot  +  e„) 

n=\ 


the  autocorrelation  function  and  the  PSD  are  given  by 

,  00 

n>(0  =  c0‘  +  2  C"2  cos  «27T/0r 

71=1 

|  00 

Sy(f)  =  Co2 8(f)  +  -  £  C2[8(f  -  nf0)  +  8(f  +  nfo)] 

71=1 

//<m:  Show  that  ifg(,)  =  gl(,)+^(,)  then7l 

where  7e„.„,(r)  =  |iniT  n  /r,  rT/2  *  '  *' W  +  W  +  *BftW  +  Thai  W* 

00  ^ -T 1 2#i  4-  t) dt.  If gj (/)  and g2(t)  repreSent any  two 
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of  the  infinite  terms  in  y(r),  then  show  that  Hg]g2( r)  =  Hg2gl  (r)  =  0.  To  show  this,  use  the  fact 
that  the  area  under  any  sinusoid  over  a  very  large  time  interval  is  at  most  equal  to  the  area  of  the 
half-cycle  of  the  sinusoid. 

3.8-2  The  random  binary  signal  x(t )  shown  in  Fig.  P3.8-2  transmits  one  digit  every  T ^  seconds.  A  binary 
1  is  transmitted  by  a  pulse  p{t)  of  width  7^/2  and  amplitude  A;  a  binary  0  is  transmitted  by  no 
pulse.  The  digits  1  and  0  are  equally  likely  and  occur  randomly.  Determine  the  autocorrelation 
function  Hx(t)  and  the  PSD  Sx(f). 


Figure  P.3.8-2 


1 

0  0  1 

0  1 

1 

1 

0  1 

0 

*(o  r 

LJ 
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l 

. _ ^ 

3.8- 3  Find  the  mean  square  value  (or  power)  of  the  output  voltage  y(t)  of  the  RC  network  shown 

in  Fig.  3.28a  with  RC  =  2i r  if  the  input  voltage  PSD  Sx(f)  is  given  by  (a)  K\  (b)  I~I(7r/); 
(c)  [<$(/  -f  1)  +  8(f  —  1)].  In  each  case  calculate  the  power  (mean  square  value)  of  the  input 
signal  jc(r). 

3.8- 4  Find  the  mean  square  value  (or  power)  of  the  output  voltage  y(t)  of  the  system  shown  in  Fig.  P3.8-4 

if  the  input  voltage  PSD  Sx(f )  =  n  (j if).  Calculate  the  power  (mean  square  value)  of  the  input 
signal  .r(/). 


Figure  P.3.8-4 


3.8-5  Consider  a  linear  system  with  impulse  response  e~2,u(t).  The  linear  system  input  is 

g(t)  =  w(t)  -  cos  (67 rt  +  ^) 

in  which  w(t )  is  a  noise  signal  with  power  spectral  density  of 

*.</>  =  "  (j) 

(a)  Find  the  total  output  power  of  the  linear  system. 

(b)  Find  the  output  power  of  the  signal  component  due  to  the  sinusoidal  input. 

(c)  Find  the  output  power  of  the  noise  component. 

(d)  Determine  the  output  signal-to-noise  ratio  (SNR)  in  decibels. 


A  AMPLITUDE  MODULATIONS 
A  AND  DEMODULATIONS 


Modulation  often  refers  to  a  process  that  moves  the  message  signal  into  a  specific 
frequency  band  that  is  dictated  by  the  physical  channel  (e.g.,  voiceband  telephone 
modems).  Modulation  provides  a  number  of  advantages  mentioned  in  Chapter  1, 
including  ease  of  RF  transmission  and  frequency  division  multiplexing.  Modulations  can  be 
analog  or  digital.  Though  traditional  communication  systems  such  as  AM/FM  radios  and  NTSC 
television  signals  are  based  on  analog  modulations,  more  recent  systems  such  as  second-  and 
third-generation  cellular  phone  systems,  HDTV,  and  DSL  are  all  digital. 

In  this  chapter  and  the  next,  we  will  focus  on  the  classic  analog  modulations:  amplitude 
modulation  and  angle  modulation.  Before  we  begin  our  discussion  of  different  analog  modula¬ 
tions,  it  is  important  to  distinguish  between  communication  systems  that  do  not  use  modulation 
(baseband  communications)  and  systems  that  use  modulation  (carrier  communications). 


4.1  BASEBAND  VERSUS  CARRIER 
COMMUNICATIONS 

I  he  term  baseband  is  used  to  designate  the  frequency  band  of  the  original  message  signal 
from  the  source  or  the  input  transducer  (see  Fig.  1 .2).  In  telephony,  the  baseband  is  the  audio 
band  (band  of  voice  signals)  of  0  to  3.5  kHz.  In  NTSC  television,  the  video  baseband  is  the 
v.deo  band  occupying  0  to  4.3  MHz.  For  digital  data  or  pulse  code  modulation  that  uses  bipolar 
signaling  at  a  rate  of  Rh  pulses  per  second,  the  baseband  is  approximately  0  to  Rh  Hz. 


Baseband  Communications 

Bee^r«am,^,nh',,U‘KiCHti0n'  direcMy  tra„smi„ed  w[,houtany  modification. 

Because  most  baseband  signals  such  as  audio  and  video  contain  significant  low-frequency 

content,  they  cannot  be  eHecfively  tnrnsntuted  ove,  radio ,  wireless Tnks T„s£d.  dedicated 
for  tan  disiancl  c  pa,rs  of  “W  and  coaxial  cables  are  assigned  to  each 

z  1,; estl :rrrons  Because  '**««'  bands. 

leave  much  of  the  channel  «  *  3  COtnmon  chan"el.  Thus,  baseband  communications 

“r ins  serai  “  *■*  - 

most  of  the  available  bandwidth  throuoh  f  V  USCrS  t0  share  one  channel  by  u,ll,Z  ^ 

ueh  frequency  division  multiplexing  (FDM).  Long-haul 

1  7  8 
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communication  over  a  radio  link  also  requires  modulation  to  shift  the  signal  spectrum  to  higher 
frequencies  in  order  to  enable  efficient  power  radiation  by  means  of  antennas  of  reasonable 
dimensions.  Yet  another  use  of  modulation  is  to  exchange  transmission  bandwidth  for  better 
performance  against  interferences. 

Carrier  Modulations 

Communication  that  uses  modulation  to  shift  the  frequency  spectrum  of  a  signal  is  known  as 
carrier  communication.  In  terms  of  analog  modulation,  one  of  the  basic  parameters  (ampli¬ 
tude,  frequency,  or  phase)  of  a  sinusoidal  carrier  of  high  frequency/,.  Hz  (or  coc  =  2n fc  rad/s) 
is  varied  linearly  with  the  baseband  signal  m(t).  This  results  in  amplitude  modulation  (AM), 
frequency  modulation  (FM),  or  phase  modulation  (PM),  respectively.  Amplitude  modulation 
is  linear  while  the  latter  two  types  of  carrier  modulation  are  similar  and  nonlinear,  often  known 
collectively  as  angle  modulation. 

A  comment  about  pulse-modulated  signals  [pulse  amplitude  modulation  (PAM),  pulse 
width  modulation  (PWM),  pulse  position  modulation  (PPM),  pulse  code  modulation  (PCM), 
and  delta  modulation  (DM)]  is  in  order  here.  Despite  the  common  term  modulation ,  these 
signals  are  baseband  digital  signals.  “Modulation”  is  used  here  not  in  the  sense  of  frequency  or 
band  shifting.  Rather,  in  these  cases  it  is  in  fact  describing  digital  pulse-coding  schemes  used  to 
represent  the  original  analog  signals.  In  other  words,  the  analog  message  signal  is  modulating 
parameters  of  a  digital  pulse  train.  These  signals  can  still  modulate  a  carrier  in  order  to  shift 
their  spectra. 

Amplitude  Modulations  and  Angle  Modulations 

We  denote  as  m(t)  the  source  message  signal  that  is  to  be  transmitted  by  the  sender  to  its 
receivers  and  its  Fourier  transform  as  M  (f).  To  move  the  frequency  response  of  m(t)  to  a  new 
frequency  band  centered  at/c  Hz,  the  Fourier  transform  has  already  revealed  a  very  strong 
property,  the  frequency -shifting  property,  that  will  allow  us  to  achieve  this  goal.  In  other  words, 
all  we  need  to  do  is  to  multiply  m(t)  by  a  sinusoid  of  frequency  fc  such  that 

s\(t)  =  m(t)  cos  In fct 

This  immediately  achieves  the  basic  aim  of  modulation  by  moving  the  signal  frequency  content 
to  be  centered  at  ±fc  via 


Sitf)  =  ^M(f-fc)  +  ^Af(f+fc) 

This  simple  multiplication  is  in  fact  allowing  changes  in  the  amplitude  of  the  sinusoid  5i(r) 
to  be  proportional  to  the  message  signal.  This  method  is  indeed  a  very  valuable  modulation 
known  as  amplitude  modulation. 

More  broadly,  consider  a  sinusoidal  signal 

s(t)  =  A(t)  cos  [a)ct  +  0(/)] 

There  are  three  variables  in  a  sinusoid:  amplitude,  (instantaneous)  frequency,  and  phase.  Indeed, 
the  message  signal  can  be  used  to  modulate  any  one  of  these  three  parameters  to  allow  s(t)  to 
carry  the  information  from  the  transmitter  to  the  receiver: 

Amplitude  A(t)  is  proportional  to  m{t)  <=>  amplitude  modulation 
Frequency  is  proportional  to  m(t)  <<=>>  frequency  modulation 
Phase  </>(/)  is  proportional  to  m(t)  phase  modulation 
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These  processes  are  known,  respectively,  as  amplitude  modulation,  frequency  modulation, 
and  phase  modulation.  In  this  chapter,  we  describe  various  forms  of  amplitude  modulations  in 
practical  communication  systems.  Amplitude  modulations  are  linear  and  their  analysis  in  the 
time  and  frequency  domains  is  simpler.  In  Chapter  5,  we  will  separately  discuss  the  nonlinear 
angle  modulations. 

The  Interchangeable  Use  off  and  co 

In  Chapter  3,  we  noted  the  equivalence  of  frequency  response  denoted  by  frequency/  or  by 
angular  frequency  co.  Each  of  these  notations  has  advantages  and  disadvantages.  After  the 
examples  and  problems  of  Chapter  3,  readers  should  be  familiar  and  comfortable  with  the  use 
ot  either  notation.  Thus,  from  this  point  on,  we  use  the  two  different  notations  interchangeably. 
Our  choice  in  each  case  is  based  on  the  notational  or  graphical  simplicity. 


4.2  DOUBLE-SIDEBAND  AMPLITUDE  MODULATION 

Amplitude  modulation  is  characterized  by  an  information-bearing  carrier  amplitude  A  (t)  that  is 
a  linear  function  of  the  baseband  (message)  signal  m(t).  At  the  same  time,  the  carrier  frequency 
wr  and  the  phase  9C  remain  constant.  We  can  assume  9C  =  0  without  loss  of  generality.  If  the 
carrier  amplitude  A  is  made  directly  proportional  to  the  modulating  signal  m(t),  then  modulated 

riv  m'"*'  ,COS  4‘  *  we  saw  eaid*er  [Eq.  (3.37)],  this  type  of  modulation  simply 

s  lifts  the  spectrum  ot  m(t)  to  the  carrier  frequency  (Fig.  4.1a).  Thus,  if 


m(t)  «=»  M  (f) 


then 


2‘  v  ,JC, 


n5* by/c>  and  M(/ +^> is  shifted  to 

the  left  and  to  the  right  by/.  Note  also  mTf m ^tS.  the/pectrum  of  the  modulating  signal  to 
from  Fig.  4.1c  the  module  \  .\  ha  ^  the  bandwidth  of  m(t)  is  B  Hz,  then,  as  seen 

modulated  signal  spectrum  centered^tif  ,bandwidth  of  28  Hz-  We  als«  observe  that  the 

lies  outside  ±/„  known  as  the  upper  sideband  ( USB)  /  C°"S1StS  °f ' two  parts:  3  p0rti°n  tha' 
as  the  lower  sideband  (LSB).  We  can  ,i,„  ,  ’ 3nd  a  Portl°n  that  lies  inside  ±fc,  known 

M  If)  has  an  impulse  at  zero  frequency  the  ^  /?m  F'8'  4lc  that’  unless  the  message  signal 

discrete  component  of  the  carrier  frequency  T\  °  ^  S'8nal th'S  scheme  does  not  contain  3 
introduce  a  sinusoid  at  f  For  thic  .  .  n  °™er  wor(fs’  the  modulation  process  does  not 

(DSB-SC)  modulation.*  "  “  ralled  *****>m*.  supprtssed-carrte 

The  relationship  of  B  to  f  is  of  into  .  n 

overlap  of  the  modulated  spectra  centered  at’  f  4'C  Sh°WS  that^  -  B'  thus  avoiding 
message  spectra  overlap  and  the  informatin  tt  ^  '  .If^c  <  B'  then  the  two  coPies  ot 
makes  it  impossible  to  recover  m(t\  f™  °  °  *s  distorted  during  modulation,  which 
____  (  )  fr°m  the  modulated  signal  m(t)  cos  a*. 

*  The  term  suppressed  carrier  doe. 

ETJSZZt r  *T  HWm.  « the  cmler  « 

of  the  form  k  co  ■’U  ?S  i**  ±^1'  also  implies  that  the  nm  l  ^IS  'mP*‘es  that  the  spectrum  of  the  DSB-SC 

the  form  *  cos  2 nfc,  [assuming  tha,  ,,,„)  has  a  zeroZnean  value'l  g"a'  "'(',t0S  ^  does  "°l 3 


Figure  4.1 

Double¬ 
sideband, 
suppressed- 
carrier  (DSB-SC) 
modulation  and 
demodulation. 
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Figure  4.1 

Double¬ 
sideband, 
suppressed- 
carrier  (DSB-SC) 
modulation  and 
demodulation. 
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m(r)  cos  ay 
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(Modulated  signal) 


(a)  Modulator 


cos  cy 
(Carrier) 


(b) 


cos  cy 
(Carrier) 


Note  that  practical  factors  may  impose  additional  restrictions  on  fc.  In  broadcast  applica¬ 
tions,  for  instance,  a  transmit  antenna  can  radiate  only  a  narrow  band  without  distortion.  This 
means  that  to  avoid  distortion  caused  by  the  transmit  antenna,  we  must  have/,/#  »  1.  Thus 
broadcast  band  AM  radio,  with  B  =  5  kHz  and  the  band  of  550  to  1600  kHz  as  the  carrier 
frequency,  gives  a  ratio  of  fc/B  roughly  in  the  range  of  100  to  300. 
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Demodulation 

The  DSB-SC  modulation  translates  or  shifts  the  frequency  spectrum  to  the  left  and  to  the  right 
by  fc  (i-e.,  at  +fc  and  —fc),  as  seen  from  Eq.  (4.1).  To  recover  the  original  signal  m(t)  from  the 
modulated  signal,  it  is  necessary  to  retranslate  the  spectrum  to  its  original  position.  The  process 
of  recovering  the  signal  from  the  modulated  signal  (retranslating  the  spectrum  to  its  original 
position)  is  referred  to  as  demodulation.  Observe  that  if  the  modulated  signal  spectrum  in 
Fig.  4.1c  is  shifted  to  the  left  and  to  the  right  by/c  (and  multiplied  by  one-half),  we  obtain  the 
spectrum  shown  in  Fig.  4.  Id,  which  contains  the  desired  baseband  spectrum  plus  an  unwanted 
spectrum  at  ±2 \fc.  The  latter  can  be  suppressed  by  a  low-pass  filter.  Thus,  demodulation,  which 
is  almost  identical  to  modulation,  consists  of  multiplication  of  the  incoming  modulated  signal 
m(t)  cos  <oct  by  a  carrier  cos  coct  followed  by  a  low-pass  filter,  as  shown  in  Fig.  4. le.  We 
can  verify  this  conclusion  directly  in  the  time  domain  by  observing  that  the  signal  e(t)  in 
Fig.  4.1e  is 


e(t)  =  m{t)  cos2  o)ct 

1 

=  2  MO  +  ai(f)cos  2coct] 
Therefore,  the  Fourier  transform  of  the  signal  e(t)  is 


(4.2a) 


E  (f)  =  2M(f)  +  +  2/c)  +  M  (f  —  2 fc)]  (4.2b) 

Thisanal\sis  shows  that  the  signal  e(t)  consists  of  two  components  (1/2  )m(t)  and 
( \JT)m(t)  cos  2 coct,  w.th  their  nonoverlapping  spectra  as  shown  in  Fig.  4. Id.  The  spectrum 
c4  the  second  component,  be.ng  a  modulated  signal  with  carrier  frequency  2/„  is  centered  at 

comDoneT(V/2\SMTr^nt  “  ?PPrCSSed  by  the  'OW-Pass  filter  in  Fig-  4-le.  The  desired 
the  filter  unh-.rme  1  ’  i.-1"8  *  '°w'pass  sPectrum  (centered  at  /  =  0),  passes  through 

characteristics  is  inH  Tnlf  °UtpUt  (1/2)m(,)'  A  Possible  form  of  low-pass  filter 

vd- The  m,er  leads  to  a  dis- 

rid  of  the  inconvenient  fraction w2 Tthe  " *  Vu  ***  DSB*SC  si&mL  We  ca"  gC 

cos  <D,  t.  In  fact  later  nn  .  .  ii  r  le  outPut  by  using  a  carrier  2  cos  coct  instead  of 
conclusions.  •  we  s  a  o  ten  use  this  strategy,  which  does  not  affect  general 

ent  detection ,  where  we  use^carter^f' J  1S  called  synchronous  detection,  or  coher- 

carrier  used  for  modulation.  Thus,  for  dem^n?  ^  S3me  frequency  (and  Phase)  aS  thC 
the  receiver  in  frequency  and  nhi  .  u  U  Utlon’  we  need  to  generate  a  local  carrier  at 
modulator.  '  P  Se  coherence  (synchronism)  with  the  carrier  used  at  the 


Figure  4.2 

Example  of 

DSB-SC 

modulation. 


w(0  =  cos  <*nt  =  cos  2 nfmt 

and  LSB).  Verify  that  the  DSB-SC  moHniC!?lim'  *dent'fy  dle  upper  and  lower  sidebands  (USB 
in  Fig.  4. le.  ae  S18nal  can  be  demodulated  by  the  demodulator 
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The  case  in  this  example  is  referred  to  as  tone  modulation  because  the  modulating  signal  is  a 
pure  sinusoid,  or  tone,  cos  comt.  We  shall  work  this  problem  in  the  frequency  domain  as  well 
as  the  time  domain  to  clarify  the  basic  concepts  of  DSB-SC  modulation.  In  the  frequency 
domain  approach,  we  work  with  the  signal  spectra.  The  spectrum  of  the  baseband  signal 
m(t)  =  cos  (t)mt  is  given  by 


=  +  +/„)] 

=  7 t[8(cd  -  com)  +  8(co  +  com)] 

The  message  spectrum  consists  of  two  impulses  located  at  ±/m,  as  shown  in  Fig.  4.2a. 
The  DSB-SC  (modulated)  spectrum,  as  seen  from  Eq.  (4.1),  is  the  baseband  spectrum  in 
Fig.  4.2a  shifted  to  the  right  and  to  the  left  by  fc  (times  one-half),  as  shown  in  Fig.  4.2b. 
This  spectrum  consists  of  impulses  at  angular  frequencies  ±(fc  —  fm)  and  ±(fc  +  fm).  The 
spectrum  beyond/r  is  the  USB,  and  the  one  below/r  is  the  LSB.  Observe  that  the  DSB-SC 
spectrum  does  not  have  the  component  of  the  carrier  frequency  fc.  This  is  why  it  is  called 
suppressed  carrier . 

In  the  time-domain  approach,  we  work  directly  with  signals  in  the  time  domain.  For 
the  baseband  signal  m(t)  =  cos  comt ,  the  DSB-SC  signal  <PDSB_SC(0  is 

^DSB-SC  M  COS  (Oct 

=  cos  COmt  COS  COct 

=  ^ [cos  (coc  +  com)t  +  COS  ((Dc  -  (Om)t] 

This  shows  that  when  the  baseband  (message)  signal  is  a  single  sinusoid  of  frequency 
/,„,  the  modulated  signal  consists  of  two  sinusoids:  the  component  of  frequency  fc  4-  fm 


Figure  4.2  M(f) 
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modulation. 
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(the  USB)  and  the  component  of  frequency  fc  -fm  (the  LSB).  Figure  4.2b  shows  precisely 


Figure  4.3 


the  spectrum  of  <pDSB_sc(f )•  Thus,  each  component  of  frequency/,,,  in  the  modulating 
signal  turns  into  two  components  of  frequencies  fc  +  fm  and  fc  —  fm  in  the  modulated 
signal.  Note  the  fact  that  there  is  no  component  of  the  carrier  frequency  fc  on  the  right- 
hand  side  of  the  preceding  equation.  As  mentioned,  this  is  why  it  is  called  double-sideband 
suppressed-carrier  (DSB-SC)  modulation. 

We  now  verify  that  the  modulated  signal  <pDSB_sc(f)  —  cos  cos  coct ,  when  applied 
to  the  input  of  the  demodulator  in  Fig.  4.1e,  yields  the  output  proportional  to  the  desired 
baseband  signal  cos  c omt .  The  signal  e(t)  in  Fig.  4.1e  is  given  by 


e(t)  =  cos  Q)mt  cos2  6Dct 


=  -  cos  comt  (1  +  cos  2 a>ct) 


The  spectrum  of  the  term  cos  a>m/cos  2 a)ct  is  centered  at  2fc  and  will  be  suppressed  by 
the  low-pass  filter,  yielding  ^  cos  comt  as  the  output.  We  can  also  derive  this  result  in  the 


frequency  domain.  Demodulation  causes  the  spectrum  in  Fig.  4.2b  to  shift  left  and  right 
by  fc  (and  multiplies  by  one-halD.  This  results  in  the  spectrum  shown  in  Fig.  4.2c.  The 
low-pass  filter  suppresses  the  spectrum  centered  at  ±2/c,  yielding  the  spectrum  (/*). 


Modulators 

Modulation  can  be  achieved  in  several  ways.  We  shall  discuss  some  important  categories  of 
modulators. 


Multiplier  Modulators:  Here  modulation  is  achieved  directly  by  using  an  analog  mul- 
tiplier  whose  output  is  proportional  to  the  product  of  two  input  signals  m{t)  and  cos  <oct . 
Typically,  such  a  multiplier  is  obtained  from  a  variable-gain  amplifier  in  which  the  gain  param¬ 
eter  (such  as  the  ft  ot  a  transistor)  is  controlled  by  one  of  the  signals,  say,  m(t).  When  the  signal 
cos  wct  is  applied  at  the  input  of  this  amplifier,  the  output  is  DroDortional  to  n,(t\ m s  «u. 


power  serie s" °UtPUt  CharaCteristics  of  either  of  the  nonlinear  elements  be  approximated 


by  a  power  series: 


y(t)  =  ax(t)  +  bx2(t) 


(4.3) 
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Figure  4.3 

Nonlinear 

DSB-SC 

modulator. 


Substituting  the  two  inputs  jci(r)  =  cos  a>ct  +  m{t)  and  xi(t)  =  cos  a>ct  —  m(t)  in  this  equation 
yields 


z(t)  =  2 a  •  m(t)  +  4 b  •  m(t)  cos  a)ct 

The  spectrum  of  m(t)  is  centered  at  the  origin,  whereas  the  spectrum  of  m(t)  cos  coct  is  centered 
at  ±coc.  Consequently,  when  z(t)  is  passed  through  a  bandpass  filter  tuned  to  coc ,  the  signal 
am(t)  is  suppressed  and  the  desired  modulated  signal  4 bm(t)  cos  a)ct  can  pass  through  the 
system  without  distortion. 

In  this  circuit  there  are  two  inputs:  m(t)  and  cos  coct.  The  output  of  the  last  summer,  z(r), 
no  longer  contains  one  of  the  inputs,  the  carrier  signal  cos  coc t.  Consequently,  the  carrier  signal 
does  not  appear  at  the  input  of  the  final  bandpass  filter.  The  circuit  acts  as  a  balanced  bridge 
for  one  of  the  inputs  (the  carrier).  Circuits  that  have  this  characteristic  are  called  balanced 
circuits.  The  nonlinear  modulator  in  Fig.  4.3  is  an  example  of  a  class  of  modulators  known 
as  balanced  modulators.  This  circuit  is  balanced  with  respect  to  only  one  input  (the  carrier); 
the  other  input  m(t)  still  appears  at  the  final  bandpass  filter,  which  must  reject  it.  For  this 
reason,  it  is  called  a  single  balanced  modulator.  A  circuit  balanced  with  respect  to  both  inputs 
is  called  a  double  balanced  modulator ,  of  which  the  ring  modulator  (see  later:  Fig.  4.6)  is  an 
example. 

Switching  Modulators:  The  multiplication  operation  required  for  modulation  can  be 
replaced  by  a  simpler  switching  operation  if  we  realize  that  a  modulated  signal  can  be  obtained 
by  multiplying  m(t)  not  only  by  a  pure  sinusoid  but  by  any  periodic  signal  (p(t)  of  the  fun¬ 
damental  radian  frequency  coc.  Such  a  periodic  signal  can  be  expressed  by  a  trigonometric 
Fourier  series  as 


00 

m  =  £  C„  cos  (na)ct  +  6„)  (4.4a) 

n= 0 


Hence, 


00 

m(t)<t>(t)  =  ^  C„  m(t)  cos  ( ncoct  +  6„)  (4.4b) 

71=0 


This  shows  that  the  spectrum  of  the  product  is  the  spectrum  M(f)  shifted  to 

±f,  ±2j\  ±  nfc  If  this  signal  is  passed  through  a  bandpass  filter  of  bandwidth 

2 B  Hz  and  tuned  to/c,  we  get  the  desired  modulated  signal  c\m(t)  cos  (coct  +  G\ )  * 


The  phase  9\  is  not  important. 
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Figure  4.4 

Switching 
modulator  for 
DSB-SC. 


Figure  4.5 

(a)  Diode-bridge 
electronic  switch. 

(b)  Series-bridge 
diode  modulator. 

(c)  Shunt-bridge 
diode  modulator. 


The  square  pulse  train  w(t)  in  Fig.  4.4b  is  a  periodic  signal  whose  Fourier  series  was  found 
earlier  in  Example  2.8  [Eq.  (2.86)]  as 


w(f) 

The  signal  m(t)w(t)  is  given  by 


1  2  (  1  l 

2  +  7r  \C°S  Wct  ~  3  cos  •^co<7  +  -  cos  5io, 


(4.5) 


(4.6) 


m(,)W(0  ~  \mU)  +  \  [m(r)  cos  <*'  -  ^(0  cos  3 (Oct  +  !«(,)  cos  5(oct  -  .  •  •  J 

number  ol  nodulated ^ 'signals  ShTngular  fre  C°mP°nent  "'(0  but  also  of  an.  infini.te 

spectrum  of  m(t)w(t)  consists  of  multiple  copi«  cfthT  ^  5"c . Therefore,  te 

0.  ±/,.  ±3fc,  ±5fc,  . . .  (with  decreasing  relive  aP' 

For  modulation,  we  are  interested  in  extracting  the  ’  a  ^  T  F'8'  4  4c '  , 

only.  To  separate  this  component  from  the  rest  of  th^  he.modulated  component  m(t)  cos  (oct 

a  bandpass  filter  of  bandwidth  ~>B  Hz  (ot4ttR  <■  t/  r°W  ' WC  pass  tbe  s‘8na*  m(t)w(t)  through 
carrier angular  ,requency/ °  >  “ ,he  f"W  «  p™i'W,he 

not  centered  at  ±fc  to  yield  the  desired  modi.!  ,’J  W'  suppress  al1  the  spectral  components 

We  now  see  the  real  payoff  ohhis  method  hluir^r3  f^rrlnifOcos  (ocl  (Fig.  4.4dl. 
is  in  nrol/ly  ,  switching  operation.  It  invoice.  swit(.|!]’k“0n"f  a,slSnal  b>" «  Pulse  ln;'n 
and  can  be  accomplished  by  simple  switching  el  S  ***  S'8na  '"(t)  °n  and  off  Per,odlcally 
one  such  electronic  switch  the  dwTSS by  "<'>•  Figure  45a  sh0WS 
produce  the  switching  action.  Diodes  D,  Da  and  n  dnVen  by  3  sinusoid  4  cos  ^  “! 

cos  a)ct  is  of  a  polarity  that  will  make  terminal  matched  pairs-  When  the  slgna 

terminal  c  positive  with  respect  to  d,  all  the  diodes 
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(a)  Diode-bridge 
electronic  switch. 

(b)  Series-bridge 
diode  modulator. 

(c)  Shunt-bridge 
diode  modulator. 


Figure  4.5 
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conduct.  Because  diodes  D\  and  Dj  are  matched,  terminals  a  and  b  have  the  same  potential 
and  are  effectively  shorted.  During  the  next  half-cycle,  terminal  d  is  positive  with  respect  to 

c,  and  all  four  diodes  open,  thus  opening  terminals  a  and  b.  The  diode  bridge  in  Fig.  4.5a, 
therefore,  serves  as  a  desired  electronic  switch,  where  terminals  a  and  b  open  and  close  peri¬ 
odically  with  carrier  frequency  fc  when  a  sinusoid  A  cos  coct  is  applied  across  terminals  c  and 

d.  To  obtain  the  signal  m(t)w(t),  we  may  place  this  electronic  switch  (terminals  a  and  b)  in 
series  (Fig.  4.5b)  or  across  (in  parallel)  m(l),  as  shown  in  Fig.  4.5c.  These  modulators  are 
known  as  the  series  bridge  diode  modulator  and  the  shunt  bridge  diode  modulator,  respec¬ 
tively.  This  switching  on  and  off  of  m(t )  repeats  for  each  cycle  of  the  carrier,  resulting  in  the 
switched  signal  m(t)w(t),  which  when  bandpass-filtered,  yields  the  desired  modulated  signal 
(2/n)m(t)  cos  coct. 

Another  switching  modulator,  known  as  the  ring  modulator,  is  shown  in  Fig.  4.6a.  During 
the  positive  half-cycles  of  the  carrier,  diodes  D\  and  Dt,  conduct,  and  Di  and  D4  are  open. 
Hence,  terminal  a  is  connected  to  c,  and  terminal  b  is  connected  to  d.  During  the  negative 
half-cycles  of  the  carrier,  diodes  D\  and  £>3  are  open,  and  D2  and  D4  are  conducting,  thus 
connecting  terminal  a  to  d  and  terminal  b  to  c.  Hence,  the  output  is  proportional  to  m(t)  during 
the  positive  half-cycle  and  to  - m(t )  during  the  negative  half-cycle.  In  effect.  m(t)  is  multiplied 
by  a  square  pulse  train  wo(/),  as  shown  in  Fig.  4.6b.  The  Fourier  series  for  wo(f)  as  found  in 
Eq.  (2.86)  is 


(4.7a) 


and 


v,(f)  =  m(t)w0(t)  =  - 


n 


4 


|m(0 


The  signal  m(t)w0(t)  is  shown  in  Fig.  4.6d.  When  this  waveform  is  passed  through  a  bandpass 
filter  tuned  to  coc  (Fig.  4.6a),  the  filter  output  will  be  the  desired  signal  (4/n)m(t)  cos  coct. 
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Figure  4.6 

Ring  modulation. 


Figure  4.7 

Frequency  mixe 
or  converter. 


woU) 


(b) 


v,-  =  m(t)wo(t) 


A 


(d) 


In  this  circuit  there  are  two  inputs:  m(t)  and  cos  coct.  The  input  to  the  final  bandpass  filter 

does  not  contain  either  of  these  inputs.  Consequently,  this  circuit  is  an  example  of  a  double 
balanced  modulator. 


Example  4.2  Frequency  Mixer  or  Converter 

c  hall  analy ze  a  frequency  mixer,  or  frequency  converter,  used  to  change  the  carrier  angular 
frequency  of  n  modulaied  signal  m(,)cos  ,rom  mMla(mvJKy 


This  can  be  done  by  multiplying  ro(f)cos  coct  by  2  cos  w  ■  t  where  m  -  (Or+M  or 

*  ^  -  *ow7'„  ?8e:,r  - 


The  product  x(l)  is 


x(t)  =  2m(t )  cos  (oct  cos  wmixr 

=  tw(r)[cos  (Wc  -  comix)t  +  cos  (Wc  +  OJmix)t] 

It  we  select  a>mjx  =  <wc  —  co/.  then 


*(')  =  m(t)[  cos 


w''  +  cos(2wc-W/)f] 
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Figure  4.7 

Frequency  mixer 
or  converter. 
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If  we  select  com\x  =  coc  -f  coi ,  then 


jc(r)  =  m(t)[ cos  coit  +  cos  (2 coc  +  oo/)t] 


In  either  case,  as  long  as  coc  —  a>i  >  2ttB  and  coj  >  2i r£,  the  various  spectra  in  Fig.  4.7b 
will  not  overlap.  Consequently,  a  bandpass  filter  at  the  output,  tuned  to  cu/,  will  pass  the 
term  m(t)  cos  coit  and  suppress  the  other  term,  yielding  the  output  m(t)  cos  co/t.  Thus, 
the  carrier  frequency  has  been  translated  to  coi  from  coc. 

The  operation  of  frequency  mixing/conversion  (also  known  as  heterodyning)  is  basi¬ 
cally  a  shifting  of  spectra  by  an  additional  (Omi*.  This  is  equivalent  to  the  operation  of 
modulation  with  a  modulating  carrier  frequency  (the  mixer  oscillator  frequency  a>mjx) 
that  differs  from  the  incoming  carrier  frequency  by  a>i.  Any  one  of  the  modulators 
discussed  earlier  can  be  used  for  frequency  mixing.  When  we  select  the  local  carrier 
frequency  com\x  =  coc  +  &>/,  the  operation  is  called  super-heterodyning,  and  when  we 
select  &>mix  =  (oc  —  coi,  the  operation  is  sub-heterodyning. 


Demodulation  of  DSB-SC  Signals 

As  discussed  earlier,  demodulation  of  a  DSB-SC  signal  essentially  involves  a  multiplication 
with  the  carrier  signal  and  is  identical  to  modulation  (see  Fig.  4. 1 ).  At  the  receiver,  we  multiply 
the  incoming  signal  by  a  local  carrier  of  frequency  and  phase  in  synchronism  with  the  incoming 
carrier.  The  product  is  then  passed  through  a  low-pass  filter.  The  only  difference  between  the 
modulator  and  the  demodulator  lies  in  the  input  signal  and  the  output  filter.  In  the  modulator, 
message  m(t)  is  the  input  and  the  multiplier  output  is  passed  through  a  bandpass  filter  tuned 
to  o)c ,  whereas  in  the  demodulator,  the  DSB-SC  signal  is  the  input  and  the  multiplier  output 
is  passed  through  a  low-pass  filter.  Therefore,  all  the  modulators  discussed  earlier  without 
multipliers  can  also  be  used  as  demodulators,  provided  the  bandpass  filters  at  the  output  are 
replaced  by  low-pass  filters  of  bandwidth  B. 

For  demodulation,  the  receiver  must  generate  a  carrier  that  is  synchronous  in  phase 
and  in  frequency  with  the  incoming  carrier.  These  demodulators  are  synonymously  called 
synchronous  or  coherent  (also  homodyne)  demodulators. 
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Example  4.3  Analyze  the  switching  demodulator  that  uses  the  electronic  switch  (diode  bridge)  in  Fig.  4.5 
as  a  switch  (either  in  series  or  in  parallel). 


The  input  signal  is  m(t)  cos  a)ct.  The  carrier  causes  the  periodic  switching  on  and  off  of  the 
input  signal.  Therefore,  the  output  is  m(t)  cos  coct  x  w(f).  Using  the  identity  cos  *  cos  v  = 
0.5[cos  (jc  +  y)  +  cos  (x  -  y)],  we  obtain 


m(t)  cos  coct  x  w(t)  =  m(t)  cos  coct 


COS  CDct  —  -  COS  3 (Dct  + 


)] 


=  -m(t)  cosz  (oct  +  terms  of  the  form  m(t)  cos  ruoct 

TC 

1  1 

=  —m(t)  H — m(t)  cos  2 coct  -f  terms  of  the  form  m(t)  cos  na)ct 

7T  71 


Spectra  of  the  terms  of  the  form  m(t)  cos  ruoct  are  centered  at  ±ncoc  rad/s  and  are  filtered 
out  by  the  low-pass  filter  yielding  the  output  (\/jz)m(t).  It  is  left  as  an  exercise  for  the 
reader  to  show  that  the  output  of  the  ring  demodulator  in  Fig.  4.6a  (with  the  low-pass  filter 
at  the  output)  is  (2/n)m(t)  (twice  that  of  the  switching  demodulator  in  this  example). 


4.3  AMPLITUDE  MODULATION  (AM) 

In  the  last  section,  we  began  our  discussion  of  amplitude  modulation  by  introducing  DSB- 
SC  amplitude  modulation  because  it  is  easy  to  understand  and  to  analyze  in  both  time  and 
frequency  domains.  However,  analytical  simplicity  is  not  always  accompanied  by  an  equiv¬ 
alent  simplicity  in  practical  implementation.  The  (coherent)  demodulation  of  DSB-SC  signal 
requires  the  receiver  to  possess  a  carrier  signal  that  is  synchronized  with  the  incoming  carrier. 
This  requirement  is  not  easy  to  achieve  in  practice.  Because  the  modulated  signal  may  have 
trave  ed  hundreds  ol  miles  and  could  even  suffer  from  some  unknown  frequency  shift,  the 
bandpass  received  signal  in  fact  has  the  form  of 

r(t)  =  Acm(i  -  t0) cos  [(a>f  +  Aoi)(/  -  r0)]  =  Acm(l  -  t0)  cos  [(<uc  +  A a>)t  -  6d)] 
in  which  A co  represents  the  Doppler  effect  while 


8d  =  {coc  +  A  co)td 

sophisticated^nouffh'tn11  '°  T°  Ut'lize  the  coherent  demodulator,  the  receiver  must  be 

Sved  Ml|  rtn  S“hen',i‘"a  l<K'al  0SCilla'°r  +  A'»>'  -  fell  PU^y  6°™ 

This  cost  should  be  panlculri7™ldTin  broader '°  'mpl""en' m6  could  tl“i,e  COS'ly' 
for  every  transmitter.  ‘  broadcasting  systems,  which  have  many  receivers 

[along  w^Ae Im^ulat^^^n^j^^^j ^ tI?nSm*tter to send  a carrier ^  cos 
at  the  receiver.  In  this  case  the  tnn<m  „  ^ L  th  ‘ there  1S  no  need  to  generate  a  earner 

which  increases  its  cost  as  a  trade-off  In^  l°  transmit  at  a  much  h‘gher  power  level, 
transmitter  for  every  receiver  substmf  i  P°lnt'|°'Point  communications,  where  there  is  one 
D  receiver,  substantial  complexity  in  the  receiver  system  can  be  justified 
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it  the  cost  is  ottset  by  a  less  expensive  transmitter.  On  the  other  hand,  for  a  broadcast  system 
with  a  huge  number  of  receivers  for  each  transmitter,  where  any  cost  saving  at  the  receiver 
is  multiplied  by  the  number  of  receiver  units,  it  is  more  economical  to  have  one  expensive 
high-power  transmitter  and  simpler,  less  expensive  receivers.  For  this  reason,  broadcasting 
systems  tend  to  favor  the  trade-off  by  migrating  cost  from  the  (many)  receivers  to  the  (fewer) 
transmitters. 

The  second  option  of  transmitting  a  carrier  along  with  the  modulated  signal  is  the  obvious 
choice  in  broadcasting  because  of  its  desirable  trade-offs.  This  leads  to  the  so-called  AM 
(amplitude  modulation),  in  which  the  transmitted  signal  <pAM  (r)  is  given  by 

<Pam  (0  =  A  cos  Met  +  m(t)  cos  wct  (4.8a) 

=  [A  +  m(t)]  cos  o)ct  (4.8b) 

The  spectrum  of  <pAM(f)  is  basically  the  same  as  that  of  <pDSB-sC(0  =  m(t)  cos  coct  except  for 
the  two  additional  impulses  at  ±fC9 

PamM  <=>  \[M(f+fc)  +  M(f-fc)]  +  ^[8(f+fc)  +  8tf-fcn  (4.8c) 

Upon  comparing  cpAM(t)  with  <PDSB.SCW  =  m(t)  cos  coct,  it  is  clear  that  the  AM  signal  is 
identical  to  the  DSB-SC  signal  with  A  +  m(t)  as  the  modulating  signal  [instead  of  m(t)].  The 
value  of  A  is  always  chosen  to  be  positive.  Therefore,  to  sketch  (pAM  (/),  we  sketch  the  envelope 
I A  +  m(t)  |  and  its  mirror  image  —\A  +  m(t)\  and  fill  in  between  with  the  sinusoid  of  the  carrier 
frequency  fc.  The  size  of  A  affects  the  time  domain  envelope  of  the  modulated  signal. 

Two  cases  are  considered  in  Fig.  4.8.  In  the  first  case,  A  is  large  enough  to  ensure  that 
A  4-  m(t)  >  0  is  always  nonnegative.  In  the  second  case,  A  is  not  large  enough  to  satisfy 
this  condition.  In  the  first  case,  the  envelope  has  the  same  shape  as  m(t)  (although  riding 
on  a  direct  current  of  magnitude  A).  In  the  second  case,  the  envelope  shape  differs  from  the 
shape  of  m(t)  because  the  negative  part  of  A  4  m(t)  is  rectified.  This  means  we  can  detect 
the  desired  signal  m(t)  by  detecting  the  envelope  in  the  first  case  when  A  4-  m{t)  >  0.  Such 
detection  is  not  possible  in  the  second  case.  We  shall  see  that  envelope  detection  is  an  extremely 
simple  and  inexpensive  operation,  which  does  not  require  generation  of  a  local  carrier  for  the 
demodulation.  But  we  have  shown  that  the  envelope  of  AM  has  the  information  about  m(t) 
only  if  the  AM  signal  [A  4-  m(t)]  cos  coct  satisfies  the  condition  A  4-  m(t)  >  0  for  all  t. 

Let  us  now  be  more  precise  about  the  definition  of  “envelope/'  Consider  a  signal 
E(t)  cos  coct.  If  E(t )  varies  slowly  in  comparison  with  the  sinusoidal  carrier  cos  a>ct ,  then  the 
envelope  of  E(t)  cos  coct  is  \E(t)\.  This  means  [see  Eq.  (4.8b)]  that  if  and  only  ifA4-m(/)  >  0 
for  all  /,  the  envelope  of  <pAM  (t)  is 


| A  +  m(t)  |  =  A  4-  m(t) 

In  other  words,  for  envelope  detection  to  properly  detect  m(t ),  two  conditions  must  be  met: 

(a)  fc  bandwidth  of  m(t) 

(b)  A  4-  m(t)  >  0 

This  conclusion  is  readily  verified  from  Fig.  4.8d  and  e.  In  Fig.  4.8d,  where  A  4-  m(t)  >  0, 
A  4-  m(t)  is  indeed  the  envelope,  and  m(t)  can  be  recovered  from  this  envelope.  In  Fig.  4.8e, 
where  A  4-  m(t )  is  not  always  positive,  the  envelope  | A  4-  m(t)\  is  rectified  from  A  +  m(t ), 
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Figure  4.8 

AM  signal  and 
its  envelope. 


m(t) 


and  m{t)  cannot  be  recovered  from  the  envelope.  Consequently,  demodulation  of  <pm(t)  in 

F'g-  4^,amou"ts  t0  s,mP'e  envel°pe  detection.  Thus,  the  condition  for  envelope  detection 
of  an  AM  signal  is 


to  nl  I  inv  •  °f  J  h  Ar  °  Ulready  SatisfieS  condition  (4-9a).  In  this  case  there  is  no  need 

DSB  SC siena , T!T»  HKV  pe  °f  thC  DSB'SC  Signal  cos  ”ct  is  m(t),  and  such  a 

that  m(t)  >  0  for  -dl  'r  ,h  'V  enve,ope  detect‘on- In  the  following  discussion  we  assume 

that  m(t)  t  0  for  all  r;  that  ,s. «(, )  can  be  negative  over  some  range  of ,. 

value!  r )^res^c /i ve I y '(fig  This^ne -  **  ^  maX'mum  and  the  minimum 

envelope  detection  (4.9a)  is  equivalent  to  ^  **  ^  “  ~mp'  Hence’ the  cond,tlon  0 


Thus,  the  minimum  carrier  amplitude  reauired  for  •  u-,- 

This  is  ’“itt  de“  Fis-  4.8.  We  deline  die  m<xlula,i  7**°  "  “ 


Example 


(4.10a) 
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For  envelope  detection  to  be  distortionless,  the  condition  is  A  >  mp.  Hence,  it  follows 
that 


0</x  <  1  (4.10b) 

is  the  required  condition  for  the  distortionless  demodulation  of  AM  by  an  envelope 
detector. 

When  A  <  mp ,  Eq.  (4.10a)  shows  that  /x  >  1  (overmodulation).  In  this  case,  the  option  of 
envelope  detection  is  no  longer  viable.  We  then  need  to  use  synchronous  demodulation.  Note 
that  synchronous  demodulation  can  be  used  for  any  value  of  jx,  since  the  demodulator  will 
recover  signal  A  +  m(t).  Only  an  additional  dc  blocker  is  needed  to  remove  the  dc  voltage  A. 
The  envelope  detector,  which  is  considerably  simpler  and  less  expensive  than  the  synchronous 
detector,  can  be  used  only  for  /x  <  1 . 


Message  Signals  m(t)  with  Nonzero  Offset:  On  rare  occasions,  the  message  signal 
m{t)  will  have  a  nonzero  offset  such  that  its  maximum  rnmax  and  its  minimum  wmjn  are  not 
symmetric,  that  is. 


^min  7^  ^max 

In  such  a  case,  it  can  be  recognized  that  any  offset  to  the  envelope  does  not  change  the  shape 
of  the  envelope  detector  output.  In  fact,  one  should  note  that  constant  offset  does  not  carry  any 
fresh  information.  In  this  case,  envelope  detection  would  still  remain  distortionless  if 

0  <  /x  <  1  (4.11a) 

with  a  modified  modulation  index  definition  of 


M  = 


Wmax  ^min 

2 A  +  mmax  +  rnmm 


(4.11b) 


Xample  4.4  Sketch  (pAM(t)  for  modulation  indices  of  /x  =  0.5  and  /x  =  1,  when  m(t)  =  Z?cos  c omt . 

This  case  is  referred  to  as  tone  modulation  because  the  modulating  signal  is  a  pure  sinusoid 
(or  tone). 

In  this  case,  mmax  =  b  and  rnmin  =  —b.  Hence  the  modulation  index  according  to 
Eq.  (4.10a)  is 


l* 1  = 


b 

A 


Hence,  b  =  fiA  and 


m(t)  =  b  cos  a)mt  =  /xA  cos  comt 
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Therefore, 


(PamU)  =  [A  +m(t)] cos  coct  =A[  1  +//cos  o>w/]cos  a)ct 
Figure  4.9  shows  the  modulated  signals  corresponding  to  \x  =  0.5  and  p  =  1 ,  respectively. 


Figure  4.9 

Tone-modulated 
AM:  (a)  /<  «  0.5; 

(b)  M  =  1  • 


Sideband  and  Carrier  Power 

The  advantage  of  envelope  detection  in  AM  has  its  price.  In  AM,  the  carrier  term  does  not 
carry  any  information,  and  hence,  the  carrier  power  is  wasteful  from  this  point  of  view, 


V’am  (f)  —  A  cos  coct  +  m(t)  cos  a>ct 
earner  sidebands 


The  carrier  power  Pc  is  the  mean  square  value  ofAcos  coct,  which  is  A2/ 2.  The  sideband  power 
Ps  is  the  power  ot  m(t)  cos  a>ct,  which  is  0.5  mz(t)  [see  Eq.  (3.94)].  Hence, 


*  -  2  '' 

u^l  for  convenient resides  in  the  sid«*and  power,  whereas  the  carrier  power  is 

(wasted)  oowel  and  th"  h  T  and  demod“lation.  The  total  power  is  the  sum  of  the  carrier 
(wasted)  power  and  the  s.deband  (useful)  power.  Hence,  the  power  efficiency,  is 


useful  power  p 
jj  - - ~~ - s 

total  power  Pc  -f  ps 
For  the  special  case  of  tone  modulation. 


m2{t) 


A2  +  m2(t) 


100% 


m(0  =  UA  cos  comt 


m2(t)  = 


(PA)2 


Hence 


and 


2 
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with  the  condition  that  0  <  fi  <  1.  It  can  be  seen  that  r)  increases  monotonically  with  n,  and 
n max  occurs  at  /n  =  1,  for  which 


*?max  —  33% 


Thus,  tor  tone  modulation,  under  the  best  conditions  (/i  =  1 ),  only  one-third  of  the  transmitted 
power  is  used  for  carrying  messages.  For  practical  signals,  the  efficiency  is  even  worse — on  the 
order  of  25%  or  lower—  in  comparison  to  the  DSB-SC  case.  The  best  condition  implies  n  =  1 . 
Smaller  values  of  fi  degrade  efficiency  further.  For  this  reason,  volume  compression  and  peak 
limiting  are  commonly  used  in  AM  to  ensure  that  full  modulation  (/it  =  1)  is  maintained  most 
of  the  time. 


Example  4.5  Determine  r/  and  the  percentage  of  the  total  power  carried  by  the  sidebands  of  the  AM  wave 
for  tone  modulation  when  (a)  fi  =  0.5  and  (b)  /r  =  0.3. 

For  n  =  0.5, 


u2  (0  5)2 

r]  =  -4— ,100%  =  '  '  ,  100%  =  11.11% 

2  +  H2  2  +  (0.5)2 

Hence,  only  about  1 1%  of  the  total  power  is  in  the  sidebands.  For  fi  =  0.3, 

(0.3)2 


*1  = 


-100%  =  4.3% 


2  +  (0.3)2 

Hence,  only  4.3%  of  the  total  power  is  the  useful  information  power  (in  sidebands). 


Generation  of  AM  Signals 

In  principle,  the  generation  of  AM  signals  is  identical  to  any  DSB-SC  modulation  discussed 
in  Sec.  4.2  except  that  an  additional  carrier  component  A  cos  coct  needs  to  be  added  to  the 
DSB-SC  signal. 


Demodulation  of  AM  Signals 

Like  DSB-SC  signals,  the  AM  signal  can  be  demodulated  coherently  by  a  locally  generated 
carrier.  Coherent,  or  synchronous,  demodulation  of  AM  defeats  the  purpose  of  AM,  however, 
because  it  does  not  take  advantage  of  the  additional  carrier  component  A  cos  coct.  As  we  have 
seen  earlier,  in  the  case  of  fi  <  1,  the  envelope  of  AM  signal  follows  the  message  signal  m(t). 
Hence,  we  shall  consider  two  noncoherent  methods  of  AM  demodulation  under  the  condition 
of  0  <  /i  <  1:  (1)  rectifier  detection  and  (2)  envelope  detection. 


Rectifier:  If  an  AM  signal  is  applied  to  a  diode  and  a  resistor  circuit  (Fig.  4.10),  the 
negative  part  of  the  AM  wave  will  be  removed.  The  output  across  the  resistor  is  a  half-wave 
rectified  version  of  the  AM  signal.  Visually,  the  diode  acts  like  a  pair  of  scissors  by  cutting  off 
any  negative  half-cycle  of  the  modulated  sinusoid.  In  essence,  at  the  rectifier  output,  the  AM 


1 96  AMPLITUDE  MODULATIONS  AND  DEMODULATIONS 


Figure  4.10 

Rectifier  detector 
for  AM. 


signal  is  multiplied  by  w(t).  Hence,  the  half-wave  rectified  output  vR(t)  is 
vrU)  =  {(A  +  m(r)]cos  ooct )  w(r) 

—  |A  +  nid)  |  cos  oj,  t  —  +  —  ^cos  a>ct  —  -  cos  3 a)ct  +  -  cos  5 coct  — 

I 

-  -[A  +  m(t)]  +  other  terms  of  higher  frequencies 


)] 


(4.12) 

(4.13) 

(4.14) 


When  i 'r(i)  is  applied  to  a  low-pass  filter  of  cutoff  B  Hz,  the  output  is  [A  +  m(t)]/n,  and  all  the 
other  terms  of  frequencies  higher  than  B  Hz  are  suppressed.  The  dc  term  A  In  may  be  blocked 

by  a  capacitor  (Fig.  4.10)  to  give  the  desired  output  m(t)/n.  The  output  can  be  doubled  by 
using  a  full-wave  rectifier. 

It  is  interesting  to  note  that  because  of  the  multiplication  with  w(r),  rectifier  detection  is  in 
cttect  synchronous  detection  performed  without  using  a  local  carrier.  The  high  carrier  content 
ensures  t  at  its  zero  crossings  are  periodic  and  the  information  about  frequency  and 
phase  of  the  carrier  at  the  transmitter  is  built  in  to  the  AM  signal  itself. 

enveUi^oTihp061^11!^  i'"  ^  .Cnve,ope  detector’  the  output  of  the  detector  follows  the 

envd wdete  L  nUH  ^  ^  Sh°Wn  in  ^  411a  funCti°nS  ** 

ZcZ^TclZe^  P”S1,IVe  CyCk  °f  theinpUt  Signa1’ the  input  8rows  and  may  exceed 
to  char«e  ud  to  the  ne  w  ^  ^apacity  tummg  on  the  diode  and  allowing  the  capacitor  C 
£ k X it ft T  °K  input  Signal  cycle-  As  ‘he  input  signal  falls  below  this 
^~dl  *  t‘k  CapaCit°r  V°ltage  (which  is  very  nearly  the  peak  voltage), 
rate  (with  tuime  constant  /fCt  CapaC'tor  now  discharges  through  the  resistor  if  at  a  slow 
input  signal  rises  above  the  capacToTvtltagTthe'diod'  ^  ^  d'ama  repCatS' A"  ^ 
peri^S  t°  thC  P^  va.ue  °f  this  (new)  cycle.  TTie  capacitor  d^sdi^es^ov^y^urir^theortoff 

signal  and  then  de^ys^ow^llntil'the^xtposiUve^65  7  ‘°  **  Peak  V°ltage  °*  ^ 
output  voltage  vc(r),  closely  follows  , he  r  1  Cyde’  as  shown  in  Fig.  4. 1  lb.  Thus,  the 

important,  the  slow  capacity  discharge  via  tlT"18  •  e'1Velope  ot  the  inPut  AM  s>gnal-  Eclually 
pauty  discharge  via  the  resistor  R  allows  the  capacity  voltage  to  follow 


Figure  4.1 1 

Envelope 
detector  for  AM. 


w 
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a  declining  envelope.  Capacitor  discharge  between  positive  peaks  causes  a  ripple  signal  of 
frequency  coc  in  the  output.  This  ripple  can  be  reduced  by  choosing  a  larger  time  constant 
RC  so  that  the  capacitor  discharges  very  little  between  the  positive  peaks  (RC  \/coc).  If 
RC  were  made  too  large,  however,  it  would  be  impossible  for  the  capacitor  voltage  to  follow 
a  fast  declining  envelope  (Fig.  4.1  lb).  Because  the  maximum  rate  of  AM  envelope  decline 
is  dominated  by  the  bandwidth  B  of  the  message  signal  m(t),  the  design  criterion  of  RC 
should  be 


l/o)c  <&RC  <  l/(2;rZ?)  or  2nB  <  <£  coc 

RC 

The  envelope  detector  output  is  vc(t)  =  A  -h  m(t)  with  a  ripple  of  frequency  coc.  The  dc  term 
A  can  be  blocked  out  by  a  capacitor  or  a  simple  RC  high-pass  filter.  The  ripple  may  be  reduced 
further  by  another  (low-pass)  RC  filter. 


4-4  BANDWIDTH-EFFICIENT  AMPLITUDE 
MODULATIONS 

As  seen  from  Fig.  4.12,  the  DSB  spectrum  (including  suppressed  carrier  and  AM)  has  two 
sidebands:  the  upper  sideband  (USB)  and  the  lower  sideband  (LSB),  both  containing  complete 
information  about  the  baseband  signal  m(t).  As  a  result,  for  a  baseband  signal  m(t)  with 
bandwidth  B  Hz,  DSB  modulations  require  twice  the  radio-frequency  bandwidth  to  transmit. 
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Figure  4.12 

(a)  Original 
message  spec¬ 
trum  and 

(b)  the  redundant 
bandwidth 
consumption  in 
DSB  modula¬ 
tions. 


(a) 


Figure  4.13 

SSB  spectra  from 
suppressing  one 
DSB  sideband. 


(a)  Baseband 


(d)  LSB 


A. 

~Vc 


To  improve  the  spectral  efficiency  of  amplitude  modulation,  there  exist  two  basic  schemes  to 
either  utilize  or  remove  the  100%  spectral  redundancy: 


Single-sideband  (SSB)  modulation,  which  removes  either  the  LSB  or  the  USB  so  that  for 
one  message  signal  m(t ),  there  is  only  a  bandwidth  of  B  Hz. 


Quadrature  amplitude  modulation  (QAM),  which 
two  messages  over  the  same  bandwidth  of  2 B  Hz. 


utilizes  spectral  redundancy  by  sending 


4.4.1  Amplitude  Modulation:  Single  Sideband  (SSB) 


via  bandpass  filtering ^uch^'  t  ^  **  ^  Ca"  **  suPPressed  from  the  DSB  sig"a‘ 
bandpass  hitenng.  Such  a  scheme,  in  which  only  one  sideband  is  transmitted,  is  known  as 
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Figure  4.14 

Transfer  function 
of  an  ideal  n/2 
phase  shifter 
(Hilbert  trans¬ 
former). 
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single-sideband  (SSB)  transmission,  and  it  requires  only  one-half  the  bandwidth  of  the  DSB 
signal. 

An  SSB  signal  can  be  coherently  (synchronously)  demodulated  just  like  DSB-SC  signals. 
For  example,  multiplication  of  a  USB  signal  (Fig.  4. 1 3c)  by  cos  coct  shifts  its  spectrum  to  the 
left  and  right  by  fc,  yielding  the  spectrum  in  Fig.  4. 1 3e.  Low-pass  filtering  of  this  signal  yields 
the  desired  baseband  signal.  The  case  is  similar  with  LSB  signals.  Since  the  demodulation  of 
SSB  signals  is  identical  to  that  of  DSB-SC  signals,  the  transmitters  can  now  utilize  only  half 
the  DSB-SC  signal  bandwidth  without  any  additional  cost  to  the  receivers.  Since  no  additional 
carrier  accompanies  the  modulated  SSB  signal,  the  resulting  modulator  outputs  are  known  as 
suppressed  carrier  signals  (SSB-SC). 


Hilbert  Transform 

We  now  introduce  for  later  use  a  new  tool  known  as  the  Hilbert  transform.  We  use  x/,(t)  and 
~H{x(t)}  to  denote  the  Hilbert  transform  of  a  signal  x(t) 

1  f00  x(a) 

*h(0  =  H{x(t)}  =  -  — — da  (4.15) 

X  J- OO  t  -  a 

Observe  that  the  right-hand  side  of  Eq.  (4.15)  has  the  form  of  a  convolution 

x(t)  *  — 

7Tt 

Now,  application  of  the  duality  property  to  pair  12  of  Table  3.1  yields  \/tc t  <=>  —j  sgn  if). 
Hence,  application  of  the  time  convolution  property  to  Eq.  (4. 15)  convolution  yields 

Xh(f)  =  ~jX  if)  sgn  if)  (4.16) 


From  Eq.  (4.16),  it  follows  that  if  m{t)  is  passed  through  a  transfer  function  Hif)  = 
—j  sgn  if ),  then  the  output  is  mh{t),  the  Hilbert  transform  of  m(t).  Because 


Hif)  =  -j  sgn  if) 

— j  =  1  •  e~in^  f  >  0 
—  j  =  1  •  eini2  f  <  0 


(4.17) 

(4.18) 


it  follows  that  \H(f)\  =  1  and  that  Ohif)  =  —n/2  for/  >  0  and  n/ 2  for /  <  0,  as  shown 
in  Fig.  4.14.  Thus,  if  we  change  the  phase  of  every  component  of  mit)  by  n/2  (without 
changing  any  component’s  amplitude),  the  resulting  signal  is  w^(r),  the  Hilbert  transform  of 
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Figure  4.1 5 

Expressing  SSB 
spectra  in  terms 
of  M+{f)  and 


(a) 


(b) 


MSf  +  fc) 


~fc 


0 


M+(f  +  fc) 


~fc 


0 


»<(/).  Therefore,  a  Hilbert  transformer  is  an  ideal  phase  shifter  that  shifts  the  phase  of  every 
positive  spectral  component  by  -jt/2. 


Time  Domain  Representation  of  SSB  Signals 

Because  the  building  blocks  of  an  SSB  signal  are  the  sidebands,  we  shall  first  obtain  a  time 
domain  expression  for  each  sideband. 

Figure  4. 1 5a  shows  the  message  spectrum  M  (f  ).  Figure  4.15b  shows  its  right  half  M+(f) 
and  F,g.  4.15c  shows  its  left  half  From  Fig.  4.15b  and  c,  we  observe  that 


M+(f)  M(f)-M(0_M(f)i[l  +  Sgn(/-)]=  ^[M(f)+jMh(f)]  (4.19a) 

M_(f)  =  M  (f)u(-f)  =  M(f)  1  [1  -  sgn  (/•)]  =  I  [M(f) -jMh(f))  (4.19b) 


-  -  f-  4- ,5d  *  isclear 


^usbIT)  —  M+(f  -fc)  +  M-(f  +fc ) 


1 

2 


W  (f  -fc)  +  M(f  +/c)]  _/c)  _  Mh(f  +fc)] 
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From  the  frequency-shifting  property,  the  inverse  transform  of  this  equation  yields 

^usb  (0  =  m(0  cos  a >ct  -  mh(t)  sin  coct  (4.20a) 

Similarly,  we  can  show  that 

<Plsb  (0  =  »(0 cos  (Oct  +  mh(t)  sin  coct  (4.20b) 

Hence,  a  general  SSB  signal  ^)SSB(/)  can  be  expressed  as 

<Pssb  W  =  m(0  cos  coct  =F  mh(t)  sin  coct  (4.20c) 

where  the  minus  sign  applies  to  USB  and  the  plus  sign  applies  to  LSB. 

Given  the  time-domain  expression  of  SSB-SC  signals,  we  can  now  confirm  analytically 
(instead  of  graphically)  that  SSB-SC  signals  can  be  coherently  demodulated: 

<PSSb  (0  cos  a>ct  =  [m(t)  cos  wct  q=  mh(t)  sin  coct]  2 cos  coct 
=  m(f)[l  -l-  cos  2coct]  q=  mh(t)  sin  2 cjct 
=  m(t )  +  [»!(/)  cos  2wct  qF  wi/,(/)  sin  2 coct] 

V  -  .  I  ^ 

SSB-SC  signal  with  carrier  frequency  2coc 

Thus,  the  product  (pSSB(t )  •  2 cos  a)ct  yields  the  baseband  signal  and  another  SSB  signal  with  a 
carrier  frequency  2 coc.  The  spectrum  in  Fig.  4. 1 3e  shows  precisely  this  result.  A  low-pass  filter 
will  suppress  the  unwanted  SSB  terms,  giving  the  desired  baseband  signal  m(t).  Hence,  the 
demodulator  is  identical  to  the  synchronous  demodulator  used  for  DSB-SC.  Thus,  any  one  of 
the  synchronous  DSB-SC  demodulators  discussed  earlier  in  Sec.  4.2  can  be  used  to  demodulate 
an  SSB-SC  signal. 


Example  4.6  Tone  Modulation:  SSB 

Find  ^SSB(/)  for  the  simple  case  of  a  tone  modulation,  that  is,  a  modulating  signal  that  is  a 
sinusoid  m(t)  =  cos  comt.  Also  demonstrate  the  coherent  demodulation  of  this  SSB  signal. 

Recall  that  the  Hilbert  transform  delays  the  phase  of  each  spectral  component  by  tt/2. 
In  the  present  case,  there  is  only  one  spectral  component,  of  frequency  com.  Delaying  the 
phase  of  m{t)  by  7t/2  yields 

mh(t)  =  cos  (comt  -  =  sin  comt 

\  2/ 

Hence,  from  Eq.  (4.20c), 

^ssb(0  =  cos  comt  cos  coct  =f  sin  comt  sin  coct 
=  cos  (coc  ±  co,„)t 

Thus, 

^usb  ~  cos  (d>c  “h  com)t  and  ^lsb  M  =  cos  (coc  —  com)t 

To  verify  these  results,  consider  the  spectrum  of  m(t)  (Fig.  4.16a)  and  its  DSB-SC 
(Fig.  4.16b),  USB  (Fig.  4.16c),  and  LSB  (Fig.  4.16d)  spectra.  It  is  evident  that  the  spectra 
in  Fig.  4.16c  and  d  do  indeed  correspond  to  the  <pVSBU)  and  <pLSBU)  derived  earlier. 


202  AMPLITUDE  MODULATIONS  AND  DEMODULATIONS 


Figure  4.16 
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Finally,  the  coherent  demodulation  of  the  SSB  tone  modulation  can  be  achieved  by 

(PssbU)2cos  coct  =  2  cos  (a»c  ±  com)t  cos  coct 
=  cos  comt  +  cos  (a>c  +  (0m)t 

which  can  be  sent  to  a  low-pass  filter  to  retrieve  the  i 


message  tone  cos  a>mt. 


SSB  Modulation  Systems 

m-ZIT"'5'  ,olmmK  SSB  signals:  phase  shifting,  selective  tillering. 
mSltS  !  *  m0dUla,i0n  «  P-cm.  and  each  one  generally 

Thinhl  shm  IthS  SP!C,r“m  haW  We  P°"a'  the  origin, 
its  implemenwtioihijT/^desigluitef^qcff^  *  ^ F'8  4'7'  whkh  ^ 
positive  spectral  component  by  1/2  Hen«  u/al'lh  '"'  Wh‘Ch  ddays  thc  pha“  °f  £2 

phase  shifter  is  unrealizable  This  ^tecause  of  m  " ‘ra"sformer  No,e that  an  ideal  H''  " 
of  n  at  zero  freouencv  Who  T  of  the  recluirement  for  an  abrupt  phase  change 

content^ ^  dc  null  and  very  littl^  low-frequency 

does  not  affect  the  accuracy  of  SSB  modulation  ?  ***  alm°St  n°  ^  ***** 

signals,  a  DSB-SC  sigildTp^sTed^nuh  mhSt  COmmonly  used  method  of  generating  SSB 

band.  To  obtain  the  USB  the  filter  h  if  US  arP  cutoff  filter  to  eliminate  the  undesired  side- 
and  completely  suppret  a,l  11  Tt  a"  comP°nents  above  frequency/,  unattenua.ed 

mponents  below  frequency/,.  Such  an  operation  requires  an 


Figure  4.17 

Using  the 
phase-shift 
method  to 
generate  SSB. 


Figure  4.18 

(a)  Relative 
power  spectrum 
of  speech  signal 
and  (b)  the 
corresponding 
USB  spectrum. 
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Figure  4.17 

Using  the 
phase-shift 
method  to 
generate  SSB. 


Figure  4.18 

(a)  Relative 
power  spectrum 
of  speech  signal 
and  (b)  the 
corresponding 
USB  spectrum. 
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ideal  filter,  which  is  unrealizable.  It  can,  however,  be  approximated  closely  if  there  is  some 
separation  between  the  passband  and  the  stopband.  Fortunately,  the  voice  signal  provides 
this  condition,  because  its  spectrum  shows  little  power  content  at  the  origin  (Fig.  4.18a).  In 
addition,  articulation  tests  have  shown  that  for  speech  signals,  frequency  components  below 
300  Hz  are  not  important.  In  other  words,  we  may  suppress  all  speech  components  below 
300  Hz  (and  above  3500  Hz)  without  affecting  intelligibility  appreciably.  Thus,  filtering  of 
the  unwanted  sideband  becomes  relatively  easy  for  speech  signals  because  we  have  a  600  Hz 
transition  region  around  the  cutoff  frequency  fc.  To  minimize  adjacent  channel  interference, 
the  undesired  sideband  should  be  attenuated  at  least  40  dB. 

For  very  high  carrier  frequency  fc ,  the  ratio  ot  the  gap  band  (600  Hz)  to  the  carrier 
frequency  may  be  too  small,  and,  thus,  a  transition  of  40  dB  in  amplitude  over  600  Hz  may 
be  difficult.  In  such  a  case,  a  third  method,  known  as  Weaver’s  method.1  utilizes  two  stages 
of  SSB  amplitude  modulation.  First,  the  modulation  is  earned  out  using  a  smaller  carrier 
frequency fCx .  The  resulting  SSB  signal  effectively  widens  the  gap  to  2 fC]  (shaded  spectrum  in 


204  AMPLITUDE  MODULATIONS  AND  DEMODULATIONS 


Fig.  4. 1 8b).  Now.  by  treating  this  signal  as  the  new  baseband  signal,  it  is  possible  to  accomplish 
SSB  modulation  to  a  higher  carrier  frequency. 

Detection  of  SSB  Signals  with  a  Carrier  (SSB+C) 

We  now  consider  SSB  signals  with  an  additional  carrier  (SSB+C).  Such  a  signal  can  be 
expressed  as 


Figure  4.19 

Quadrature 

amplitude 

modulation. 


^ssb+c  =  ^  cos  +  [w(0  cos  coct  +  mij(t)  sin  coct] 

and  m(t)  can  be  recovered  by  synchronous  detection  [multiplying  <pSSB+c  by  cos  coct ]  if  the 
carrier  component  A  cos  u)ct  can  be  extracted  (by  narrowband  filtering  of)  <pSSB+c.  Alternatively, 
if  the  carrier  amplitude  A  is  large  enough,  m(t)  can  also  be  (approximately)  recovered  from 
^ssb+c  by  envelope  or  rectifier  detection.  This  can  be  shown  by  rewriting  <Pssb+c  as 

^ssb+c  =  +  w(0]  cos  (oct  +  m/,(f)  sin  coct 

=  E(t)  cos  (coct  +  0)  (4.21) 


where  E(f),  the  envelope  of  <pSSB+c,  is  given  by  [see  Eq.  (3.41a)] 
E(t)  =  {[4  +  m(t)]2  +  m2h(t)}l/2 


=  A 


l  +  2m(f)  +  ml(t) 


A2 


A2 


If  A  »  |m(/)|,  then  in  general*  A  »  |m*(/)|t  and  the  terms  m2{t)/A 2  and  m2(t)/A2  can  be 
ignored.  Thus,  ' 


E(t)  ~  A 


1  + 


2  m(t) 


~\ 1/2 


Using  Taylor  series  expansion  and  discarding  higher  order  terms  [because  \m(t)/A\  «  •]• 
we  get 


E(t)  ~  A 


1  + 


m(/)‘ 


=  A  +  m(t) 

"  “  [I  am'  ’hal “  V8'  Ca,Tier' lh'  SSB+C  can  be  demodulated  by  an  envelope  deteciof. 

ending  ra,UireS  ,he  COndi,ion  whereas  for  SSB+C.  the 

th  in  that  in  AM  „  T  '"“V"  SSB  case,  the  required  carrier  amplitude  is  much  larger 
than  .ha,  AM.  and.  consequently,  the  efficiency  of  SSB+C  is  pathetically  low. 

4.4.2  Quadrature  Amplitude  Modulation  (QAM) 

(QAM? offered  afuSre  atema!' ve  S' SSBSC  ' Oa'm'  ‘|Ua*a'llre  amPli,ude  mod“'a'^ 

ood-SC.  QAM  can  be  exactly  generated  withou 


*  This  may  no.  be  true  for  all ,.  but  it  is  true  for  most  r. 
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Figure  4. 1 9 

Quadrature 

amplitude 

modulation. 


requiring  sharp  cutoff  bandpass  filters.  QAM  operates  by  transmitting  two  DSB  signals  via 
carriers  of  the  same  frequency  but  in  phase  quadrature,  as  shown  in  Fig.  4.19.  This  scheme  is 
known  as  quadrature  amplitude  modulation  (QAM),  or  quadrature  multiplexing. 

As  shown  Fig.  4.19,  the  boxes  labeled  —jt/2  are  phase  shifters  that  delay  the  phase  of  an 
input  sinusoid  by  -jt/2  rad.  If  the  two  baseband  message  signals  for  transmission  are  m\ (t)  and 
m2(t),  the  corresponding  QAM  signal  y>QAM  (f),  the  sum  of  the  two  DSB-modulated  signals,  is 

•Pqam  (0  =  m  (0  cos  wct  +  m2(t)  sin  coct 

Both  modulated  signals  occupy  the  same  band.  Yet  two  baseband  signals  can  be  separated  at 
the  receiver  by  synchronous  detection  if  two  local  carriers  are  used  in  phase  quadrature,  as 
shown  in  Fig.  4. 1 9.  This  can  be  shown  by  considering  the  multiplier  output  x\  (/)  of  the  upper 
arm  of  the  receiver  (Fig.  4. 19): 

X|  (r)  =  2 (pQAM  (0  cos  coct  =  2[m\  (r)  cos  coct  +  m2(t)  sin  coct ]  cos  coct 

=  m\(t)  +  m\ (t) cos  2ooct  +  m2(t)  sin  2coct  (4.22a) 

The  last  two  terms  are  bandpass  signals  centered  around  2 fc.  In  fact,  they  actually  form  a  QAM 
signal  with  2fc  as  the  carrier  frequency.  They  are  suppressed  by  the  low-pass  filter,  yielding 
the  desired  demodulation  output  m\ (/).  Similarly,  the  output  of  the  lower  receiver  branch  can 
be  shown  to  be  m2(t). 

jc2 (t)  =  2 (pqm(D  sin  (oct  =  2[m\(t) cos  coct  +  m2(t) sin  a>cr]sin  coct 

=  W2(0  —  m2(0  cos  +  m  i  (/)  sin  2 (oct  (4.22b) 

Thus,  two  baseband  signals,  each  of  bandwidth  B  Hz,  can  be  transmitted  simultaneously 
over  a  bandwidth  2 B  by  using  DSB  transmission  and  quadrature  multiplexing.  The  upper 
channel  is  also  known  as  the  in-phase  (I)  channel  and  the  lower  channel  is  the  quadrature 
(Q)  channel.  Both  signals  m\ (/)  and  m2(t)  can  be  separately  demodulated. 

Note,  however,  that  QAM  demodulation  must  be  totally  synchronous.  An  error  in  the  phase 
or  the  frequency  of  the  carrier  at  the  demodulator  in  QAM  will  result  in  loss  and  interference 
between  the  two  channels.  To  show  this  let  the  carrier  at  the  demodulator  be  2  cos  ( coct  +  6). 
In  this  case, 

X]  (t)  =  2\m\ (r)  cos  coct  +  m2(t)  sin  u>ct]  cos  (coct  +  9) 

=  m\ (t)  cos  9  -  m2(t)  sin  9  +  m\ (t) cos  (2 coct  +  9)  +  m2(t)  sin  (2 coct  +  9) 
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The  low-pass  filter  suppresses  the  two  signals  modulated  by  earner  of  frequency  2 fc,  resulting 
in  the  first  demodulator  output 


Figure  4.20 

Spectra  of  the 
modulating 


signal  and 


wi(/)cos  0  —  m2  (r)sin  0 


corresponding 
DSB,  SSB,  and 
VSB  signals. 


Thus,  in  addition  to  the  desired  signal  we  also  receive  signal  mi(t)  in  the  upper  receiver 
branch.  A  similar  phenomenon  can  be  shown  for  the  lower  branch.  This  so-called  cochannel 
interference  is  undesirable.  Similar  difficulties  arise  when  the  local  frequency  is  in  error  (see 
Prob.  4.4-1).  In  addition,  unequal  attenuation  of  the  USB  and  the  LSB  during  transmission 
also  leads  to  cross  talk  or  cochannel  interference. 

Quadrature  multiplexing  is  used  in  analog  color  television  to  multiplex  the  so-called 
chrominance  signals,  which  carry  information  about  colors.  There,  synchronization  is  achieved 
by  periodic  insertion  of  a  short  burst  of  carrier  signal  (called  a  color  burst  in  the  transmitted 
signal,  as  explained  in  Sec.  4.9).  Digital  satellite  television  transmission  also  applies  QAM. 

With  respect  to  bandwidth  requirement,  SSB  is  similar  to  QAM  but  less  exacting  in  terms 
ot  the  carrier  frequency  and  phase  or  the  requirement  of  a  distortionless  transmission  medium. 
However,  SSB  is  difficult  to  generate  if  the  baseband  signal  m(t)  has  significant  spectral  con¬ 
tent  near  the  direct  current  (dc). 


Figure  4.21 

VSB  modulator 
and 

demodulator. 


4.5  AMPLITUDE  MODULATIONS:  VESTIGIAL 
SIDEBAND  (VSB) 


As  discussed  earlier,  the  generation  of  exact  SSB  signals  is  rather  difficult:  generally,  the 
message  signal  m(t)  must  have  a  null  around  direct  current.  A  phase  shifter  required  in  the 
phase  shift  method  is  unrealizable,  or  realizable  only  approximately.  The  generation  of  DSB 
signals  is  much  simpler  but  requires  twice  the  signal  bandwidth.  The  vestigial-sideband  (VSB) 
modulation  system,  also  called  asymmetric  sideband,  is  acomoromise  hetween  DSB  and  SSB. 


^vsbV)  =  [M<f  +fc)  +  M(f  -£)]//,(/•)  (4.23) 


(4.23) 


incoming  VSB  signal  <pVSB(r)  by  2  cos  (oct ■ 1116 


4.5  Amplitude  Modulations:  Vestigial  Sideband  (VSB) 
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Figure  4.20 

Spectra  of  the 
modulating 
signal  and 
corresponding 
DSB,  SSB,  and 
VSB  signals. 


Figure  4.21 

VSB  modulator 
and 

demodulator. 
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product  e(t)  is  given  by 

e(t)  =  2<pVSB(f)cos  (Oct  <=>  [4>vsb  (f  +/c)  +  ^vsb  (f  -  fc )] 

The  signal  e(t)  is  further  passed  through  the  low-pass  equalizer  filter  of  transfer  function  H()(f). 
The  output  of  the  equalizer  filter  is  required  to  be  m(t).  Hence,  the  output  signal  spectrum  is 
given  by 


M(f)  =  [Qvssif  +fc)  +  4>vsb  (f  —  fc)\Ho(f) 


Substituting  Eq.  (4.23)  into  this  equation  and  eliminating  the  spectra  at  ±2 fc  [suppressed  by  a 
low-pass  filter  H0(f)],  we  obtain 


M(f)  =  M  (f)[Hi(f  +fc)+Hi(f  -fc)]H0(f) 


Hence 


1 

Ho(f)  =  //,(/'  +/«•)  +  H,(j  -fc) 


\f\<B 


(4.24) 


(4.25) 


Note  that  because  Hj(f)  is  a  bandpass  filter,  the  terms  H,(j  ±fc)  contain  low-pass  components. 
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Complementary  VSB  Filter  and  Envelope  Detection 
of  VSB+C  Signals 

As  a  special  case  of  filter  at  the  VSB  modulator,  we  can  choose  Hj(f)  such  that 


Htf  +fe )  +  Htf  -fc)  =1  [f\<B  (4.26) 


Then  output  filter  is  just  a  simple  low-pass  filter  with  transfer  function 


H0(f)  =1  \f\  <  B 

The  resulting  VSB  signal  plus  carrier  (VSB+C)  can  be  envelope-detected.  This  demod¬ 
ulation  method  may  be  proved  by  using  exactly  the  same  argument  used  in  proving  the  case 
for  SSB+C  signals.  In  particular,  because  of  Eq.  (4.26),  we  can  define  a  new  low-pass  filter 

F(f)  =/•[  1  -  2  Htf  -fc)]  =  — y  [1  -  2  Hi(f  +fc)]  \f\<B 


Upon  defining  a  new  (complex)  low-pass  signal  as 

mv{t)  <=*  My(J)  =  F(f)M{f) 

we  can  rewrite  the  VSB  signal  as 

4>vsb(/-)  =  M{f-fe)  +  M<f+fe)  AW  ~/c)  -  +/c) 

2  2 j 


(4.27a) 


Vvsb(')  =  m(t)  cos  2nfct  +  Wv(f)sin  2 nfct 


(4.27b) 


C  learly,  both  the  SSB  and  the  VSB  modulated  signals  have  the  same  form,  with  mh(t)  in 
SSB  replaced  by  a  low-pass  signal  mv{t)  in  VSB.  Applying  the  same  analysis  from  the  SSB+C 

envelope  detection,  a  large  carrier  addition  to  Vvsb(0  would  allow  the  envelope  detection  of 
VSB-fC. 

\\e  have  shown  that  SSB+C  requires  a  much  larger  carrier  than  DSB+C  (AM)  for  enve- 
ope  detection.  Because  VSB+C  is  an  in-between  case,  the  added  carrier  required  in  VSB  is 
larger  than  that  in  AM,  but  smaller  than  that  in  SSB-f  C 


Example  4.7  =  20  kHz,  and  fee  baseband  signal  bandwidth 

ove,  2  kHz,  ,s  shown  F,g.  4.22a.  Find  ,he  output  filter  H„if)  required  lor  distortionless 


reception. 


Figure  4.22b  shows  the  low-pass  segments  of  Htf  +  fc)  +  H  (f  -  f ).  We  are  interested 
by  the  ompuUilter) '^h VCr  ***  baSeband  (the  remainin8  undesired  portion  is  suppressed 

5£ze.' ,he  r of  ° ,o  2  kH'  »nd  is  1  °>er  2  r  s 

the  reciprocal  of  the  spectnrm  to  Fig.  4.22b  W W 
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Figure  4.22 
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Figure  4.23 

Transmitter  filter 
Ht if),  receiver 
front-end  filter 
HR{f),  and  the 
receiver  output 
low-pass  filter 
Hotf)  in  VSB 
television 
systems. 


Use  of  VSB  in  Broadcast  Television 

VSB  is  very  attractive  for  television  broadcast  systems  because  it  is  a  clever  compromise 
between  SSB  and  DSB.  The  baseband  video  signal  of  television  occupies  an  enormous  band¬ 
width  of  4.5  MHz,  and  a  DSB  signal  needs  a  bandwidth  of  9  MHz.  It  would  seem  desirable 
to  use  SSB  to  conserve  bandwidth.  Unfortunately,  this  creates  several  problems.  First,  the 
baseband  video  signal  has  sizable  power  in  the  low-frequency  region,  and  consequently  it 
is  difficult  to  suppress  one  sideband  completely.  Second,  in  designing  a  broadcast  receiver, 
we  prefer  an  envelope  detector  over  a  synchronous  one  to  reduce  the  receiver  cost.  We  saw 
earlier  that  SSB+C  has  a  very  low  power  efficiency;  moreover,  use  of  SSB  would  increase  the 
receiver  cost. 

The  spectral  shaping  of  television  VSB  signals  can  be  illustrated  by  Fig.  4.23.  The  vestigial 
spectrum  is  controlled  by  two  filters:  the  transmitter  RF  filter  HT(f)  and  the  receiver  RF  filter 
Hftif).  Jointly  we  have 

Hiif)  =  HT(f)HR(f) 


Hence,  the  design  of  the  receiver  output  filter  H„(f)  follows  Eq.  (4.25). 
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Figure  4.24 

Television  signal 
spectra:  (a)  DSB 
video  signal  plus 
audio; 

(b)  signal 
transmitted. 


(a) 


/,  MHz 


The  DSB  spectrum  of  a  television  signal  is  shown  in  Fig.  4.24a.  The  vestigial  shaping 
filter  HT(f)  cuts  off  the  lower  sideband  spectrum  gradually,  starting  at  0.75  MHz  to  1 .25  MHz 
below  the  carrier  frequency  fc,  as  shown  in  Fig.  4.24b.  The  receiver  output  filter  H„(f)  is 
designed  according  to  Eq.  (4.25).  The  resulting  VSB  spectrum  bandwidth  is  6  MHz.  Compare 
this  with  the  DSB  bandwidth  of  9  MHz  and  the  SSB  bandwidth  of  4.5  MHz. 


4.6  LOCAL  CARRIER  SYNCHRONIZATION 


n  a  suppressed-carrier,  amplitude-modulated  system  (DSB-SC,  SSB-SC,  and  VSB-SC),  tl 
coherent  receiver  must  generate  a  local  carrier  that  is  synchronous  with  the  incoming  card 
frequency  and  phase).  As  discussed  earlier,  any  discrepancy  in  the  frequency  or  phase  of  tl 

local  carrier  gives  rise  to  distortion  in  the  detector  output. 

C  onsider  an  SSB-SC  case  where  a  received  signal  is 


w(t)  cos  |(<uc  +  Au))t  +  8]  -  mh(t)  sin  [(<of  +  A co)t  +  S] 

anli  D°PPlCT  freW  shifl-  The  local  earner  remains  as 
2  cos  <*r.  The  produet  of  .he  reee„ed  signal  and  rhe  local  caroler  is  e (,).  given  by 


m  ~  2C“  W'tcosfru,.  +  Arur  +  S)  -  sin  +  Aall  + 

-  m(t)  cos  (A (at  +  S)  -  mh(t)  sin  (A cot  +  8) 

+  !”(f)cos  l&Oc  +  +  3]  -  m/, (r)  sin  [(2a>c  +  A co)t  +  3]  (4.28) 

bandpass  SSB-SC  signal  around  2 coc  +  Au> 

The  bypass  eonrponen,  is  fi„ered  on,  by  rhe  receiver  low-pass  hirer,  leaving  .he  onrpn. 


c„(t)  =  m(t)  cos  (A cot  +  5) 


w*(0  sin  (Acot  +  8) 


4.7  FREC 


(4.29) 
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If  A co  and  8  are  both  zero  (no  frequency  or  phase  error),  then 

eQ(t)  =  m(t) 


as  expected. 

In  practice,  if  the  radio  wave  travels  a  distance  of  d  meters  at  the  speed  of  light  c,  then  the 
phase  delay  is 


8  =  —  ( coc  +  A  co)d/c 

which  can  be  any  value  within  the  interval  [-7T,  +7r].  Two  oscillators  initially  of  identical 
frequency  can  also  drift  apart.  Moreover,  if  the  receiver  or  the  transmitter  is  traveling  at  a 
velocity  of  ve ,  then  the  maximum  Doppler  frequency  shift  would  be 

A,  _  vef 
A/ max  —  Jc 
C 

The  velocity  ve  depends  on  the  actual  vehicle  (e.g.  spacecraft,  airplane,  and  car).  For  example, 
if  the  mobile  velocity  ve  is  108  km/h,  then  for  carrier  frequency  at  100  MHz,  the  maximum 
Doppler  frequency  shift  would  be  10  Hz.  Such  a  shift  of  every  frequency  component  by  a  fixed 
amount  Aa>  destroys  the  harmonic  relationship  among  frequency  components.  For  A/  =  10 
Hz,  the  components  of  frequencies  1000  and  2000  Hz  will  be  shifted  to  frequencies  1010  and 
2010  Hz.  This  destroys  their  harmonic  relationship,  and  the  quality  of  nonaudio  signals. 

It  is  interesting  to  note  that  audio  signals  are  highly  redundant,  and  unless  A /  is  very  large, 
such  a  change  does  not  destroy  the  intelligibility  of  the  output.  For  audio  signals  A/  <  30 
Hz  does  not  significantly  affect  the  signal  quality.  A  A /  exceeding  30  Hz  results  in  a  sound 
quality  similar  to  that  of  Donald  Duck.  But  intelligibility  is  not  completely  lost. 

Generally,  there  are  two  ways  to  recover  the  incoming  carrier  at  the  receiver.  One  way  is 
for  the  transmitter  to  transmit  a  pilot  (sinusoid)  signal  that  is  either  the  exact  carrier  or  directly 
related  to  the  carrier  (e.g.,  a  pilot  at  half  the  carrier  frequency).  The  pilot  is  separated  at  the 
receiver  by  a  very  narrowband  filter  tuned  to  the  pilot  frequency.  It  is  amplified  and  used  to 
synchronize  the  local  oscillator.  Another  method,  when  no  pilot  is  transmitted,  is  for  the  receiver 
to  process  the  received  signal  by  means  of  a  nonlinear  device  to  generate  a  separate  carrier 
component  to  be  extracted  by  means  of  narrow  bandpass  filters.  Clearly,  effective  and  narrow 
bandpass  filters  are  very  important  to  both  methods.  Moreover,  the  bandpass  filter  should  also 
have  the  ability  to  adaptively  adjust  its  center  frequency  to  combat  significant  frequency  drift 
or  Doppler  shift.  Aside  from  some  typical  bandpass  filter  designs,  the  phase-locked  loop  (FLL), 
which  plays  an  important  role  in  carrier  acquisition  of  various  modulations,  can  be  viewed  as 
such  a  narrow  and  adaptive  bandpass  filter.  The  principles  of  PLL  will  be  discussed  later  in 
this  chapter. 

FREQUENCY  DIVISION  MULTIPLEXING  (FDM) 

Signal  multiplexing  allows  the  transmission  of  several  signals  on  the  same  channel.  In  Chapter 
6,  we  shall  discuss  time  division  multiplexing  (TDM),  where  several  signals  time-share  the 
same  channel.  In  FDM,  several  signals  share  the  band  of  a  channel.  Each  signal  is  modulated 
by  a  different  carrier  frequency.  The  various  carriers  are  adequately  separated  to  avoid  overlap 
(or  interference)  between  the  spectra  of  various  modulated  signals.  These  carriers  are  referred 
to  as  subcarriers.  Each  signal  may  use  a  different  kind  of  modulation  (e.g.,  DSB-SC,  AM, 
SSB-SC,  VSB-SC,  or  even  frequency  modulation  or  phase  modulation).  The  modulated-signal 
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Figure  4.25 
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spectra  may  he  separated  by  a  small  guard  band  to  avoid  interference  and  to  facilitate  signal 
separation  at  the  receiver. 

When  all  the  modulated  spectra  have  been  added,  we  have  a  composite  signal  that  may 
be  considered  to  be  a  baseband  signal  to  further  modulate  a  radio-frequency  (RF)  carrier  for 
the  purpose  of  transmission. 

At  the  receiver,  the  incoming  signal  is  first  demodulated  by  the  RF  carrier  to  retrieve  the 
composite  baseband,  which  is  then  bandpass-filtered  to  separate  each  modulated  signal.  Then 
each  modulated  signal  is  demodulated  individually  by  an  appropriate  subcarrier  to  obtain  all 
the  basic  baseband  signals. 

One  simple  example  of  FDM  is  the  analog  telephone  long-haul  system.  There  are  two 
types  of  long-haul  telephone  carrier  systems:  the  legacy  analog  L-carrier  hierarchy  systems 
and  the  digital  T-camer  hierarchy  systems  in  North  America  (or  the  E-carrier  in  Europe).  Both 
were  tan  ar  lzed  by  the  predecessor  of  the  International  Telecommunications  Union  known 

w  r|'i,:\aS,h^rC^TTlCU,"'l'O,nSUllalifln'crria'i0nan'l>!I>h',n|qucctTc‘leg|aph|quf) 

L  ,  ,  r  r".  !  ?na,0E  lelcph°"e  hierareh>  “liliKS  FDM  and  SSB  modulation 

here  and  deter  the  digital  hierarchy  discussion  until  Chapter  6. 

voice  ch^n^ff5  L~Ter  h‘urarChy'2  CaCh  VO'Ce  channeI  is  modulated  using  SSB+C.  Twelve 
As  shown TttJ 7-^  T  e  8r°UP  °CCUpying  the  bandwidth  between  60  and  1 08  kHz. 

is  achieved  by  maintaining  the  channel  carrier  separation  of  4  kHz. 

supergroups  >>mierat'sL!  nRh|  gloups  torm  a  supergroup  via  FDM.  Multiplexing  10 

which  consists  of  %oo  ^  ehrgr0“P  and  mult'plexing  6  supergroups  forms  a  jumbo  group. 

AutuZ* of  16.984  MHz  in  the  L4  system. 

interference  reduction  and  for  inserting  diS"™13  addltl°nal  frecluency  gaps  are  provided  for 

the  baseband  input  of  a  microwave  radio  ihaS'Sdte  mU'tipleXed  signal  Can  be  fed  'nt° 

v  rauio  cnannel  or  directly  into  a  coaxial  transmission  system. 


4.8  PHASE-LOCKED  LOOP  AND  APPLICATIONS 


Phase-Locked  Loop  (PLL) 

The  phase-locked  loop  is  a  verv  ininm-t™.  a 

frequency  of  the  carrier  component  of  an  •  CV1Ce  typical|y  used  to  track  the  phase  and  the 
mer  component  of  an  incoming  signal.  It  is,  therefore,  a  useful  device  for 


Figure  4.26 

Phase-locked 
loop  and  its 
equivalent 
circuit. 
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Figure  4.26 
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the  synchronous  demodulation  of  AM  signals  with  a  suppressed  carrier  or  with  a  little  carrier 
(pilot).  It  can  also  be  used  for  the  demodulation  of  angle-modulated  signals,  especially  under 
low  SNR  conditions.  It  also  has  important  applications  in  a  number  of  clock  recovery  systems 
including  timing  recovery  in  digital  receivers.  For  these  reasons,  the  PLL  plays  a  key  role  in 
nearly  every  modern  digital  and  analog  communication  system. 

A  PLL  has  three  basic  components: 

1.  A  voltage-controlled  oscillator  (VCO). 

2.  A  multiplier,  serving  as  a  phase  detector  (PD)  or  a  phase  comparator. 

3.  A  loop  filter  H (s). 

Basic  PLL  Operation 

The  operation  of  the  PLL  is  similar  to  that  of  a  feedback  system  (Fig.  4.26a).  In  a  typical 
feedback  system,  the  feedback  signal  tends  to  follow  the  input  signal.  If  the  feedback  signal 
is  not  equal  to  the  input  signal,  the  difference  (known  as  the  error)  will  change  the  feedback 
signal  until  it  is  close  to  the  input  signal.  A  PLL  operates  on  a  similar  principle,  except  that 
the  quantity  fed  back  and  compared  is  not  the  amplitude,  but  the  phase.  The  VCO  adjusts 
its  own  frequency  such  that  its  frequency  and  phase  can  track  those  of  the  input  signal.  At 
this  point,  the  two  signals  are  synchronous  (except  for  a  possible  difference  of  a  constant 
phase). 

The  voltage-controlled  oscillator  (VCO)  is  an  oscillator  whose  frequency  can  be  linearly 
controlled  by  an  input  voltage.  If  a  VCO  input  voltage  is  e(>(t),  its  output  is  a  sinusoid  with 
instantaneous  (angular)  frequency  given  by 

<y(r)  =  coc  +  ce0{t )  (4.30) 

where  c  is  a  constant  of  the  VCO  and  a >c  is  the  free-running  frequency  of  the  VCO  [when 
e„(t)  =  0].  The  multiplier  output  is  further  low-pass-filtered  by  the  loop  filter  and  then  applied 
to  the  input  of  the  VCO.  This  voltage  changes  the  frequency  of  the  oscillator  and  keeps  the 
loop  locked  by  forcing  the  VCO  output  to  track  the  phase  (and  hence  the  frequency)  of  the 
input  sinusoid. 
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Figure  4.23 

Equivalent 
circuits  of  a 
linearized  PLI 


where  c  and  B  are  constant  parameters  of  the  PLL. 

Let  the  incoming  signal  (input  to  the  PLL)  be  A  sin  [a)ct  +  0/(/)J.  If  the  incoming  signal 
happens  to  be  A  sin  [co()t  +  VKOL  it  can  still  be  expressed  as  A  sin  [coct  +  0, (r)],  where  0/(f)  = 
(co()  —  coc)t  +  \l/(t).  Hence,  the  analysis  that  follows  is  general  and  not  restricted  to  equal 
frequencies  of  the  incoming  signal  and  the  free-running  VCO  signal. 

The  multiplier  output  is 


If  the  VCO  output  is  B  cos  [coct  +  0o(t )],  then  its  instantaneous  frequency  is  coc  +  0o{t). 
Therefore, 

0o(t)  =ce0(t)  (4.31) 


AB  sin  (o)ct  +  ft)  cos  (coct  +  ft,)  =  y  [sin  (ft  -  0o)  +  sin  (2 coct  +  ft  +  0o)] 

The  sum  Irequency  term  is  suppressed  by  the  loop  filter.  Hence,  the  effective  input  to  the  loop 
filter  is  5 AB  sin  [ft(f)  —  ft>(/)].  If  h(t)  is  the  unit  impulse  response  of  the  loop  filter, 

eo(0  =  h(t)  *  -AB  sin  [ft(r)  -  ft,(r)] 

1  [• 

=  2AB  Jn  h(t  ~  sin  tft(^)  -  ft, U)]  dx  (4.32) 

Substituting  Eq.  (4.32)  into  Eq.  (4.31)  and  letting  K  =  •  CB  lead  to 

0o(t)  =  AK  f  h{t  -  x)  sin  ftOt)  dx  (4.33) 

J  0 

where  9e(t)  is  the  phase  error,  defined  as 


W0\ij 

h^Rg.^f  26b°nS  ,al0ng  W'th  Eq’  (4'3I)I  Immediate|y  suggest  a  model  for  the  PLL,  as  shown 

Different  loop  fi Iters  can  enable^he  PLL toc'aDt  ^  ^h'00*5  ^  ^  and  the  l0°P  ^  ^ 
of  frequency  variation.  On  the  other  hand  the  I  ^  ^  lnput  Slgnals  with  different  Wf 
frequency  variation.  °°P  ga,n  can  a^ect  the  range  of  the  trackable 


Small-Error  PLL  Analysis 

linear  (time-invariant)  system'showiTin  Fi^  27^^  ttf^  “  ^  4'26b  redUCCS  -°  ^ 

&  •  Straightforward  feedback  analysis  gives 


=  _  AKH(s) 
i(s)  1  +  TTakhiT) 


(4.34) 
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Figure  4.27 

Equivalent 
circuits  of  a 
linearized  PLL. 
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Therefore,  the  PLL  acts  as  a  filter  with  transfer  function  AKH(s)/[s  +  AKH(s)],  as  shown  in 
Fig.  4.27b.  The  error  0,(5)  is  given  by 


0,(5)  =  0/(5)  -  0O(5)  = 


®oC0  1 

0/W  J 


0/(0 


5 

s  +  AKH(s) 


0/(0 


(4.35) 


One  of  the  important  applications  of  the  PLL  is  in  the  acquisition  of  the  frequency  and  the 
phase  for  the  purpose  of  synchronization.  Let  the  incoming  signal  be  A  sin  (coot  +  (fio)-  We  wish 
to  generate  a  local  signal  of  frequency  cop  and  phase*  cpp.  Assuming  the  quiescent  frequency 
of  the  VCO  to  be  cot .,  the  incoming  signal  can  be  expressed  as  A  sin  [coct  +  0/(0],  where 


0/(0  =  (cop  ~  o>c)t  +  <po 


and 


coo  -  (*>c  <Po 

0/(5)  = - ^ —  +  — 

5Z  5 


Consider  the  special  case  of  H(s)  =  1.  Substituting  this  equation  into  Eq.  (4.35), 


5  [<do-o>c  .  W] 

0'<s)  =  +  7j 


(co0  —  coc)/AK  (cop  —  coc)/AK 
5  5  +  AK 


<PQ 


s  +  AK 


Hence, 


(4.36a) 


Observe  that 


con  —  (Or 

lim  6e{t)  =  ^  (4.36b) 

t-+o o  AK 

Hence,  after  the  transient  dies  (in  about  4/4A"  seconds),  the  phase  error  maintains  a  constant 
value  of  (coo  -  coc)/AK.  This  means  that  the  PLL  frequency  eventually  equals  the  incoming 


With  a  difference  tt/2. 
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frequency  a*).  There  is,  however,  a  constant  phase  error.  The  PLL  output  is 


B  cos 


Uuq/  +  <P0 


cop  -a>c~[ 

AK 


ror  a  secona-oraer  kul  using 


H(s)  = 
®e(s)  = 


s  +  a 


s  +  AKH(s) 

„2 


0/W 


(4.37a) 

(4.37b) 


s2  +  AK(s  +  a)  |  s 
The  final  value  theorem  directly  yields,4 


pno -a*  +  n j 


lim  0e (r)  =  lim  s0f(j)  =  0 

t-+00  5~ >0 


(4.38) 

In  this  case,  the  PLL  eventually  acquires  both  the  frequency  and  the  phase  of  the  incoming 
signal. 

1 1  si  ng  smal  1-error  analysis,  it  can  be  shown  that  a  first-order  loop  cannot  track  an  incoming 
signal  whose  instantaneous  frequency  varies  linearly  with  time.  Such  a  signal  can  be  tracked 
within  a  constant  phase  (constant  phase  error)  by  using  a  second-order  loop  [Eq.  (4.37)],  and 
it  can  be  tracked  with  zero  phase  error  by  using  a  third-order  loop.5 

It  must  be  remembered  that  the  preceding  analysis  assumes  a  linear  model,  which  is  valid 
only  when  9,(t)  <<  n/2.  This  means  the  frequencies  wo  and  a>c  must  be  very  close  for  this 
analysis  to  be  valid.  For  a  general  case,  one  must  use  the  nonlinear  model  in  Fig.  4.26b.  For 
such  an  analysis,  the  reader  is  referred  to  Viterbi,5  Gardner.6  or  Lindsey.7 

First-Order  Loop  Analysis 


Because  6e  =  9,  -  0(„ 


@o(t)  —  AK  sin  6e(t) 


0e  —  9j  —  AK  sin  9e(t) 


(4.39) 

signal  be  A  ^  ^  acquisition’  Let  the  incoming 

W  'et  the  VC0  have  a  quiescent  frequency  wc.  Hence, 


6i(t)  —  (wo  -  COc)t  +  (p0 


and 


°e  -  (wo  -  wf)  -  AK  sin  Ge (t)  (4.40) 

*  Actually  h(r)  =  2Bsinc  (2 nBl)  where  ft  k  k  t 

which  suppresses  the  high-frequency  signal  centeSdaTS  ‘S* '°?  ^  ™s  is  a  >°w-pass,  narrowband  filter. 

B  Hz.  at  2eoc.  This  makes  H(s)  =  |  over  a  low-pass  narrow  band  of 


Figure  4.28 

Trajectory  of  a 
first-order  PLL. 


4.8  Phase-Locked  Loop  and  Applications  217 


For  a  better  understanding  of  PLL  behavior,  we  use  Eq.  (4.40)  to  sketch  0e  vs.  0e. 
Equation  (4.40)  shows  that  9e  is  a  vertically  shifted  sinusoid,  as  shown  in  Fig.  4.28.  To  satisfy 
Eq.  (4.40),  the  loop  operation  must  stay  along  the  sinusoidal  trajectory  shown  in  Fig.  4.28.  When 
9e  =  0,  the  system  is  in  equilibrium,  because  at  these  points,  0e  stops  varying  with  time.  Thus 
0e  =  9\,  02  ,  03,  and  04  are  all  equilibrium  points. 

If  the  initial  phase  error  9e( 0)  =  9e o  (Fig.  4.28),  then  9e  corresponding  to  this  value  of  6e 
is  negative.  Hence,  the  phase  error  will  start  decreasing  along  the  sinusoidal  trajectory  until  it 
reaches  the  value  03 ,  where  equilibrium  is  attained.  Hence,  in  steady  state,  the  phase  error  is  a 
constant  03.  This  means  the  loop  is  in  frequency  lock;  that  is,  the  VCO  frequency  is  now  coo, 
but  there  is  a  phase  error  of  #3.  Note,  however,  that  if  |&>o  —  coc\  >  A  A\  there  are  no  equilibrium 
points  in  Fig.  4.28,  the  loop  never  achieves  lock,  and  0e  continues  to  move  along  the  trajectory 
forever.  Hence,  this  simple  loop  can  achieve  phase  lock  provided  the  incoming  frequency  coo 
does  not  differ  from  the  quiescent  VCO  frequency  coc  by  more  than  AK. 

In  Fig.  4.28,  several  equilibrium  points  exist.  Half  of  these  points,  however,  are  unsta¬ 
ble  equilibrium  points,  meaning  that  a  slight  perturbation  in  the  system  state  will  move  the 
operating  point  farther  away  from  these  equilibrium  points.  Points  9\  and  03  are  stable  points 
because  any  small  perturbation  in  the  system  state  will  tend  to  bring  it  back  to  these  points. 
Consider,  for  example,  the  point  03.  If  the  state  is  perturbed  along  the  trajectory  toward  the 
right,  0e  is  negative,  which  tends  to  reduce  9e  and  bring  it  back  to  G3.  If  the  operating  point 
is  perturbed  from  63  toward  the  left,  9e  is  positive,  0e  will  tend  to  increase,  and  the  operating 
point  will  return  to  63.  On  the  other  hand,  at  point  02  if  the  point  is  perturbed  toward  the  right, 
be  is  positive,  and  0e  will  increase  until  it  reaches  03.  Similarly,  if  at  02  the  operating  point  is 
perturbed  toward  the  left,  9e  is  negative,  and  0e  will  decrease  until  it  reaches  9\.  Hence,  02  is 
an  unstable  equilibrium  point.  The  slightest  disturbance,  such  as  noise,  will  dislocate  it  either 
to  6\  or  to  63.  In  a  similar  way,  we  can  show  that  04  is  an  unstable  point  and  that  9\  is  a  stable 
equilibrium  point. 

The  equilibrium  point  03  occurs  where  9e  =  0.  Hence,  from  Eq.  (4.40), 


03  =  sin 


cop  —  coc 

AK 


If  03  <&  7t/2,  then 

coo- co c 

03  — - 

3  AK 

which  agrees  with  our  previous  result  of  the  small-error  analysis  [Eq.  (4.36b)]. 

The  first-order  loop  suffers  from  the  fact  that  it  has  a  constant  phase  error.  Moreover,  it 
can  acquire  frequency  lock  only  if  the  incoming  frequency  and  the  VCO  quiescent  frequency 
differ  by  not  more  than  AK  rad/s.  Higher  order  loops  overcome  these  disadvantages,  but  they 
create  a  new  problem  of  stability.  More  detailed  analysis  can  be  found  in  the  book  by  Gardner.6 


2 1 8  AMPLITUDE  MODULATIONS  AND  DEMODULATIONS 


Figure  4.29 

Generation  of 
coherent 
demodulation 
carrier  using 
signal  squaring 


m(t)  cos  cd(./ 
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PLL 
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Generalization  of  PLL  Behaviors 

To  generalize,  suppose  that  the  loop  is  locked ,  meaning  that  the  frequencies  of  both  the  input 
and  the  output  sinusoids  are  identical.  The  two  signals  are  said  to  be  mutually  phase  coherent 
or  in  phase  lock.  The  VCO  thus  tracks  the  frequency  and  the  phase  of  the  incoming  signal.  A 
PLL  can  track  the  incoming  frequency  only  over  a  finite  range  of  frequency  shift.  This  range 
is  called  the  hold-in  or  lock  range.  Moreover,  if  initially  the  input  and  output  frequencies  are 
not  close  enough,  the  loop  may  not  acquire  lock.  The  frequency  range  over  which  the  input 
will  cause  in  the  loop  to  lock  is  called  the  pull-in  or  capture  range.  Also  if  the  input  frequency 
changes  too  rapidly,  the  loop  may  not  lock. 

II  the  input  sinusoid  is  noisy,  the  PLL  not  only  tracks  the  sinusoid,  but  also  cleans  it  up.  The 
PLL  can  also  be  used  as  a  frequency  modulation  (FM)  demodulator  and  frequency  synthesizer, 
as  shown  later  in  the  next  chapter.  Frequency  multipliers  and  dividers  can  also  be  built  using 
I  LL.  The  PLL,  being  a  relatively  inexpensive  integrated  circuit,  has  become  one  of  the  most 
frequently  used  communication  circuits. 


In  space  vehicles,  because  ot  the  Doppler  shift  and  oscillator  drift,  the  frequency  of  the 
received  signal  has  much  uncertainty.  The  Doppler  shift  of  the  carrier  itself  could  be  as  high 
as  ±75  kHz,  whereas  the  desired  modulated  signal  band  may  be  just  10  Hz.  To  receive  such  a 
signal  by  conventional  receivers  would  require  a  filter  of  bandwidth  150  kHz,  when  the  desired 
signal  has  a  bandwidth  of  only  10  Hz.  This  would  cause  an  undesirable  increase  (by  a  factor 
ot  1 5,000)  in  the  noise  received  because  the  noise  power  is  proportional  to  the  bandwidth.  The 

J  Prov^s  convenient  here  because  it  tracks  the  received  frequency  continuously,  and  the 
filter  bandwidth  required  is  only  10  Hz. 


Carrier  Acquisition  in  DSB-SC 


We  shall  now  discuss  two  methods  of  carrier  regeneration  using  PLL 
signal  squaring  and  the  Costas  loop. 


at  the  receiver  in  DSB-SC: 


Figure  4.30 

Costas 

phase-locked 
loop  for  the 
generation  of  a 
coherent 
demodulation 
carrier. 


signal  iss^uamdTnHH  ,  T  °UlUne  0t  th,S  scheme  is  g^en  Fig.  4.29.  The  incoming 

rzrrnpassrgh  a  narr°w  (high-e)  bandPass  fiiter  tuned  *>  The 

is  applied  to  a  PLL  to  ,h  sinuso‘  LOS  with  some  residual  unwanted  signal.  This  signal 
Ihroul  a  2  I  L  ^  SmUSoid  of  twice  carrier  frequency  which  is  passed 

with  the  incomingTam'er.The^ analysis 'is  frequency  synchronism 

>  raightforward.  The  squarer  output  x (t)  is 


*(0  =  [w(/)COS  Q)ct]2 


Now  m2(t)  is  a  nonnegative  signal, 
to  w(f),  which  generally  has  a  zero 
component  of  m2(t)/ 2,  be  k.  We  can 


and  therefore  has  a  nonzero  average  value  [in  contrast 
average  value].  Let  the  average  value,  which  is  the  dc 
now  express  m2(t)/ 2  as 


(0  =  k  +  (f)(1) 
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Figure  4.30 

Costas 

phase-locked 
loop  for  the 
generation  of  a 
coherent 
demodulation 
carrier. 


where  </>(f)  is  a  zero  mean  baseband  signal  [ m2(t)/2  minus  its  dc  component].  Thus, 


1  ,  1  , 

x (t)  =  -ffr(f)  +  -wr(f)cos  2 (oct 
1  , 

=  -wr(f)  +  k  cos  2 wct  +  cos  2 coct. 


The  bandpass  filter  is  a  narrowband  (high-0  filter  tuned  to  frequency  2coc.  It  completely  sup¬ 
presses  the  signal  m2(f),  whose  spectrum  is  centered  at  u>  =  0.  It  also  suppresses  most  of  the 
signal  4>{t)  cos  2 a>ct.  This  is  because  although  this  signal  spectrum  is  centered  at  2coc,  it  has 
zero  (infinitesimal)  power  at  2coc  since  <p(t)  has  a  zero  dc  value.  Moreover  this  component 
is  distributed  over  the  band  of  4 B  Hz  centered  at  2coc.  Hence,  very  little  of  this  signal  passes 
through  the  narrowband  filter.*  In  contrast,  the  spectrum  of  k  cos  2 coct  consists  of  impulses 
located  at  ±2 coc.  Hence,  all  its  power  is  concentrated  at  2coc  and  will  pass  through.  Thus, 
the  filter  output  is  k  cos  2 coct  plus  a  small  undesired  residue  from  4>(t)  cos  2wct.  This  residue 
can  be  suppressed  by  using  a  PLL,  which  tracks  k  cos  2 coct.  The  PLL  output,  after  pass¬ 
ing  through  a  2: 1  frequency  divider,  yields  the  desired  carrier.  One  qualification  is  in  order. 
Because  the  incoming  signal  sign  is  lost  in  the  squarer,  we  have  sign  ambiguity  (or  phase 
ambiguity  of  n)  in  the  carrier  generated.  This  is  immaterial  for  analog  signals.  For  a  digital 
baseband  signal,  however,  the  carrier  sign  is  essential,  and  this  method,  therefore,  must  be 
modified. 


Costas  Loop:  Yet  another  scheme  for  generating  a  local  carrier,  proposed  by  Costas,8 
is  shown  in  Fig.  4.30.  The  incoming  signal  is  m(t)  cos  (coct  +  0,-).  At  the  receiver,  a 
VCO  generates  the  carrier  cos  (a>ct  +  0„).  The  phase  error  is  Qe  =  6,  -  60.  Various  sig¬ 
nals  are  indicated  in  Fig.  4.30.  The  two  low-pass  filters  suppress  high-frequency  terms  to 
yield  m(t)  cos  0,  and  m(t)  sin  0„  respectively.  These  outputs  are  further  multiplied  to  give 
m2(t)  sin  26 e.  When  this  is  passed  through  a  narrowband,  low-pass  filter,  the  output  is  R  sin  20„ 
where  R  is  the  dc  component  of  m2(t)/2.  The  signal  R  sin  20,  is  applied  to  the  input  of 
a  VCO  with  quiescent  frequency  a>c.  The  input  R  sin  20,  increases  the  output  frequency. 


*  This  will  also  explain  why  we  cannot  extract  the  carrier  directly  from  m{t)  cos  coct  by  passing  it  through  a 
narrowband  filter  centered  at  a),.  The  reason  is  that  the  power  of  m(l )  cos  ioct  at  eoc  is  zero  because  m(t)  has  no  dc 
component  |the  average  value  of  m( I)  is  zero). 
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which,  in  turn,  reduces  6e.  This  mechanism  was  fully  discussed  earlier  in  connection  with 
Fig.  4.26. 

Carrier  Acquisition  in  SSB-SC 

For  the  purpose  of  synchronization  at  the  SSB  receiver,  one  may  use  highly  stable  crystal 
oscillators,  with  crystals  cut  for  the  same  frequency  at  the  transmitter  and  the  receiver.  At 
very  high  frequencies,  where  even  quartz  crystals  may  not  give  adequate  performance,  a  pilot 
carrier  may  be  transmitted.  These  are  the  same  methods  used  for  DSB-SC.  However,  the 
received-signal  squaring  technique  as  well  as  the  Costas  loop  used  in  DSB-SC  cannot  be  used 
for  SSB-SC.  This  can  be  seen  by  expressing  the  SSB  signal  as 

V>ssb(0  =  m(t)  cos  wct  m/,(f)  sin  coct 
=  £(/)cos[avf  +  0(f)] 


where 


E(t)  =  JmHt)  +  mj;(f) 
__ i  f  ±w/,(/)' 


Squaring  this  signal  yields 


0(f)  =  tan" 


m(t) 


IPssbM  =  £‘(0  cos2  [coct  +  0(f)] 

£2(f) 

=  I  *  +  cos  [2a>cf  +  26>(r)] } 

The  signal  E2( 0  is  eliminated  by  a  bandpass  filter.  Unfortunately,  the  remaining  signal  is  not  a 
pure  sinusoid  of  frequency  2wc  (as  was  the  case  for  DSB).  There  is  nothing  we  can  do  to  remove 
the  tnne-vary.ng  phase  20(0  from  this  sinusoid.  Hence,  for  SSB.  the  squaring  technique  does 
not  work.  The  same  argument  can  be  used  to  show  that  the  Costas  loop  will  not  work  either. 
I  hese  conclusions  also  apply  to  VSB  signals. 

4.9  NTSC  TELEVISION  BROADCASTING  SYSTEM 

taa  tag  history;  As  !^ly  aT  1928UC  ' FfcTT”'  T"  Am,l°S  "  broadCaS'i0? 

broadcast in‘>  The  ntsp  iw  ,•  ,  I  ,  E  Jenkins  launched  his  station  W3XK  for  television 

black-and-white  TV  bn  1 1  V'01  e  evisi0n  System  Committee)  first  issued  its  standard  for 

1,8  ",  1941  and  i,S  NTSC  standard  in  1953.  Althougl- 

of  how  ihe  NTSC  system  functions  syslen"n  lhe  nexl  chapter,  from  a  good  understanding 

The  ,iSUal  imo?es  by  el“trical  signals, 

as  picture  elements  (or  pixels).  A  lan>  3  ”mes“b?,vided  into  several  small  squares,  known 
clearer  reproduction  (better  resolin'  nU[”  Cf  (d  P'cture  elements  in  a  given  image  means 
entire  picture  is  ^  Fi*'  43  0-  The  information  of  the 

level  of  the  pixels  taken  in  a  certain  an  ®lectric  s,gnal  proportional  to  the  brightness 

"  a  certain  sequence.  We  start  from  the  upper  left-hand  corner  with 
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(a)  (b) 


Figure  4.32 

Scanning  pattern 
(raster). 


element  number  1  and  scan  the  first  row  of  pixels  (Fig.  4.32).  Then  we  come  back  to  the  start 
of  the  second  row,  scan  the  second  row,  and  continue  this  way  until  we  have  finished  the  last 
row.  The  electric  signal  thus  generated  during  the  entire  scanning  interval  has  the  information 
contained  in  the  picture. 

The  image  is  furnished  by  the  television  camera  tube.  There  exist  a  variety  of  camera 
tubes.  The  image  orthicon  is  one  example.  In  this  tube,  the  optical  system  generates  a  focused 
image  on  a  photo  cathode,  which  eventually  produces  an  electrically  charged  image  on  another 
surface,  known  as  the  target  mosaic.  What  this  means  is  that  every  point  on  the  target  mosaic 
surface  acquires  a  positive  electric  charge  proportional  to  the  brightness  of  the  image.  Thus, 
instead  of  a  light  image,  we  have  a  charge  image.  An  electron  gun  now  scans  the  target  mosaic 
surface  with  an  electron  beam  in  the  manner  shown  in  Fig.  4.32.  The  beam  is  controlled  by 
a  set  of  voltages  across  horizontal  and  vertical  deflection  plates.  Periodic  sawtooth  signals 
(Fig.  4.33)  are  applied  to  these  plates.  The  beam  scans  the  horizontal  line  1-2  in  53.5  fxs  and 
quickly  flies  back,  in  10  /z s,  to  the  left  to  point  3  and  scans  line  3-4.  and  so  on.  On  the  target 
mosaic,  where  there  is  a  high  positive  charge  (corresponding  to  a  higher  brightness  level),  more 
electrons  from  the  beam  will  be  absorbed,  and  the  return  beam  will  have  fewer  electrons,  giving 
a  smaller  current.  Areas  corresponding  to  darker  elements  (less  positive  charge)  will  return 
a  large  current.  The  scanning  lines  are  not  perfectly  horizontal  but  have  a  small  downward 
slope,  since  during  the  horizontal  deflection  the  beam  is  also  continuously  deflected  downward 
owing  to  a  slower  vertical  deflection  signal  (Fig.  4.33b).  When  all  the  horizontal  lines  have 
been  scanned,  the  vertical  deflection  signal  goes  to  zero,  which  means  the  beam  goes  back  to 
point  I  again  and  is  ready  to  start  the  next  frame. 

Scanning  is  continuous  at  a  rate  of  60  picture  frames  per  second.  The  electrical  signal  thus 
generated  is  a  video  signal  corresponding  to  the  visual  image.  With  some  modifications  (to  be 
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Figure  4.33 

(a)  Horizontal 
deflection  signal. 

(b)  Vertical 
deflection  signal. 


53.5  /is4* - ►!  I-*— 10  /is 


Ar 


(a) 


discussed  later),  the  video  signal  modulates  the  video  carrier  of  frequency  fc  in  the  form  of 
VSB-AM  (Fig.  4.24).  This  video  earner  is  also  transmitted  along  with  the  frequency-modulated 
audio  earner  of  frequency/**  which  is  4.5  MHz  higher  than  the  video  carrier  frequency /c,  that 
is,  fa  =/c  +  4.5  MHz. 

The  receiver  is  similar  to  an  oscilloscope.  An  electron  gun  with  horizontal  and  vertical 
deflection  plates  generates  an  electron  beam  that  scans  the  screen  exactly  in  the  same  pattern 
and  in  synchronism  with  the  scanning  at  the  transmitter.  When  the  electron  beam  flies  back 
horizontally  after  completing  each  horizontal  line,  it  will  leave  an  unwanted  flyback  trace  on 
the  screen  unless  a  blanking  pulse,  known  as  the  horizontal  blanking  pulse,  is  added  during  the 
flyback  interval  at  the  end  of  each  horizontal  sweep.  Similarly,  a  vertical  blanking  pulse  is  added 
at  the  end  of  each  vertical  sweep  to  eliminate  the  unwanted  vertical  retrace.  These  blanking 
pulses  are  added  at  the  transmitter  itself.  We  also  need  to  add  scan-synchronization  information 
at  the  transmitter.  This  is  done  by  adding  a  large  pulse  to  each  blanking  pulse.  A  typical  video 
signal  is  shown  in  Fig.  4.34.  It  is  evident  that  over  the  entire  flyback  interval,  the  blanking 
pulse  (at  the  black  level)  will  eliminate  the  trace.  Similarly,  vertical  blanking  and  synchronizing 
pulses,  which  are  much  wider  than  the  corresponding  horizontal  pulses,  are  added  to  the  video 
signal  at  the  end  ot  each  vertical  sweep.  The  video  signal  now  VSB-modulates  the  carrier.  We 
also  add  a  earner  at  this  point  (see  Fig.  4.24).  This  VSB+C  signal  is  transmitted  along  with  the 
frequency-modulated  audio  signal.  The  transmitter  block  diagram  is  shown  in  Fig.  4.35a,  the 
receiver  block  diagram  in  Fig.  4.35b.  This  is  a  superheterodyne  receiver  (see  Example  4.2).  The 

7*2l  ,7g:  SUPerhe'er0dyne  receiver  wiU  be  d'scussed  later  in  Sec.  5.6.  The  converter 

T're  SpeCtmm  (vide°  as  we"  as  frequency-modulated  audio)  to  the  IF 

12  e  v  InT/fn ?  "OW  a7  d  3nd  enve>°Pe-detected.  The  audio  signal  is  still  of  the 

X sT/naTh  mnl^7s  W'  H  *  ^  °f  «  MHz  h  is  «P««ed  and  demodulated.  The 
ideo  signal  is  amplified.  Synchronizing  pulses  are  separated  and  applied  to  the  vertical  and 

and  then  applied  tol'hTpilt'uremb^0  S'8na'  ,SC'amped  to  the  blanking  P^ses  (dc  restoration) 

Bandwidth  Considerations 

sc“c  rndard  is  495  per  frame- The  ume  required 

Hence,  each  frame  is  considered  to  have  a  totalTfl^r0  reqU'red  f°r  3°  honZOntal 

Images  must  be  transmitted  in  a  rapid  sucSon  Iff  ^  °f  WhiCh  °n'y  495  “*  „ 

pia  succession  of  frames  to  create  the  illusion  of  continuity 

*  In  Europe,  a  total  of  625  horizontal  lines  is  used. 
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Figure  4.34 

Television  video 
signal. 
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Figure  4.35 

(a)  Television 
transmitter,  (b) 
Television 
receiver. 


(a) 
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and  to  avoid  the  flicker  and  jerky  motion  seen  in  old  movies  of  the  Charlie  Chaplin  era.  Because 
of  the  retinal  property  of  retaining  an  image  for  a  brief  period  even  after  the  object  is  removed, 
it  is  necessary  to  transmit  about  40  images,  or  frames,  per  second.  In  television  we  transmit 
only  30  frames  per  second  to  conserve  bandwidth.  To  eliminate  the  flicker  effect  caused  by 
the  low  frame  rate,  the  495  lines  are  scanned  in  two  successive  patterns.  In  the  first  scanning 
pattern,  called  the  first  field,  the  entire  image  is  scanned  using  only  247.5  lines  (solid  lines 
shown  in  Fig.  4.32).  In  the  second  scanning  pattern,  or  second  field,  the  image  is  scanned  again 
by  using  247.5  lines  interlaced  between  lines  of  the  first  field  (shown  dashed  in  Fig.  4.32).  The 
two  fields  together  constitute  a  complete  image,  or  frame.  Thus,  in  reality  there  are  only  30 
complete  frames  per  second,  and  a  total  equivalent  of  525  x  525  x  30  =  8.27  x  K)6  pixels 
per  second.*  We  can  estimate  the  transmission  bandwidth  of  a  video  signal  by  observing  that 
transmitting  a  video  signal  amounts  to  transmitting  8.27  x  1 06  pieces  of  information  (or  pulses) 
per  second.  Hence,  the  theoretical  bandwidth  required  is  half  this,  namely,  4.135  MHz  (see 


Sec.  6. 1.3). 


Video  Spectrum 

To  begin,  consider  a  simple  case  of  transmission  of  a  still  image.  The  scanning  procedure 
discussed  earlier  is  equivalent  to  scanning  an  array  of  the  same  image  repeating  itself  in  both 


dimensions,  as  shown  in  Fig.  4.36a.  The  brightness  level  b  for  this  figure  is  a  function  of  x 


(horizontal)  and  y  (vertical)  and  can  be  expressed  as  b( x,  y).  Because  the  picture  repeats  in 
the  .v  as  well  as  the  y  dimension,  b(x,  y)  is  a  periodic  function  of  both  x  and  y,  with  periods  of 
a  and  p,  respectively.  Hence,  b(x,  y)  can  be  represented  by  a  two-dimensional  Fourier  series 
with  fundamental  frequencies  In/a  and  2jt//5,  respectively, 


(4.41a) 


II  the  scanning  beam  moves  with  a  velocity  vx  and  vv  in  the  x  and  y  directions,  respectively, 
then  x  =  vxt  and  y  =  vyt,  and  the  video  signal  e(t)  is 


(4.41b) 


But  a/vx  is  the  time  required  to  scan  one  horizontal  line,  and  /$/vy  is  the  time  required  to  scan 
the  complete  image. 


or 


Vx  30(525) 


30 


and 


00  00 


e(t)~  H  H  Bm«exp[/27T(  15,750m  +  30n)f]  (4.41c) 
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Figure  4.36 

(a)  Model  for 
scanning  process 
using  doubly 
periodic  image 
fields. 

(b)  Spectrum  of 
the  monochrome 
video  signal. 


A 


(a) 


The  video  signal  is  periodic  with  fundamentals  //,  =  15.75  kHz  (horizontal-sweep  frequency) 
and /,.  =  30  Hz.  The  harmonics  are  spaced  at  15.75  kHz  intervals,  and  around  each  harmonic 
is  clustered  a  satellite  of  harmonics  30  Hz  apart,  as  shown  in  Fig.  4.36b. 

This  spectrum  was  derived  for  still-picture  transmission.  When  motion  or  change  occurs 
from  frame  to  frame,  b{x,  y)  will  not  be  periodic,  and  the  spectrum  will  not  be  a  line  spec¬ 
trum,  but  will  have  spreading  or  smearing.  But  empty  spaces  still  exist  between  harmonics  of 
fh  (15.75  kHz).  We  take  advantage  of  these  gaps  to  transmit  the  additional  information  of  a 
color  television  signal  over  the  same  bandwidth. 

The  FCC  allows  a  6  MHz  bandwidth  for  television  broadcasting,  with  the  frequency 
allocations  as  shown  in  Table  4.1. 

Compatible  Color  Television  (CCTV) 

All  colors  can  be  synthesized  by  mixing  the  three  primary  colors— blue,  yellow,  and  red— in 
the  right  amounts.  In  television,  blue,  green  (the  combination  ot  blue  and  yellow),  and  red 
are  used  instead  for  the  practical  reason  of  the  availability  of  phosphors  that  glow  with  these 
colors  when  excited  by  an  electron  beam. 
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TABLE  4.1 

TV  Channel  Frequency  Assignments  in 


the  United  States 

Channel  Number 

Frequency  Band,  (MHz) 

VHF  2,  3,  4 

54-72 

VHF  5,  6 

76-88 

VHF  7-13 

174-216 

UHF  14-83 

470-890 

In  color  television  cameras,  the  optical  system  resolves  the  image  into  three  primary 
color  (red,  green,  and  blue)  images.  A  set  of  three  camera  tubes  produces  three  video  signals 
mr(t),  nig(t),  and  mb(t)  from  these  images.  We  could  transmit  the  three  video  signals  and 
synthesize  the  color  image  at  the  receiver  from  the  three  signals.  This,  however,  causes  two 
difficulties.  It  requires  three  times  as  much  bandwidth  as  that  of  monochrome  (black-and-white) 
television,  and,  second,  it  is  not  compatible  with  the  existing  monochrome  system  because  a 
monochrome  television  will  receive  only  one  of  the  primary  colors. 

These  problems  are  solved  by  using  signal  matrixing.  The  information  about  mr(t),mg(t), 
and  mb(i)  can  be  transmitted  by  three  signals,  each  being  a  linear  combination  of  mr(t),  mg(t ), 
and  mb(t),  provided  the  three  combinations  are  linearly  independent.  Thus,  we  can  transmit 
the  signals  mL(t),  /«/(/),  and  mQ(t)  given  by 

mL(t)  =  0.30wr(r)  +  0.59  mg(t)  +  0.1 1  mb(t) 
m,(t)  =  0.60 mr(t)  +  0.28 mg(t)  -  0.32 mb(t) 
mQ(t)  =  0.21mr(r)  -  0.52mg(/)  +  0.3 \mb(t) 

Signals  mr(t),  mg(t),  and  mb(t)  are  normalized  to  a  maximum  value  of  1  so  that  the  amplitude 
of  each  of  these  signals  lies  in  the  range  of  0  to  1 .  Hence,  mL(t)  is  always  positive,  whereas 
ni/d)  and  wy(r)  are  bipolar.  The  signal  mt(/)  is  known  as  the  luminance  signal  because  it  has 
been  lound  that  this  particular  combination  of  the  three  primary  color  signals  closely  matches 
the  luminance  ot  the  conventional  monochrome  video  signal.  Hence,  a  black-and-white  set 
need  use  only  this  signal  for  its  operation. 

I  Ik  signals  m/(r)  and  m@(f)  are  known  as  the  chrominance  signals.*  We  could  have 
chosen  some  other  combinations  instead  of  m,(t )  and  mQ(t).  But  these  particular  combinations 
are  chosen  because  they  use  certain  features  of  human  color  vision  efficiently,9  as  explained 


irinsmitt  -  i»  .  "d,Kl  a"a  dominance  Signals:  The  luminance  signal  mL( 0  ,s 

.  ,S  a  ™°"°C  rome  vldeo  S12nal  occupying  a  bandwidth  of  4.2  MHz.  The  chromi- 
c  ce  Signa  s  m/U  and  mQ(i)  also  have  the  same  bandwidth  (namely  4  2  MHz  each), 
objective  tests  have  shown,  however,  that  the  human  eye  does  not  perceive  changes  in 

a^rTuctf^lred?yelbwmgreen,nblue,eor^ny,collor  jlf  bet”  ^  Oration  of  colors.  Hue  refers  to  die 

the  color.  For  example,  a  deep  red  has  100%"  saturation  chWeen'  Sa,uratlon- or  color  intensity,  refers  to  the  punty  o 

lesser  amount  of  sa.ur-uion  s  ,  satu™t>t^butpmk^hich  is  a  dilution  of  re(J  with  white— will  have  a 

lesser  amount  or  saturation.  Saturation  is  given  bv  +  2  . , 

K  .  u  DyVm/(/)+Vf)’and  Misgiven  by  an  angle 

tan  Lmg(r)/m/(r)].  Each  color  has  a  certain  hue.  or  anplp  Prxr  0  .  ,Qo 

136°,  and  242°,  respectively.  ?  •  or  example,  red,  blue,  and  green  are  at  angles  of  19  • 
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Figure  4.37 

Multiplexing 
luminance  and 
chrominance 
signals. 


chrominance  (hue  and  saturation)  over  smaller  areas.  This  means  we  can  cut  out  high-frequency 
components  without  affecting  the  quality  of  the  picture,  since  the  eye  would  not  have  perceived 
them  anyway.  This  enables  us  to  limit  the  band  widths  of  the  m/(t)  and  mQ  ( t )  to  1 .6  and  0.6  MHz, 
respectively.  The  signal  mj  (/)  is  further  split  into  two  components,  mjn  (0  and  mj  ( t )  -  m/n  (t). 
The  high-frequency  component  m///  ( t )  consists  of  the  components  of  mj  ( t )  in  the  range  of 
0.6  to  1.6  MHz.  The  remaining  low-frequency  component  ra/(/)  —  m/nU)  consists  of  all  the 
spectral  components  of  m/(t)  in  the  range  of  0  to  0.6  MHz.  Signals  mQ(t)  and  m/(t)  —  m/u(t) 
are  sent  by  QAM,  whereas  rn/nit)  is  sent  by  LSB  (Figs.  4.37  and  4.38).  The  subcarrier  has 
frequency*  fcc  =  3.583125  MHz.  These  spectra  are  generated  as  shown  in  Fig.  4.37.  We 
generate  the  DSB-SC  spectrum  of  mj(t).  This  spectrum  is  the  sum  of  the  DSB-SC  spectra 
of  both  its  components  mjnit)  and  m/(t)  —  w///(f).  This  spectrum  occupies  a  band  of  2  to 
5.2  MHz.  However,  the  bandpass  filter  (2-4.2  MHz)  suppresses  the  USB  portion  of  miuit), 
leaving  only  the  LSB  spectrum  of  mjnit)  (Fig.  4.38).  We  still  have  the  DSB-SC  spectrum  for 
m /(t)  —  m/nit).  Thus,  xj(t)  consists  of  an  LSB  for/w///(/)  and  a  DSB-SC  for  m/(t)  —  m/n(t), 
and  can  be  expressed  as 

x,(t)  =  [mj(t)  -  miH(t)]  cos  a)cct  +  mJH (t)  cos  (occt  +  m/Hh(t)  sin  cocct 

' - V- - '  S - V - - 

DSB(QAM)  for  m/(/)— m///(/)  LSB  for 

=  mj{t)  cos  (i)cct  +  miHh(t)  sin  (occt  (4.42) 

Moreover,  the  signal  xqU)  =  mQ(t)  sin  cocc(t).  Hence,  the  composite  multiplexed  signal 
mc(t )  is 

mc(t)  =  mL{t)  +  mQ(t)  sin  (Dcct  +  m/(t)  cos  cocct  +  m/Hh(t)  sin  cocct 


*  fee  =  227.5//,  =  227.5  x  15.75  kHz  =  3.583125  MHz.  Thus,/cc  lies  midway  between  227//,  and  228//,.  This 
causes  the  chrominance  signals’  spectra  to  be  shitted  to  gaps  midway  between  harmonics  of//,  (Fig.  4.38d).  In 
practice, Z-c  is  made  slightly  smaller  than  227.5//,  (fee  =  3.579545  MHz)  to  avoid  an  objectionable  beat  frequency 
with  the  audio  carrier,^  which  lies  4.5  MHz  above  the  picture  earner.  Because/,  =Zc/227.5,  //,  =  15.7326  kHz, 
and  the  field  repetition  frequency  is  actually  59.94  rather  than  60. 
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Figure  4.38 

(a)  Band 
occupied  by 

(b)  Band 
occupied  by 
m^(/)sin  o)cct- 

(c)  Bands 
occupied  by  LSB 
of  m ///(/)  and 
DSB  of 

m/(t)  -  miH(t). 

(d)  Interleaving 
of  the 

chrominance 
and  luminance 
signal  spectra. 
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Spectrum  of 
mg(t)  sin  c occt 
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Figure  4.39 

Temporal  and 
spatial  phase 
reversals  of 
chrominance 
signals:  solid 
lines,  first  field; 
dashed  lines, 
second  field. 
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In  addition,  a  sample  of  the  subcarrier  (color  burst)  is  added  to  this  multiplexed  signal  for 
frequency  and  phase  synchronization  of  the  locally  generated  subcarrier  at  the  receiver.  The 
color  but  st  is  added  on  the  trailing  edge  of  the  horizontal  blanking  pulse.  This  composite  video 
signal  is  now  sent  by  VSB+C,  as  discussed  in  Sec.  4.5. 

I  he  Receiver:  Because  the  CCTV  system  is  required  to  be  compatible  with  monochrome 
receivers,  let  us  see  what  happens  if  we  apply  the  signal  mv(t)  to  a  monochrome  receiver.  It  may 
seem  necessary  to  remove  the  chrominance  signals  from  mv(/)  before  applying  the  signal  to  the 
picture  tube.  Fortunately,  this  step  is  not  necessary,  because  the  interference  of  the  chrominance 
signals  with  the  luminance  signal,  although  present  on  the  screen,  is  practically  invisible  to 
t  e  unian  eye.  This  happens  because  of  the  way  chrominance  signals  are  interlaced  in  the 
frequency  domain  and  because  of  the  persistence  of  human  vision  that  tends  to  average  out 
brightness  over  time  as  well  as  space. 

The  chrominance  signal  is  superimposed  on  the  luminance  signal.  Figure  4.39  shows  the 
nature  of  the  chrominance  signals.  Recall  that  <occ  =  227.5a,,,.  During  the  traversal  of  one 

r  !  W;H  be  227  5  chrominan^  signal  cycles.  Hence,  the  chrominance  signal 
changes  continuously  from  positive  to  negative,  and  vice  versa,  in  the  horizontal  direction. 
In  addition  because  there  are  227.5  cycles  in  one  line,  if  a  chrominance  signal  begins  with  a 
positive  cycle  at  the  start  of  a  line,  it  will  end  with  a  positive  cycle  at  the  end  of  the  line.  The  next 
horizontal  line  will  begin  with  a  negative  cycle  of  the  chrominance  signal  (Fig  4  39).  Hence, 
m  any  given  frame,  the  chrominance  signal  not  only  reverses  its  phase  along  the  horizontal 

tet  „«  Tdo  "S  Ph“Ksal10"* ,he  O',  direction  (on  ihe  nex.  horieonl.1 

"nel.  Bu,  th,S  ,x  no,  all  Dunng  one  held,  the  ch,onti„,„ce  sig„„l  completes  227.5  x  525 
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Figure  4.39 

Temporal  and 


cycles,  and  it  returns  to  a  given  spot  during  the  next  field  with  opposite  polarity.  Hence,  the 
chrominance  signals  reverse  phase  spatially  (in  the  vertical  and  horizontal  directions)  as  well 
as  temporally  at  any  given  spot.  Because  the  human  eye  is  not  sensitive  to  rapid  variations  in 
time  or  space,  it  can  notice  only  space  and  time  averages.  This  makes  the  chrominance  signals 
practically  invisible  to  the  human  eye.  Thus  the  color  signal  is  compatible  with  an  unmodified 
monochrome  receiver. 

Demultiplexing:  In  a  color  receiver  the  received  signal  is  demodulated  exactly  as  in 
the  monochrome  case.  This  yields  my(t).  This  signal  must  now  be  demultiplexed  to  separate 
niL(t),  mj(t ),  and  niQ(t).  The  demultiplexing  is  shown  in  Fig.  4.40.  The  output  of  the  4.2  MHz 
filter  contains  AM^(r),  as  well  as  modulated  m/  ( t )  and  niQ(t)  (Fig.  4.38).  Because  of  the  frequency 
interlacing  discussed  earlier,  however,  these  signals  are  practically  invisible.  Hence,  the  output 
of  the  4.2  MHz  filter  serves  the  function  of  mi(t).  Next  we  use  carriers  in  phase  quadrature 
to  demodulate  mv(t).  To  determine  the  various  signals  in  Fig.  4.40,  we  observe  that  the  signal 
z(t)  in  Fig.  4.40  consists  of  modulated  w/(r)  and  niQ(t ),  plus  the  part  of  m^it)  in  the  band  of 
2  to  4.2  MHz.  Let  us  denote  this  high-frequency  component  of  niL(t)  by  muj(t).  Then, 

z(t)  =  mLH{t)  +  mQ(t)  sin  cocct  +  m/(t)  cos  cocct  +  mIHh(t)  sin  cocct 


Hence, 

*1  ( t )  =  2 tiiLH ( t )  cos  cocct  +  niQ(t)  sin  2 cocct  +  m/(r)(l  +  cos  2 coct)  +  mmh (/)  sin  2 cocct 

The  double-frequency  terms  will  be  suppressed  by  the  bandpass  filter.  In  addition,  the  signal 
2 mui  (/)  cos  cocct  will  be  invisible  because  of  the  frequency-interlacing  effect.  This  is  because 
the  spectrum  of  this  signal  is  the  spectrum  of  mi(t)  shifted  to  cocc  =  227.5a;/,,  and  it  will  become 
invisible  because  of  the  frequency  interlacing  discussed  earlier.  Hence,  the  filter  output  of  the 
0  to  1 .6  MHz  filter  yields  mi  (/).  Similarly,  the  output  of  the  0  to  0.6  MHz  filter*  yields  mq{t). 
The  three  signals  /n/,(f),  m/(f),  and  iwg(f)  are  then  matrixed  to  obtain  mr(t)9  mg(t ),  and  mh(t). 

A  color  carrier  is  generated  by  using  PLL.  For  this  purpose  we  separate  the  color  burst 
(Fig.  4.40)  and  apply  it  to  a  PLL  whose  output  is  the  locally  generated  carrier  that  tracks  the 
color  burst.  The  phase  of  the  locally  generated  carrier  is  adjustable.  This  is  called  tint  control. 


This  filter  will  suppress  mm  (/),  whose  components  lie  in  the  range  of  0.6  to  1 .6  MHz. 
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Figure  4.40 
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4.10MATLAB  EXERCISES 


In  this  section,  we  provide  MATLAB  exercises  to  reinforce  some  of  the  basic  concepts  on 
amplitude  modulations  covered  in  earlier  sections.  Our  examples  illustrate  the  modulation  and 
demodulation  of  DSB-SC,  AM,  SSB-SC,  and  QAM. 

DSB-SC  Modulation  and  Demodulation 

The  first  MATLAB  program,  triplesinc  .m,  is  to  generate  a  signal  that  is  (almost)  strictly 
band-limited  and  consists  of  three  different  delayed  version  of  the  sine  signal: 


%  (triplesinc. m) 

%  Baseband  signal  for  AM 
%  Usage  m=triplesinc ( t , Ta) 
function  m=triplesinc ( t , Ta) 

%  t  is  the  length  of  the  signal 


%  Ta  is  the  parameter,  equaling  twice  the  delay 


sig_l=sinc(2*t/Ta) ; 


sig__2=sinc  (2*t/Ta-l)  ; 
sig_3=sinc(2*t/Ta+l) ; 
m=2  *sig_l+sig_2+sig_3 ; 


end 
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Figure  4.41 

Example  signals 
in  time  and 
frequency 
domains  during 
DSB-SC 
modulation. 
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message  signal  and  the  DSB-SC  signal  for  both  time  and  frequency  domains  are  illustrated  in 
Fig.  4.41. 


%  (ExampleDSB.m) 

%  This  program  uses  triplesinc.m  to  illustrate  DSB  modulation 
%  and  demodulation 


ts=l . e-4 


t=-0.04:ts:0.04; 

Ta=0 . 01 ; 

m_sig=triplesinc ( t , Ta) ; 

Lf  f  t=length  ( t )  ;  Lf f t=2~ceil  (log2  (Lf  f  t)  )  ; 

M_fre=ff tshif t ( f f t (m_sig, Lf f t) ) ; 
freqm=(-Lfft/2:Lf ft/2-1) / (Lfft*ts) ; 

s_dsb=m_sig . *cos (2  *pi*500*t )  ; 

Lfft= length ( t ) ;  Lf f t=2~ceil (log2 (Lf f t) +1) ; 
S_dsb=ff tshif t ( f f t (s_dsb, Lf f t )  )  ; 
freqs=(-Lfft/2:Lf ft/2-1) / (Lfft^ts)  ; 

Trange= [-0.03  0.03  -2  2] 
figure (1) 

subplot (221) ; tdl=plot ( t , m_sig )  ; 

axis (Trange) ;  set ( tdl ,  # Linewidth ' ,2) ; 

xlabel ( 7 { \it  t)  (sec)');  ylabel('{\it  m}({\it  t})') 
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Figure  4.42 

Time  domain 
signals  during 
DSB-SC 

modulation  and 
demodulation. 
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subplot (223) ; td2=plot ( t, s_dsb)  ; 
axis (Trange) ;  set (td2, 'Linewidth' ,2) ; 

xlabel ('{\it  t}  (sec)');  ylabel('{\it  s}_{\rm  DSB}({\it  t})') 


Frange= [ -600  600  0  200] 

subplot (222) ; fdl=plot ( f reqm, abs (M_f re) ) ; 

axis (Frange) ;  set (fdl, ' Linewidth' , 2 ) ; 

xlabel ( ' { \ i t  f}  (Hz)');  ylabel ( ' ( {\it  f})') 

subplot (224) ; fd2=plot ( f reqs , abs (S_dsb) ) ; 

axis (Frange) ;  set (fd2, 'Linewidth'  ,2)  ; 

xlabel (  (\it  f)  (Hz)');  ylabel ('{ \it  S}_{rm  DSB) ({ \it  f  >  )  '  ) 


The  hrst  modulation  example,  ExampleDSBdeinf  ilt  .mis  based  on  a  strictly  low-pas 

bandar  "Sr(°;  KX,‘  WC  Wi"  g£nerate  3  different  message  signal  that  is  not  strictl; 
band-limited.  In  effect,  the  new  message  signal  consists  of  two  triangles- 


fiber  ^forder^T^'e'oriri80!  'm^emente^  with  a  finite  impulse  response  (FIR)  low- 

demodulator  signal  e(t)  J:~rt  3them(0,  the  dsb'sc  signai  n,{t)c°%a)ct 

pass  filtering  are  all  given  in  Fig  4  42  for  the  ^  *CmeTed  messaSe  si8nal  md{t)  attef 
domain  The  low  nat  Hlt^r  „  J  a  h  domain,  and  in  Fig.  4.43  for  the  frequ< 

result  shows  almost  no  distortion"  em°dulat°r  has  ^ndwidth  of  150  Hz.  The  demoduli 


Figure  4.43 

Frequency 
domain  signals 
during  DSB-SC 
modulation  and 
demodulation. 
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Figure  4.43 

Frequency 
domain  signals 
durina  DSB-SC 
modulation  and 
demodulation. 


/(Hz)  /(Hz) 


%  (ExampleDSBdemf ilt .m) 

%  This  program  uses  triangl.m  to  illustrate  DSB  modulation 
%  and  demodulation 

ts=l . e-4 ; 

t=-0 . 04 : ts : 0 . 04 ; 

Ta=0 . 01 ; 

m_sig=triangl ( (t+0 . 01) /0 . 01) -triangl ( (t-0 . 01) /0 . 01) ; 

Lm_s ig= length (m_sig) ; 

Lf f t=length (t ) ;  Lf f t=2~ceil (log2 (Lf f t)  )  ; 

M_fre=f ft shift (fft (m_sig/Lfft) )  ; 
freqm=(-Lfft/2:Lfft/2-l) / (Lfft*ts)  ; 

B_m=150;  %Bandwidth  of  the  signal  is  B_m  Hz. 
h=f irl (40 , [B_m*ts] ) ; 

t=-0 . 04 : ts : 0 . 04 ; 

Ta=0 . 01 ; f c=300 ; 
s_dsb=m_sig . *cos (2*pi*fc*t )  ; 

Lf  ft=length  (t)  ;  Lff  t=2/'ceil  (log2  (Lfft)  +1)  ; 

S_dsb=f f tshif t (fft (s_dsb/ Lf f t) ) ; 
freqs=(-Lfft/2:Lf ft/2-1) / (Lfft*ts) ; 

%  Demodulation  begins  by  multiplying  with  the  carrier 
s__dem=s_dsb.  *cos  (2*pi*fc*t)  *2 ; 
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S_dem=f f tshif t ( f f t (s_dem, Lf ft ) ) ; 

%  Using  an  ideal  LPF  with  bandwidth  150  Hz 
s_rec=f ilter (h, 1 , s_dem) ; 

S_rec=ff tshif t (fft (s_rec,Lfft) ) ; 


Trange= [ -0 . 025  0.025  -2  2]; 
figure (1) 

subplot (221) ; tdl=plot ( t , m_sig) ; 

axis (Trange) ;  set (tdl, 'Linewidth' ,1.5) ; 

xlabel ('{\it  t}  (sec)');  ylabel(/{\it  m}({\it  t})'); 

title ( 'message  signal'); 

subplot (222 ) ; td2=plot ( t, s_dsb) ; 

axis (Trange) ;  set (td2, 'Linewidth' ,1.5)  ; 

xlabel ( ' { \it  t}  (sec)');  ylabel('{\it  s}_{ \rm  DSB} ( { \it  t})') 

title ( 'DSB-SC  modulated  signal'); 

subplot (223 ) ; td3=plot ( t, s_dem) ; 

axis (Trange) ;  set(td3, 'Linewidth' ,1.5) ; 

xlabel ( ' { \it  t}  (sec)');  ylabel('{\it  e}({\it  t})') 

title ( ' { \it  e} ( { \it  t})'); 

subplot (224) ; td4=plot (t, s_rec)  ; 

axis (Trange) ;  set(td4, 'Linewidth' ,1.5) ; 

xlabel  ('{Ut  t}  (sec)');  ylabel('{\it  m}_d({\it  t))') 

title ( 'Recovered  signal'); 


') 


Frange= [ -700  700  0  200]; 
figure (2) 

subplot (221 ) ; fdl=plot ( freqm, abs (M_fre) ) ; 
axis (Frange) ;  set(fdl, ' Linewidth' , 1 . 5) ; 
xlabel  (  { \  i  t  f}  (Hz)');  ylabel('{\it  M}({\it  f}) 
title ( 'message  spectrum' ) ; 
subplot (222) ; fd2=plot ( freqs, abs (S_dsb) ) ; 
axis  (Frange)  ;  set  ( fd2 ,'  Linewidth'  ,  1  5)/ 

xlabel ( ' { \it  f)  (Hz)');  ylabel('{\it  S}_{rm  DSB}({\it  f))') 
title ( 'DSB-SC  spectrum'); 

subplot (223 ) ;fd3=plot( freqs, abs (S_dem) ) ; 

axis (Frange) ;  set (fd3, 'Linewidth'  1  5).’ 

xlabel  ('{\it  f)  (Hz)');  ylabel  ( '  {  \it  E>({Ut  f})'). 

title ( 'spectrum  of  {\it  e)({\it  t))')- 

subplot (224) ; f d4=plot ( freqs , abs ( S_rec ) ) • 

axis (Frange);  set(fd4, 'Linewidth' , 1 .5) 

xlabel (' {\it  f)  (Hz)');  ylabel (' {\it  M)_d({\it  f})')- 

title ( 'recovered  spectrum' ) ;  ' 


Figure  4.44 

Time  domain 
signals  in  AM 
modulation  and 
noncoherent 
demodulation. 


AM  Modulation  and  Demodulation 

In  this  exercise,  we  generate  a  conventional  AM  signal  with  m™i  \  a  f  ,,  -  1  By 
using  the  same  message  signal  m,  (,)  the  MATL  AR  n  modulation  index  of  H  -  ' 

erates  the  message  signal  th»  '  ”*ArLAB  program  Examp  1  eAMdemf  ilt  .m  gen 

8'  S,gnal'  'he  corresPonding  AM  signal,  ,h«  recdied  signal  in  noncoh«e« 
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Figure  4.44 

Time  domain 

2 

signals  in  AM 

modulation  and 

noncoherent 

1 

demodulation. 

f  0 

-2 

Message  signal 


-0.02  -0.01  0  0.01  0.02 
/(sec) 


Rectified  signal  without  local  carrier 


-0.02  -0.01  0  0.01 
/(sec) 


0.02 


/(sec) 
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demodulation,  and  the  rectified  signal  after  passing  through  a  low-pass  filter.  The  low- 
pass  filter  at  the  demodulator  has  bandwidth  of  150  Hz.  The  signals  in  time  domain 
are  shown  in  Fig.  4.44,  whereas  the  corresponding  frequency  domain  signals  are  shown 
in  Fig.  4.45. 

Notice  the  large  impulse  in  the  frequency  domain  of  the  AM  signal.  No  ideal  impulse  is 
possible  because  the  window  of  time  is  limited,  and  only  very  large  spikes  centered  at  the 
carrier  frequency  of  ±300  Hz  are  visible.  In  addition,  the  message  signal  bandwidth  is  not 
strictly  band-limited.  The  relatively  low  carrier  frequency  of  300  Hz  forces  the  LPF  at  the 
demodulator  to  truncate  some  message  components  in  the  demodulator.  Distortion  near  the 
sharp  corners  of  the  recovered  signal  is  visible. 


%  ( Example AMdemf ilt .m) 

%  This  program  uses  triangl.m  to  illustrate  AM  modulation 
%  and  demodulation 

ts=l . e-4 ; 

t=-0 . 04 : ts : 0 . 04 ; 

Ta=0 . 01 ;  fc=500; 

m_sig=triangl  (  (t+0.01) /0 . 01) -triangl ( (t-0 . 01) /0 . 01) ; 
Lm_sig=length (m_sig) ; 

Lfft=length(t)  ;  Lf  f  t=2/'ceil  ( log2  (Lf  f t )  )  ; 

M__fre=f  f  tshif  t  (fft  (m_sig, Lf f t ) )  ; 
freqm=(-Lfft/2:Lf ft/2-1) / (Lfft*ts) ; 

B_m= 150;  %Bandwidth  of  the  signal  is  B_m  Hz. 
h=f irl (40 , [B_m*ts] ) ; 
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Figure  4.45 

Frequency 
domain  signals 
in  AM 

modulation  and 

noncoherent 

demodulation. 


Message  spectrum 


/(Hz) 


/(Hz) 


%  AM  signal  generated  by  adding  a  carrier  to  DSB-SC 
s_am= ( l+m_sig) . *cos (2*pi*fc*t) ; 

Lfft=length(t) ;  Lfft=2~ceil (log2 (Lf ft) +1) ; 

S_am=f ftshift ( f ft (s_am, Lf ft) ) ; 
freqs= (-Lfft/2 :Lf ft/2-1 ) / (Lfft*ts) ; 

%  Demodulation  begins  by  using  a  rectifier 
s_dem=s_am . * (s_am>0) ; 

S_dem=f ftshift (f ft (s_dem,Lf ft) ) ; 

%  Using  an  ideal  LPF  with  bandwidth  150  Hz 
s~rec=f ilter (h, 1 , s_dem) ; 

S_rec=f ftshift (f ft (s_rec, Lf ft) ) ; 


Trange=[-0.025  0.025  -2  2]; 
figure (1) 


subplot (221 ) ;tdl=plot(t,m_sig) , 

axis (Trange) ;  set(tdl, 'Linewidth'  l  5) . 
xlabelcut  C)  (sec,’,;  ylsbel ,  ■  nu  ,  nit 
title ( 'message  signal');  }  ' 


subplot (222 ) ; td2=plot (t , s_am) ; 
axis (Trange) ;  set(td2, 'Linewidth' , 1 . 5) 
xlabel ( ' { \it  t)  (see)');  ylabel('{\it 
title ('AM  modulated  signal'); 


s)_{\rm  DSB) ( (\it 


t})  ') 
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subplot (223) ; td3=plot ( t , s_dem)  ; 

axis (Trange) ;  set(td3, ' Linewidth ',1 . 5 ) ; 

xlabel ( ' { \it  t}  (sec)');  ylabel('{\it  e}({\it  t})') 

title (' rectified  signal  without  local  carrier'); 

subplot (224) ; td4=plot ( t , s_rec ) ; 

Trangelow= [-0.025  0.025  -0.5  1]  ; 

axis (Trangelow) ;  set(td4# ' Linewidth' , 1 . 5 ) ; 

xlabel ( ' { \it  t}  (sec)');  ylabel  ('{ \it  m}__d ({ \it  t } ) ' ) 

title (' detected  signal'); 

Frange= [-700  700  0  200]; 
figure (2) 

subplot (221 ) ; fdl=plot ( freqm, abs (M_fre) ) ; 

axis (Frange) ;  set ( fdl , ' Linewidth' ,1.5); 

xlabel ( ' { \it  f}  (Hz)');  ylabel('{\it  M}({\it  f})'); 

title ( 'message  spectrum'); 

subplot (222 ) ; fd2=plot ( freqs, abs (S_am) )  ; 

axis ( Frange ) ;  set ( f  d2 , ' Linewidth ',1.5); 

xlabel ('{\it  f}  (Hz)');  ylabel  ('{ \it  S}__{rm  AM}  ({ \it  f})'); 
title ('AM  spectrum'); 

subplot (223 ) ; fd3=plot ( freqs , abs (S_dem) ) ; 

axi s ( Frange ) ;  set ( f d3 , ' Linewidth ',1.5); 

xlabel('{\it  f}  (Hz)');  ylabel('{\it  E}({\it  f})'); 

title (' rectified  spectrum'); 

subplot (224 ) ; fd4=plot ( freqs, abs (S_rec )  )  ; 

axis (Frange) ;  set (fd4, 'Linewidth' ,1.5); 

xlabel ('{\it  f}  (Hz)');  ylabel ('{ Nit  M}_d ({ \it  f})'); 

title (' recovered  spectrum'); 


SSB-SC  Modulation  and  Demodulation 

To  illustrate  the  SSC-SC  modulation  and  demodulation  process,  this  exercise  uses  the  same 
message  signal  m\ (/)  with  double  triangles  to  generate  an  SSB-SC  signal.  The  carrier  frequency 
is  still  300  Hz.  The  MATLAB  program  ExampleSSBdemf  ilt  .m  performs  this  function. 
Coherent  demodulation  is  applied  in  which  a  simple  low-pass  filter  with  bandwidth  of  150  Hz 
is  used  to  distill  the  recovered  message  signal. 

The  time  domain  signals  are  shown  in  Fig.  4.46,  whereas  the  corresponding  frequency 
domain  signals  are  shown  in  Fig.  4.47. 


%  (ExampleSSBdemf ilt  .m) 

%  This  program  uses  triangl.m 

%  to  illustrate  SSB  modulation  %  and  demodulation 
clear;clf ; 

ts=l . e-4 ; 
t=-0.04:ts:0.04; 

Ta=0 . 01 ;  f c=3  00 ; 

m_sig=triangl (  (t+0.01)/0.01) -triangl ( (t-0.01) /0.01) ; 
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Figure  4.46 

Time  domain 
signals  during 
SSB-SC  modula¬ 
tion  and 
coherent 
demodulation. 
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SSB-SC  modulated  signal 


t{  sec) 
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Figure  4.47 

Frequency 
domain  signals 
in  SSB-SC 
modulation  and 
coherent 
demodula¬ 
tion. 


/(Hz) 
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Lm_sig= length (m_sig) ; 

Lfft=length(t) ;  Lf f t=2*ceil (log2 (Lfft) ) ; 

M_fre=f ft shift ( f f t (m_sig, Lf f t )  )  ; 
freqm= (-Lfft/2:Lf ft/2-1) / (Lfft*ts) ; 

B_m=150;  %Bandwidth  of  the  signal  is  B_m  Hz. 
h=f irl ( 40 , [B_m*ts] ) ; 

s_dsb=m_sig. *cos (2*pi*fc*t) ; 

Lf f t=length (t ) ;  Lf f t=2~ceil (log2 (Lfft) +1) ; 

S_dsb=f f tshif t ( f f t (s_dsb/ Lf f t) )  ; 

L__lsb=f  loor  ( fc*ts*Lf  f t )  ; 

SSBf ilt=ones ( 1 , Lf  f t )  ; 

SSBfilt (Lf f t/2-L_lsb+l : Lf f t/2+L_lsb) = zeros ( 1 , 2*L_lsb) ; 

S_ssb=S_dsb. *SSBf ilt ; 

freqs= (-Lf f t/2 : Lf f t/2-1) / (Lfft*ts) ; 

s_ssb=real (if ft (fftshift (S_ssb) ) ) ; 

s_ssb=s_ssb ( 1 : Lm_sig) ; 

%  Demodulation  begins  by  multiplying  with  the  carrier 
s_dem=s_ssb. *cos (2*pi*fc*t) *2 ; 

S_dem=ff tshif t ( f f t (s_dem/ Lfft) ) ; 

%  Using  an  ideal  LPF  with  bandwidth  150  Hz 
s_rec=f ilter (h, 1 #  s_dem) ; 

S_rec=ff tshif t ( f f t (s_rec , Lf ft) ) ; 

Trange= [-0.025  0.025  -1  1]  ; 
figure (1) 

subplot (221) ; tdl=plot ( t/m_sig) ; 

axis (Trange) ;  set (tdl, 'Linewidth' ,1.5); 

xlabel ( ' { \it  t}  (sec)');  ylabel('{\it  m}({\it  t})'); 

title ( 'message  signal'); 

subplot (222) ; td2=plot ( t , s_ssb) ; 

axi s ( Trange ) ;  set ( td2 , ' Linewidth ',1.5); 

xlabel ( ' { \it  t)  (sec)');  ylabel('{\it  s}_{\rm  SSB}({\it  t})') 

title ( 'SSB-SC  modulated  signal'); 

subplot (223) ; td3=plot ( t , s_dem) ; 

axis (Trange) ;  set(td3, 'Linewidth' ,1.5) ; 

xlabel ('{\it  t}  (sec)');  ylabel('{\it  e}({\it  t})') 

title ('after  multiplying  local  carrier'); 

subplot (224) ; td4=plot ( t , s_rec ) ; 

axis (Trange) ;  set ( td4 , ' Linewidth' ,1.5) ; 

xlabel ('{\it  t}  (sec)');  ylabel('{\it  m}_d({\it  t})') 

title (' Recovered  signal'); 

Frange= [ -700  700  0  200]; 
figure (2 ) 

subplot (221) ;fdl=plot ( f reqm, abs (M_f re) )  ; 

axis ( Frange) ;  set ( fdl , ' Linewidth'  ,1.5)  ; 

xlabel ('{\it  f)  (Hz)');  ylabel('{\it  M}({\it  f})'); 
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Figure  4.48 
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Figure  4.49 

Frequency 
domain  signals 
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coherent 

demodulation  for 
the  first  message 


title ( 'message  spectrum'); 

subplot (222) ;fd2=plot (freqs;abs (S_ssb) )  ; 

axis (Frange) ;  set(fd2, 'Linewidth' , 1 . 5) ; 

x  label  (  '  { \  it  f)  (Hz)');  ylabel('{\it  S}__{rm  DSB}({\it  f}) 

title(  upper  sideband  SSB-SC  spectrum'); 

subplot ( 223 ) ; fd3=plot ( f reqs  f abs ( S_dem) ) \ 

axis (Frange) ;  set ( fd3 ,' Linewidth' , 1  5) .# 

xlabel ( ' { \it  f}  (Hz)');  ylabel('{\it  E}({\it  f>)'); 

title( 'detector  spectrum'); 

subplot (224) ; fd4=plot(f reqs, abs (S_rec) ) ; 

axis  (Frange)  ;  set  ( fd4 Linewidth'  1  5)/ 

xlabel  ( '  { \it  f>  (Hz)');  ylabel  ( '  {Ut  MLd(  {\it  f})')- 

title (' recovered  spectrum'); 


Figure  4.50 

Time  domain 
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modulation  and 
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- - — t^ciuuuuiauon 

and frequenc^i stays  at  300  H^buu^  dem°di;late  two  messaSe 

and  detected  TheOAM  '  L)  Hz,  but  two  signals  are  simultaneously  modulated 

tog“;t y  demodula,ed  by  with  600*'  -3 

by  the  same  low-pass  filter  of  order  40  The  MATLAR  1 ^  ^  Sig"a'  Pr°dUCt  “  Tjt 
completes  this  illustration  by  showing  S  Examp leQAMdemf! It.* 

demodulation  of  the  first  sienil  (t\*  \  u  domain  signals  during  the  modulation  and 

Wi(f)  are  shown  in  Fig.  4.48.  whereaslhe  f  SeC°nd  S‘8nal  7,16  time  domain  resul,S  ? 
Additionally,  the  time  domain  results  form  d°main  Signals  ^  sh°Wn  in  F'8'  4'49' 

domain  signals  are  shown  in  Fig  4  5 1  2  ^  shown  in  FlS-  4-50»  whereas  the  frequent) 
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Figure  4.49 
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Figure  4.51 
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%  ( Examp 1 eQ AMdem f i 1 t . m ) 

%  This  program  uses  triangl.m  and  triplesinc.m 

%  to  illustrate  QAM  modulation  %  and  demodulation 

%  of  two  message  signals 

clear ; cl f ; 

ts=l . e-4 ; 

t=-0.04:ts:0.04; 

Ta=0 . 01 ;  fc=300; 

%  Use  triangl.m  and  triplesinc.m  to  generate 

*  t»o  mess,,,  signals  of  different  shape,  and  spectra 

P-Sigl.trrangKIt.O.OH/O.OD-trianplat-O.OD/o  Oil- 

m_sig2=tnplesinc(t,Ta)  ;  '  ' 

Ln\_sig=length(m_sigl)  ; 

Lf ft=length(t) ;  Lf ft=2'ceil (log2 (Lfft) ) • 

Ml_f re=f f tshif t ( f f t (m_sigl , Lf f t ) ) ; 

M2_fte=f ftshif t (f ft (m_sig2 , Lfft) ) • 

freqm=(-Lfft/2:Lfft/2-l)/(Lfft* *ts) ; 

B_m=150;  %Bandwidth  of  the  signal  is  B_m  Hz. 

%  Design  a  simple  lowpass  filter  wi  t-H  k  j  • 

h=firl (40, [B_m*ts] ) ;  h  bandwidth  B_m  Hz . 

c*trier  — 
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S_qam=ff tshif t  ( f f t  (s_qam/  Lf ft)  )  ; 
freqs= (-Lfft/2 :Lf ft/2-1) / (Lfft*ts) ; 

%  Demodulation  begins  by  using  a  rectifier 
s_deml=s_qam.  *cos  (2*pi*fc*t)  *2  ; 

S_deml=ff tshif t (f f t (s_deml , Lf f t) )  ; 

%  Demodulate  the  2nd  signal 
s_dem2=s_qam.  *sin (2 *pi*fc*t )  *2; 

S_dem2=ff tshif t (f f t (s_dem2 , Lf f t) )  ; 

% 

%  Using  an  ideal  LPF  with  bandwidth  150  Hz 


s_recl=f ilter (h, 1 , s_deml) ; 

S_recl=f f tshif t ( f f t (s_recl , Lf f t) ) ; 
s_rec2  =  f ilter (h,  1 , s_dem2 )  ; 

S_rec2=ff tshif t ( f f t (s_rec2 , Lf f t) ) ; 

Trange= [-0.025  0.025  -2  2]; 

Trange2= [-0.025  0.025  -2  4] ? 
figured) 

subplot (221) ; tdl=plot (t/m_sigl) ; 

axis (Trange) ;  set (tdl , ' Linewidth' , 1 . 5) ; 

xlabel('{Nit  t}  (sec)');  ylabel  (  '  { \it  m}  ( { \it  t})'); 

title ( 'message  signal  1'); 

subplot (222 ) ; td2=plot ( t , s_qam) ; 

axis (Trange) ;  set (td2 , 'Linewidth' ,1.5) ; 

xlabel ( ' { \it  t}  (sec)');  ylabel('{\it  s}_{\rm  DSB}({\it  t})') 

title ('QAM  modulated  signal'); 

subplot (223 ) ; td3=plot ( t , s_deml ) ; 

axis (Trange2 ) ;  set (td3, 'Linewidth' ,1-5) ; 

xlabel ( ' { Nit  t}  (sec)');  ylabel('{\it  x}({\it  t})') 

title ('first  demodulator  output'); 

subplot (224 ) ; td4=plot ( t #  s_recl ) ; 

axis (Trange) ;  set (td4, 'Linewidth' , 1.5)  ; 

xlabel ( ' { \it  t}  (sec)');  ylabel ('{ \it  m}_{dl} ({ Nit  t})') 
title (' detected  signal  1'); 

figure (2) 

subplot (221) ; td5=plot (t/m_sig2) ; 

axis (Trange) ;  set (td5# 'Linewidth' ,1.5); 

xlabel (' {\it  t}  (sec)');  ylabel ('{Nit  m}({Nit  t})'); 

title ( 'message  signal  2'); 

subplot (222) ; td6=plot ( t , s_qam) ; 

axis (Trange) ;  set ( td6 , ' Linewidth' ,1.5); 

xlabel ('{Nit  t)  (sec)');  ylabel ('{Nit  s}_{Nrm  DSB} ( { Nit  t})') 

title ('QAM  modulated  signal'); 

subplot (223) ; td7  =plot ( t , s_dem2  )  ; 

axis (Trange2 ) ;  set (td7 , 'Linewidth' ,1.5) ; 

xlabel ('{Nit  t}  (sec)');  ylabel ('{Nit  e}_l({Nit  t})') 
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title (' second  demodulator  output'); 
subplot (224 ) ; td8=plot (t, s_rec2) ; 
axis (Trange) ;  set (td8, 'Linewidth'  ,1.5)  ; 

xlabel('{\it  t}  (sec)');  ylabel ( ' { \it  m}_(d2 } ( { Nit  t})') 
title ( 'detected  signal  2'); 

Frange= (-700  700  0  250]; 
figure (3 ) 

subplot (221) ; fdl=plot  ( f reqm, abs (Ml_fre) ) ; 

axis (Frange) ;  set(fdl, 'Linewidth'  ,  1.5)  ; 

xlabel ( ' { Nit  f)  (Hz)');  ylabel ('{ Nit  M] ({ Nit  f})'); 

title ( 'message  1  spectrum'); 

subplot ( 222 ) ;fd2=plot(f reqs, abs (S_qam) ) ; 
axis (Frange) ;  set (fd2, 'Linewidth' ,  1 . 5) ; 

xlabel ( ' {Nit  f)  (Hz)');  ylabel (' (Nit  S}_{rm  AM} ({ Nit  f})'); 

title('QAM  spectrum  magnitude'); 

subplot (223 ) ;fd3=plot(freqs,abs(S_deml) ) ; 

axis (Frange) ;  set(fd3, 'Linewidth' , 1 . 5) ; 

xlabel ('{Nit  f)  (Hz)');  ylabel ('{\it  E}_l({Nit  f})'); 

title ( 1  first  demodulator  spectrum' ) ; 

subplot (224) ; fd4=plot ( f reqs, abs (S_recl) ) ; 

axis (Frange) ;  set ( fd4, ' Linewidth', 1 . 5) ; 

xlabel ('{Nit  f}  (Hz)');  ylabel (' {Nit  M}_{dl} ( {Nit  f})'); 

ti tie ( 'recovered  spectrum  l'); 

figure (4) 

subplot (221 ) ; f dl=plot ( f reqm, abs (M2_f re) ) ; 
axis (Frange) ;  set ( fdl, 'Linewidth' , 1 . 5) ; 
xlabel ('{Nit  f)  (Hz)');  ylabel ('{\it  M} ( {Nit  f})')- 
ti tie (' message  2  spectrum'); 


!_rec2 ) ) ; 

•1.5); 

Ait  M}_{d2 } ( {Nit  f } )  ' )  ; 


E)_2 ( { Nit  f}) ') ; 


nn  AM} ({\it  f})  ')  ; 
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PROBLEMS 


4.2- 1  You  are  given  the  baseband  signals  (i)  m(t)  =  cos  10007rr;  (ii)  m(t)  =  2  cos  10007rf  -f 

sin  2000;rf;  (iii)  m(t)  =  cos  1000;r/cos  3000;r/;  (iv)  m(t)  =  exp(— 10|/|).  For  each  one,  do 
the  following. 

(a)  Sketch  the  spectrum  of  m(t). 

(b)  Sketch  the  spectrum  of  the  DSB-SC  signal  m(t)  cos  10,0007rf. 

(c)  Identify  the  upper  sideband  (USB)  and  the  lower  sideband  (LSB)  spectra. 

4.2- 2  Repeat  Prob.  4.2-1  if  (i)  m(t)  =  sinc(lOO^r);  (ii)  m(t)  =  (1  +  /2)_l;  (iii)  m(t)  =  *>-l°l'-l|. 

Observe  that  10|/—  1 1  jS£>-10|/|  delayed  by  1  second.  For  the  last  case  you  need  to  consider  both 
the  amplitude  and  the  phase  spectra. 

4.2- 3  You  are  asked  to  design  a  DSB-SC  modulator  to  generate  a  modulated  signal  km(t)  cos  {coct  +  0), 

where  m(t)  is  a  signal  band-limited  to  B  Hz.  Figure  P4.2-3  shows  a  DSB-SC  modulator  available 
in  the  stockroom.  The  carrier  generator  available  generates  not  cos  coct ,  but  cos3  coct.  Explain 
whether  you  would  be  able  to  generate  the  desired  signal  using  only  this  equipment.  You  may  use 
any  kind  of  filter  you  like. 

(a)  What  kind  of  filter  is  required  in  Fig.  P4.2-3? 

(b)  Determine  the  signal  spectra  at  points  b  and  c\  and  indicate  the  frequency  bands  occupied  by 
these  spectra. 

(c)  What  is  the  minimum  usable  value  of  o>c? 

(d)  Would  this  scheme  work  if  the  carrier  generator  output  were  sin3  coctl  Explain. 

(f)  Would  this  scheme  work  if  the  carrier  generator  output  were  cos"  coct  for  any  integer  n  >  2? 

figure  P.4.2-3 


4.2-4  You  are  asked  to  design  a  DSB-SC  modulator  to  generate  a  modulated  signal  km(t)  cos  coct  with 
the  carrier  frequency  fc  =  500  kHz  (coe  =  2 tt  x  500, 000).  The  following  equipment  is  available 
in  the  stockroom:  (i)  a  signal  generator  of  frequency  1 00  kHz:  (ii)  a  ring  modulator;  (iii)  a  bandpass 
filter  tuned  to  500  kHz. 
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Figure  P.4.2-5 


Figure  P.4.2-7 


(a)  Show  how  you  can  generate  the  desired  signal. 

(b)  If  the  output  of  the  modulator  is  k  •  m{t)  cos  coct ,  find  k. 

4.2-5  Amplitude  modulators  and  demodulators  can  also  be  built  without  using  multipliers.  In  Fig.  P4.2- 
5,  the  input  0(f)  =  m(f),  and  the  amplitude  A  »  |0(f)|.  The  two  diodes  are  identical  with  a 
resistance  r  ohms  in  the  conducting  mode  and  infinite  resistance  in  the  cutoff  mode.  Show  that 
the  output  eQ(t)  is  given  by 

2  R 

eo(0  =  — —  w(f)m(f) 

R  +  r 

where  w(f)  is  the  switching  periodic  signal  shown  in  Fig.  2.20a  with  period  2 Tr/a>c  seconds. 

(a)  Hence,  show  that  this  circuit  can  be  used  as  a  DSB-SC  modulator. 

(b)  How  would  you  use  this  circuit  as  a  synchronous  demodulator  for  DSB-SC  signals. 


In  1  i^.  P4_  5,  if  0(f)  sin  (coct  -f  0),  and  the  output  eQ (t)  is  passed  through  a  low-pass  filter, 
show  that  this  circuit  can  be  used  as  a  phase  detector,  that  is,  a  circuit  that  measures  the  phase 
difference  between  two  sinusoids  of  the  same  frequency  (coc). 

Hint:  Show  that  the  filter  output  is  a  dc  signal  proportional  to  sin  d. 

4'2'7  over  achtri!  k  Tu  ^  t0  5000  Hz- « be  transmitted  simultaneously 

Ilunle!!e7inV  hmflP  g  SChCme  Sh°Wn  in  Fi§-  P4-2'7-  The  «gnal  at  point  b  is  the 

si  af a  intfui,  W  ICh  "®w  modula‘es  a  carrier  of  frequency  20,000  Hz.  The  modulated 
signal  at  point  c  is  transmitted  over  a  channel. 

(a)  Sketch  signal  spectra  at  points  a,  b,  and  c. 

(b)  What  must  be  the  bandwidth  of  the  distortionless  channel? 

Design  a  receiver  to  recover  signals  m(  ( t )  and  m2(t)  from  the  modulated  signal  at  point  c. 


Figure  P.4.2-8 


Figure  P.4.2-9 


4 


F«9ure  P.4.3-1 
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Figure  P.4.2-8 


Figure  P.4.2-9 


4.2-8  System  shown  in  Fig.  P4.2-8  is  used  for  scrambling  audio  signals.  The  output  y (f )  is  the  scrambled 
version  of  the  input  m(t). 

(a)  Find  the  spectrum  of  the  scrambled  signal  y(t). 

(b)  Suggest  a  method  of  descrambling  y(r)  to  obtain  m(t). 

A  slightly  modified  version  of  this  scrambler  was  first  used  commercially  on  the  25-mile  radio¬ 
telephone  circuit  connecting  Los  Angeles  and  Santa  Catalina  island. 


(Scrambled  output) 


4.2-9  An  amateur  audio  scrambler/descrambler  pair  is  shown  in  Fig.  P4.2-9. 

(a)  Graphically  find  and  show  the  spectra  of  signals  *(/),  y(f),  and  z(t)  when  coq  =  20, 000;r. 

(b)  Graphically  find  and  show  the  spectra  of  signals  y(t)  and  z(t)  when  ojq  =  30, 0007T. 

(c)  Show  whether  or  not  we  can  descramble  z(t)  to  recover  m(t). 


4.3-1  In  an  amplitude  modulation  system,  the  message  signal  is  given  by  Fig.  P4.3-1  and  the  carrier 
frequency  is  1  kHz.  The  modulator  output  is 

$Am(0  =  2[fc  +  0.5 m(t)]  cos  (Dct 

(a)  Determine  the  average  message  power. 

(b)  If  b  =  1,  determine  the  modulation  index  and  the  modulation  power  efficiency. 

(c)  Sketch  the  modulated  signal  of  part  (a)  in  the  time  domain. 

(d)  If  b  —  0.5,  repeat  parts  (a)  and  (b). 


figure  P.4.3-1 
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4.3- 2  Sketch  the  AM  signal  [A  +  m(t)]  cos  a)ct  for  the  periodic  triangle  signal  m(t)  shown  in  Fig.  P4.3-1 

corresponding  to  the  modulation  index:  (a)  \i  —  0.5;  (b)  /i  =  1;  (c)  /x  =  2;  (d)  n  =  oo.  How  do 
you  interpret  the  case  //  =  oo? 

4.3- 3  For  the  AM  signal  with  m(t )  shown  in  Fig.  P4.3-3  and  fi  =  0.75: 

(a)  Find  the  amplitude  and  power  of  the  carrier. 

(b)  Find  the  sideband  power  and  the  power  efficiency  r\. 


Figure  P.4.3-3  —  t*\ 


4.3  4  UO  Sketch  the  time  domain  DSB-SC  signal  corresponding  to  the  message  signal  m(t)  in  Fig. 


(b)  The  DSB-SC  signal  of  part  (a)  is  applied  at  the  input  of  an  envelope  detector.  Show  that  the 
output  ol  the  envelope  detector  is  not  m(t\  but  |m(f)|.  Show  that,  in  general,  if  an  AM  signal 
A  +  m(0|  cos  wct  is  envelope-detected,  the  output  is  | A  +  m(t)\.  Hence,  show  that  the  condition 
lor  recovering  m(t)  from  the  envelope  detector  is  A  4-  m(t)  >  0  for  all  t. 


4.3-5  In  a  DSB-SC  amplitude  modulation  system,  the  message  signal  is  m(t)  =  e~3,u(t  -  2)  and  the 
earner  signal  is  cos  2000717. 

(a)  Find  and  sketch  the  Fourier  transform  of  the  message  signal. 

(b)  Find  and  sketch  the  Fourier  transform  of  the  modulated  signal. 


4.3-7 


Figure  P.4.3-7 


4.3-8  In  the  earlv  Have  r»f  mriin  a**  • 


Figure  P.4.3-8 


<PAM«) 


X(t) - 

Low-pass 


® 


DC  block 


© 
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4.3- 9  DSB-SC  signals  can  also  be  demodulated  by  using  a  pair  of  crystals.  Given  a  DSB-SC  signal 

0(f),  show  that  by  constructing  its  sum  and  its  difference  with  the  local  oscillator  output  cos  (Dct, 
we  can  demodulate  0(f)  without  using  multipliers. 

4.4- 1  In  a  QAM  system  (Fig.  4. 19),  the  locally  generated  carrier  has  a  frequency  error  Aco  and  a  phase 

error  <5;  that  is,  the  receiver  carrier  is  cos  [(< oc  -I-  Aco)t  +  8]  or  sin  [(coc  H-  Aco)t  4-  8].  Show  that 
the  output  of  the  upper  receiver  branch  is 

m\(t)  cos  [(Aa>)f  -f  8]  -  W2(f)  sin  [(A  co)t  4-  <$] 

instead  of  m\  (f),  and  the  output  of  the  lower  receiver  branch  is 

m\ (f)  sin  [(Ao;)f  +  8]  +  W2(f)  cos  [(A co)t  4-  8] 


instead  of  fW2(0- 

4.4- 2  A  modulating  signal  m(f)  is  given  by: 

(a)  m(t)  =  cos  1007rf  4-  2  cos  3007rf 

(b)  m(t)  =  sin  1007rf  sin  5007rf 

In  each  case: 

(i)  Sketch  the  spectrum  of  m(f). 

(ii)  Find  and  sketch  the  spectrum  of  the  DSB-SC  signal  2 m(t)  cos  lOOO^f. 

(iii)  From  the  spectrum  obtained  in  part  (ii),  suppress  the  LSB  spectrum  to  obtain  the  USB 
spectrum. 

(iv)  Knowing  the  USB  spectrum  in  part  (ii),  write  the  expression  ^USB(f)  for  the  USB  signal. 

(v)  Repeat  parts  (iii)  and  (iv)  to  obtain  the  LSB  signal  ^LSB  (f). 

4.4- 3  For  the  signals  in  Prob.  4.4-2  and  a  carrier  frequency  (oc  of  1000  7r,  use  Eq.  (4.20)  to  determine 

the  time  domain  expressions  ^LSB  (f)  and  ^USB  (f). 

Hint:  If  m(t)  is  a  sinusoid,  its  Hilbert  transform  m^t)  is  the  sinusoid  m(f)  phase-delayed  by 
71  / 2  rad. 

4.4- 4  Find  <pLSB(f)  and  <pUSB(f )  for  the  modulating  signal  m(t)  =  7r£sinc2  (2nBt)  with  B  =  2000  Hz 

and  carrier  frequency  fc  =  10,000  Hz.  Follow  these  steps: 

(a)  Sketch  spectra  of  m(f)  and  the  corresponding  DSB-SC  signal  2m(t)  cos  coct. 

(b)  To  find  the  LSB  spectrum,  suppress  the  USB  in  the  DSB-SC  spectrum  found  in  part  (a). 

(c)  Find  the  LSB  signal  <pLSBM’  which  is  the  inverse  Fourier  transform  of  the  LSB  spectrum 
found  in  part  (b).  Follow  a  similar  procedure  to  find  <pUSB  (f ). 

4.4- 5  Given  that  mh(t)  is  the  Hilbert  transform  of  m(f),  do  the  following. 

(a)  Show  that  the  Hilbert  transform  of  mh(t )  is  -m(t). 

(b)  Show  also  that  the  energies  of  m(f)  and  mh(t )  are  identical. 

4.4- 6  An  LSB  signal  is  demodulated  coherently.  Unfortunately,  because  of  the  transmission  delay,  the 

received  signal  carrier  is  not  2  cos  ioct  as  sent;  rather,  it  is  2  cos  [(coc  4-  A w)t  4-  8].  The  local 
oscillator  is  still  cos  (oct.  Show  the  following. 

(a)  When  8  =  0,  the  output  y(t)  is  the  signal  m(f)  with  all  its  spectral  components  shifted  (offset) 
by  A co. 
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Figure  P.4.5-1 


Hint:  Observe  that  the  output  y(t)  is  identical  to  the  right-hand  side  of  Eq.  (4.20a)  with  coc 
replaced  with  A co. 

(b)  When  Aco  =  0,  the  output  is  the  signal  m(t )  with  phases  of  all  its  spectral  components  shifted 
by  8. 

Hint:  Show  that  the  output  spectrum  Y(f)  =  M(f)eis  for  f  >  0,  and  equal  to  M{f)e~^ 
when  /  <  0. 

(c)  In  each  of  these  cases,  explain  the  nature  of  distortion. 

Hint:  For  part  (a),  demodulation  consists  of  shifting  an  LSB  spectrum  to  the  left  and  right 
by  (oc  4-  A  a)  and  low-pass-filtering  the  result.  For  part  (b),  use  the  expression  (4.20b)  for 
^lsb^’  multiply  it  by  the  local  carrier  2  cos  (coct  -I-  6),  and  low-pass-filter  the  result. 

4.4- 7  A  USB  signal  is  generated  by  using  the  phase  shift  method  (Fig.  4.17).  If  the  input  to  this  system 

is  ntfj(t)  instead  of  m{t ),  what  will  be  the  output?  Is  this  signal  still  an  SSB  signal  with  bandwidth 
equal  to  that  of  m(t)l  Can  this  signal  be  demodulated  [to  get  back  m(t)]l  If  so,  how? 

4.5- 1  A  vestigial  filter  Hj(f)  shown  in  the  transmitter  of  Fig.  4.21  has  a  transfer  function  as  shown  in 

Fig.  P4.5-1.  The  earner  frequency  is^  =  10  kHz  and  the  baseband  signal  bandwidth  is  4  kHz. 
Find  the  corresponding  transfer  function  of  the  equalizer  filter  Hn(f)  shown  in  the  receiver  of 
Fig.  4.21. 

Hint:  Use  Eq.  (4.25). 


4.5-3 


width  4  kHz  The  I  f  l,°n  SyStem‘  The  baseband  **nal  »  an  audio  signal  of  band- 

h7)  has  a  svmme^  r  qUenCy  “  ^  SlWose  that  the  transmission  vestigial  filter 

' V '  hdS  d  (Symmetnc)  frequency  response  as  shown  in  Figure  P4.5-2. 

(a)  Design  and  illustrate  a  receiver  system  block  diagram. 

<b)  Find  the  bandwidth  of  this  transmission. 

Describe  and  sketch  the  necessary  equalizer  filler  response  for  distortionless  reception 

T  Wi'h  bM<iW“  400  kH-  a“8"ed 

achieved  h,  a.  ideal  BPF  bandpass  SUe,  HrV“  l.'^“-"”.dlaerM’  °f  Fig“re  P4'5'3' “ 
and adetailed iS^s^SbtoS“g^18'  W  5-3  *P«ifyine  H*  carrier  Ireqocncy 


Figure  P.4.5-2 


Figure  P.4.5-3 
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Figure  P.4.5-2 


Figure  P.4.5-3 


r(t) 


2  cos  (wct) 


Ban< 

dpass  filte 

r 

/ 

0  2.3  MHz  2.8  MHz 

f,  MHz 


(b)  For  distortionless  detection,  derive  and  plot  the  receiver  filter  frequency  response  needed  at 
the  front  end  of  the  demodulator. 


4.8-1  A  second-order  PLL  is  implemented  with  a  nonideal  loop  filter 


H(s)  = 


s  +  a 
s  +  b 


in  which  b  >  0  is  very  small. 

(a)  Applying  small-signal  analysis,  determine  the  transfer  function  between  Se(s)  and  0/(5). 

(b)  Find  the  steady  state  PLL  phase  error  for  an  incoming  phase 


0/(0  =  (<*>o  ~coc)t  +  (po 


r  ANGIE  MODULATION  AND 
J>  DEMODULATION 


As  discussed  in  the  previous  chapter,  a  carrier  modulation  can  be  achieved  by  modu¬ 
lating  the  amplitude,  frequency,  and  phase  of  a  sinusoidal  carrier  of  frequency  fc- 
In  that  chapter,  we  focused  on  various  linear  amplitude  modulation  systems  and  their 
demodulations.  Now  we  discuss  nonlinear  frequency  modulation  (FM)  and  phase  modulation 
(PM),  often  collectively  known  as  angle  modulation. 


5.1  NONLINEAR  MODULATION 

In  AM  signals,  the  amplitude  of  a  carrier  is  modulated  by  a  signal  «(/),  and,  hence,  th 
information  content  of  m(t)  is  in  the  amplitude  variations  of  the  carrier.  As  we  have  seer 
the  other  two  parameters  of  the  carrier  sinusoid,  namely  its  frequency  and  phase,  can  als 
C  Va.ne  m  P™P°rt,on  to  *^c  message  signal  as  frequency-modulated  and  phase-modulatei 
modulation^PM)6  ^  C  n°W  descr'be  tbe  essence  of  frequency  modulation  (FM)  and  phas 

False  Start 

to  mduL2ti  inJtS  infancy-  H°wever’ there  was  a"  acti  ve  search  for  technique 

width  (sideband  eftrir,  '  '.  f  ^ is  P^ional  to  the  modulated  signal  band 
bandwidth  M  '  '  s  Wcre  ocused  on  finding  a  modulation  scheme  that  would  reduce  th 

a  ““  rcdUaion  als»  users,  and  .here  wer 

bandwidth!).  The  idea  of  ml"atln8  sidebands  <"°  Mdeband'|  ” 

varied  in  proportion  m  th,.  ,  '  t,on  where  the  carrier  frequency  would  b 

co(t)  would  be  varied  with  ^  CaiTier  angU'ar  freqUenL' 

If  the  peak  amplitude  of  m(t)  is  then  the  ^  WherC  *  'S  arbitrary  C°nStan' 
frequency  would  be  coc  +  km  and',,  _  z  dXimum  and  minimum  values  of  the  came 

would  remain  within  this  band  with  a' bandwidthT011"6^'  the  SpeCtra‘  comp0nent 

that  controlling  the  constant  parameter  L  -  ”!P  centered  at  Mc-  The  understanding  wa 

this  is  true,  there  was  also  the  hope  that  bv"  C°ntr°  ***  modulated  si8nai  bandwidth.  Whil< 
information  bandwidth  arbitrarily  small  This  Sma"  *’  WC  C°U'd  makC  ^ 

cation  heaven.  Unfortunatelv  exnerime  t  i  .  ability  was  seen  as  a  passport  to  communi 

seriously  wrong.  The  FM  bandwidth,  ukturned*  that  the  underlyinS  reasoning  wa 

2j2  ou  ’ IS  always  greater  than  (at  best  equal  to 
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the  AM  bandwidth.  In  some  cases,  its  bandwidth  was  several  times  that  of  AM.  Where  was 
the  fallacy  in  the  original  reasoning?  We  shall  soon  find  out. 

The  Concept  of  Instantaneous  Frequency 

While  AM  signals  carry  a  message  with  their  varying  amplitude,  FM  signals  can  vary  the 
instantaneous  frequency  in  proportion  to  the  modulating  signal  m(t).  This  means  that  the 
carrier  frequency  is  changing  continuously  every  instant.  Prima  facie,  this  does  not  make 
much  sense,  since  to  define  a  frequency,  we  must  have  a  sinusoidal  signal  at  least  over  one 
cycle  (or  a  half-cycle  or  a  quarter-cycle)  with  the  same  frequency.  This  problem  reminds  us 
of  our  first  encounter  with  the  concept  of  instantaneous  velocity  in  a  beginning  mechanics 
course.  Until  the  presentation  of  derivatives  via  Leibniz  and  Newton,  we  were  used  to  thinking 
of  velocity  as  being  constant  over  an  interval,  and  we  were  incapable  of  even  imagining  that 
velocity  could  vary  at  each  instant.  We  never  forget,  however,  the  wonder  and  amazement  that 
was  caused  by  the  contemplation  of  derivative  and  instantaneous  velocity  when  these  concepts 
were  first  introduced.  A  similar  experience  awaits  the  reader  with  respect  to  instantaneous 
frequency. 

Let  us  consider  a  generalized  sinusoidal  signal  (pit)  given  by 

(p(t)  =  A  cos  Oit)  (5.1) 

where  Oit)  is  the  generalized  angle  and  is  a  function  of  t.  Figure  5.1  shows  a  hypothetical 
case  of  0(t).  The  generalized  angle  for  a  conventional  sinusoid  A  cos  (coct  +  Oo)  is  a  straight 
line  coct  +  Oo ,  as  shown  in  Fig.  5.1.  A  hypothetical  case  general  angle  of  0(t)  happens  to  be 
tangential  to  the  angle  (coct  +  Oo)  at  some  instant  t.  The  crucial  point  is  that,  around  r,  over 
a  small  interval  At  -*  0,  the  signal  (pit)  =  A  cos  Git)  and  the  sinusoid  A  cos  icuct  +  Oo)  are 
identical;  that  is, 


(pit)  =  A  cos  ia)ct  +  $o)  t\  <  t  <  t2 

We  are  certainly  justified  in  saying  that  over  this  small  interval  A/,  the  angular  frequency  of 
(pit)  is  coc.  Because  icoct  +  Oo)  is  tangential  to  Oit ),  the  angular  frequency  of  (pit)  is  the  slope 
of  its  angle  Oit)  over  this  small  interval.  We  can  generalize  this  concept  at  every  instant  and 
define  that  the  instantaneous  frequency  coj  at  any  instant  t  is  the  slope  of  Oit)  at  t.  Thus,  for  (pit) 
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Now  we  can  see  the  possibility  of  transmitting  the  information  of  m(t)  by  varying  the  angle  0  of 
a  carrier.  Such  techniques  of  modulation,  where  the  angle  of  the  carrier  is  varied  in  some  manner 
with  a  modulating  signal  m(f),  are  known  as  angle  modulation  or  exponential  modulation. 
Two  simple  possibilities  are  phase  modulation  (PM)  and  frequency  modulation  (FM).  In 
PM,  the  angle  6(t)  is  varied  linearly  with  m(t): 

0{t)  =  coct  +  0o  +  kpm(t) 

where  kp  is  a  constant  and  coc  is  the  carrier  frequency.  Assuming  Go  =  0,  without  loss  of 
generality, 


in  Eq.  (5.1),  the  instantaneous  angular  frequency  and  the  generalized  angle  are  related  via 

(5.2a) 


dO 

mm  -  j, 


m 


= r , 

J- 00 


(a)  da 


(5.2b) 


9(t)  =  a)ct  +  kpm(t ) 


(5.3a) 


The  resulting  PM  wave  is 


9m  (/)  =  A  cos  [togt  +  kpm(t)]  (5.3b) 

The  instantaneous  angular  frequency  a>,(/)  in  this  case  is  given  by 


de 


"<(')  =  -^=(vc  +  kpm(t) 


(5.3c) 


Hence  m  PM,  the  instantaneous  angular  frequency  w,  varies  linearly  with  the  derivative  of 
e  modulating  signaUf  the  instantaneous  frequency  w,  is  varied  linearly  with  the  modulating 
signal,  we  have  FM.  Thus,  in  FM  the  instantaneous  angular  frequency  is 


<w,(f)  =  o)c  +  kfm(t) 

where  kf  is  a  constant.  The  angle  0(f)  is  now 

m 


(5.4a) 


=  /  [a>f  +  kfm(a)]da 

J-00 

=  a>ct  +  kf  f  m(a)da 
d  —  00 

Here  w.  have  assumed  ,he  consram  term  in  «(,)  «o  b,  wilhou,  |oss  of  generalily.  FM 


Vm (')  -  A  cos  ^r  + 1,  j'  „(o)  del 


(5.5) 
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Relationship  between  FM  and  PM 

From  Eqs.  (5.3b)  and  (5.5),  it  is  apparent  that  PM  and  FM  not  only  are  very  similar  but  are 
inseparable.  Replacing  m(t)  in  Eq.  (5.3b)  with  / m(a)dot  changes  PM  into  FM.  Thus,  a  signal 
that  is  an  FM  wave  corresponding  to  m(t)  is  also  the  PM  wave  corresponding  to  fm(ce)da 
(Fig.  5.2a).  Similarly,  a  PM  wave  corresponding  to  m(t)  is  the  FM  wave  corresponding  to 
m(t)  (Fig.  5.2b).  Therefore,  by  looking  only  at  an  angle-modulated  signal  cp(t ),  there  is  no  way 
of  telling  whether  it  is  FM  or  PM.  In  fact,  it  is  meaningless  to  ask  an  angle-modulated  wave 
whether  it  is  FM  or  PM.  It  is  analogous  to  asking  a  married  man  with  children  whether  he  is 
a  father  or  a  son.  This  discussion  and  Fig.  5.2  also  show  that  we  need  not  separately  discuss 
methods  of  generation  and  demodulation  of  each  type  of  modulation. 

Equations  (5.3b)  and  (5.5)  show  that  in  both  PM  and  FM  the  angle  of  a  carrier  is  varied 
in  proportion  to  some  measure  of  m(t).  In  PM,  it  is  directly  proportional  to  m(t ),  whereas  in 
FM,  it  is  proportional  to  the  integral  of  m(t).  As  shown  in  Fig.  5.2b,  a  frequency  modulator 
can  be  directly  used  to  generate  an  FM  signal  or  the  message  input  m(t)  can  be  processed 
by  a  filter  (differentiator)  with  transfer  function  H(s )  =  s  to  generate  PM  signals.  But  why 
should  we  limit  ourselves  to  these  cases?  We  have  an  infinite  number  of  possible  ways  of 
processing  m(t)  before  FM.  If  we  restrict  the  choice  to  a  linear  operator,  then  a  measure  of 
m(t)  can  be  obtained  as  the  output  of  an  invertible  linear  (time-invariant)  system  with  transfer 
function  H  ( s )  or  impulse  response  h(t).  The  generalized  angle-modulated  carrier  v?em  (0  can  be 
expressed  as 


<pEM  (0  =  A  cos  +  \l/(t)]  (5.6a) 

=  Acos^a)ct  +  f  m(a)  h(t  -  a)  dot  j  (5.6b) 

As  long  as  H(s)  is  a  reversible  operation  (or  invertible),  m(t)  can  be  recovered 
from  x//(t)  by  passing  it  through  a  system  with  transfer  function  [//($)]“ 1  as  shown  in 
Fig.  5.3.  Now  PM  and  FM  are  just  two  special  cases  with  h(t)=kp8(t)  and  h(t)  =  kfu(t)y 
respectively. 

This  shows  that  if  we  analyze  one  type  of  angle  modulation  (such  as  FM),  we  can  readily 
extend  those  results  to  any  other  kind.  Historically,  the  angle  modulation  concept  began  with 
FM,  and  here  in  this  chapter  we  shall  primarily  analyze  FM,  with  occasional  discussion  of 
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PM.  But  this  does  not  mean  that  FM  is  superior  to  other  kinds  of  angle  modulation.  On  the 
contrary,  tor  most  practical  signals,  PM  is  superior  to  FM.  Actually,  the  optimum  performance 
is  realized  neither  by  pure  PM  nor  by  pure  FM,  but  by  something  in  between. 

Power  of  an  Angle-Modulated  Wave 

Although  the  instantaneous  frequency  and  phase  of  an  angle-modulated  wave  can  vary  with 
time,  the  amplitude  A  remains  constant.  Hence,  the  power  of  an  angle-modulated  wave 
(PM  or  FM)  is  always  A2/ 2,  regardless  of  the  value  of  kp  or  kf. 


Example  5. 1  Sketch  FM  and  PM  waves  for  the  modulating  signal  m(t)  shown  in  Fig.  5.4a.  The  constants 

7  an  p  are  x  10  and  10;r,  respectively,  and  the  carrier  frequency  fc  is  100  MHz. 


Figure  5.4 

FM  and  PM 
waveforms. 


Example  5.2 


For  FM: 


—  coc  4-  kfm(t) 
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Dividing  throughout  by  2tt,  we  have  the  equation  in  terms  of  the  variable  f  (frequency  in 
hertz).  The  instantaneous  frequency f  is 


fi=fc  +  ^Lm(t) 


=  108  4-  105m(/) 

{fi) min  =  108  +  105  [m(r)]mi„  =  99.9  MHz 
(fi)  max  =  108  +  105[m(0]max  =  100.1  MHz 


'max 


Because  m(t)  increases  and  decreases  linearly  with  time,  the  instantaneous  frequency 
increases  linearly  from  99.9  to  100. 1  MHz  over  a  half-cycle  and  decreases  linearly  from 
100.1  to  99.9  MHz  over  the  remaining  half-cycle  of  the  modulating  signal  (Fig.  5.4b). 

PM  for  m{t)  is  FM  for  m(t).  This  also  follows  from  Eq.  (5.3c). 

For  PM: 


=  108  +5  m{t) 

{fi) min  =  108  +  5  [«(0]nin  =  108  -  10s  =  99.9  MHz 
tfW  =  108  +  5[w(r)]max  =  100.1  MHz 


Because  m(t)  switches  back  and  forth  from  a  value  of  —20,000  to  20,000,  the  carrier 
frequency  switches  back  and  forth  from  99.9  to  100.1  MHz  every  half-cycle  of  m(t)y  as 
shown  in  Fig.  5.4d. 


This  indirect  method  of  sketching  PM  [using  m(t)  to  frequency-modulate  a  carrier]  works 
as  long  as  m(t)  is  a  continuous  signal.  If  m{t)  is  discontinuous,  it  means  that  the  PM  sig¬ 
nal  has  sudden  phase  changes  and,  hence,  m(t)  contains  impulses.  This  indirect  method 
fails  at  points  of  the  discontinuity.  In  such  a  case,  a  direct  approach  should  be  used  at  the 
point  of  discontinuity  to  specify  the  sudden  phase  changes.  This  is  demonstrated  in  the  next 
example. 


Sample  5.2  Sketch  FM  and  PM  waves  for  the  digital  modulating  signal  m(t)  shown  in  Fig.  5.5a.  The 
constants  kf  and  kp  are  2n  x  105  and  n/ 2,  respectively,  and/c  =  100  MHz. 


For  FM: 


Because  m(t)  switches  from  1  to  -1  and  vice  versa,  the  FM  wave  frequency  switches 
back  and  forth  between  99.9  and  1 00. 1  MHz,  as  shown  in  Fig.  5.5b.  This  scheme  of  carrier 
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Figure  5.5 

FM  and  PM 
waveforms. 


(b) 


(d) 


frequency  modulation  by  a  digital  signal  (Fig.  5.5b)  is  called  frequency  shift  keying 
(FSK)  because  information  digits  are  transmitted  by  keying  different  frequencies  (see 
Sec.  7.8). 

For  PM: 


fi=fc  +  ~m(t)  =  108  +  \m{t) 

Zn  4 

The  derivative  m(t )  (Fig.  5.5c)  is  zero  except  at  points  of  discontinuity  of  m(t)  where 
impulses  of  strength  ±2  are  present.  This  means  that  the  frequency  of  the  PM  signal  stays 
the  same  except  at  these  isolated  points  of  time!  It  is  not  immediately  apparent  how  an 
instantaneous  frequency  can  be  changed  by  an  infinite  amount  and  then  changed  back  to 
the  original  frequency  in  zero  time.  Let  us  consider  the  direct  approach: 


<Ppm(')  =  A  cos  [coct  -f  kpm(t)] 

=  A  cos  [wc/  +  ^m(t) j 

_  |  A  sin  (oct  when  m(t)  =  —  1 
|  —A  sin  coct  when  m(t)  —  1 


i  ms  m  wave  is  shown  in  Fig.  5.54  This  scheme  of  carrier  PM  by  a  digital  signal  is 
CJ  phase  shl  t  kevmS  (pSK)  because  information  digits  are  transmitted  by  shift- 
mg  the  carrier  phase.  Note  that  PSK  may  also  be  viewed  as  a  DSB-SC  modulation 
by  m(t). 

Tht  PM  wa\e^)pM(/)  in  this  case  has  phase  discontinuities  at  instants  where  impulses 
nhlsl  lS,  ,nStantS’ thC  Canier  phase  shifts  by  *  instantaneously.  A  finite 

aurees  with  n  .impbes  ‘nbn‘te  instantaneous  frequency  at  these  instants.  This 

agrees  with  our  observation  about  m(t).  J 

i,  1  If  °f  l*T  dl'CrmlmlJ"-v  in  mi  (»  at  the  instant  where  minis  discontinuous 

“i:  tirvzzr  of  ■.«  a, imaa.  ■„  .he  r*** 

the  otose  dL Z  I  f  "  g“  by  2  <fr°m  the  discontinuity.  Hence. 

:fh>,  rlt,  y  WM(,)  ,s  ***  -  W2)  X  2  =  rr  rad,  which  confirm*  on. 
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When  m(t)  is  a  digital  signal  (as  in  Fig.  5.5a),  ^pm(0  shows  a  phase  discontinuity 
where  m(t)  has  a  jump  discontinuity.  We  shall  now  show  that  to  avoid  ambiguity  in 
demodulation,  in  such  a  case,  the  phase  deviation  kpm{t)  must  be  restricted  to  a  range 
(—n,  tt).  For  example,  if  kp  were  3jt/2  in  the  present  example,  then 


In  this  case  —  A  sin  coct  when  m(t)  —  1  or  —1/3.  This  will  certainly  cause 

ambiguity  at  the  receiver  when  A  sin  coct  is  received.  Specifically,  the  receiver  cannot 
decide  the  exact  value  of  m(t).  Such  ambiguity  never  arises  if  kpm(t)  is  restricted  to  the 
range  ( -n ,  n). 


What  causes  this  ambiguity?  When  m(t )  has  jump  discontinuities,  the  phase  of  <pVM(t) 
changes  instantaneously.  Because  a  phase  <p()  +  2nn  is  indistinguishable  from  the  phase  <pa , 
ambiguities  will  be  inherent  in  the  demodulator  unless  the  phase  variations  are  limited  to  the 
range  (- n ,  tt).  This  means  kp  should  be  small  enough  to  restrict  the  phase  change  kpm(t)  to 
the  range  (— jt,  tt). 

No  such  restriction  on  kp  is  required  if  m(t)  is  continuous.  In  this  case  the  phase  change  is 
not  instantaneous,  but  gradual  over  time,  and  a  phase  <p„  +  2mx  will  exhibit  n  additional  carrier 
cycles  in  the  case  of  phase  of  only  (p„.  We  can  detect  the  PM  wave  by  using  an  FM  demodulator 
followed  by  an  integrator  (see  Prob.  5.4-1).  The  additional  n  cycles  will  be  detected  by  the 
FM  demodulator,  and  the  subsequent  integration  will  yield  a  phase  2 rnr.  Hence,  the  phases  <pa 
and  (p0  -f  2rm  can  be  detected  without  ambiguity.  This  conclusion  can  also  be  verified  from 
Example  5.1,  where  the  maximum  phase  change  A <p  =  IOjt. 

Because  a  band-limited  signal  cannot  have  jump  discontinuities,  we  can  also  say  that  when 
m(t)  is  band-limited,  kp  has  no  restrictions. 


5.2  BANDWIDTH  OF  ANGLE-MODULATED  WAVES 


Unlike  AM,  angle  modulation  is  nonlinear  and  no  properties  of  Fourier  transform  can  be 
directly  applied  for  its  bandwidth  analysis.  To  determine  the  bandwidth  of  an  FM  wave,  let  us 
define 


(5.7) 


and  define 


^(r)  =  A  ei&'t+kfaO)]  = 


(5.8a) 


such  that  its  relationship  to  the  FM  signal  is 

?FM(')  =  Re[vw(r)] 

Expanding  the  exponential  e>kfaU)  of  Eq.  (5.8a)  in  power  series  yields 


(5.8b) 


<prM(t)=A  1  +jk/a(t)  —  —u  (r)  -I - \-j—an(t)-\ -  ei<Uct  (5.9a) 
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and 


=  A 


k1  k 3 

cos  (oct  —  kfa(t)  sin  a)ct  —  -^a (/)  cos  a)ct  +  —a  ( t )  sin  H - 


(5.9b) 


The  modulated  wave  consists  of  an  unmodulated  carrier  plus  various  amplitude-modulated 
terms,  such  as  a(t)  sin  a>ct ,  cr(t)  cos  a)ct,  a3(/)sin  cucf, —  The  signal  a(f)  is  an  integral 
of  m(t).  If  M  (f)  is  band-limited  to  B ,  A(f)  is  also  band-limited*  to  B.  The  spectrum  of 
a2(t)  is  simply  A(f)  *  A(f)  and  is  band-limited  to  2 B.  Similarly,  the  spectrum  of  an(t)  is 
band-limited  to  nB.  Hence,  the  spectrum  consists  of  an  unmodulated  carrier  plus  spectra  of 
a(t),  r/2 (f),  ...»  an(t),  . . . ,  centered  at  coc.  Clearly,  the  modulated  wave  is  not  band-limited. 
It  has  an  infinite  bandwidth  and  is  not  related  to  the  modulating-signal  spectrum  in  any  simple 
way,  as  was  the  case  in  AM. 

Although  the  bandwidth  of  an  FM  wave  is  theoretically  infinite,  for  practical  signals  with 
bounded  \kfa(t)\  will  remain  finite.  Because  n\  increases  much  faster  than  \kfd(t)\n,  we 
have 


k"an(t) 

- : —  —  0  for  large  n 

n\ 

Hence,  we  shall  see  that  most  of  the  modulated-signal  power  resides  in  a  finite  bandwidth. 
This  is  the  principal  foundation  of  the  bandwidth  analysis  for  angle-modulations.  There  are 
two  distinct  possibilities  in  terms  of  bandwidths— narrowband  FM  and  wideband  FM. 

Narrowband  Angle  Modulation  Approximation 

Unlike  AM,  angle  modulations  are  nonlinear.  The  nonlinear  relationship  between  a(t)  and  <p{t) 
is  evident  from  the  terms  involving  a'\t)  in  Eq.  (5.9b).  When  kf  is  very  small  such  that 

\k/a(t)\  «  1 

then  all  higher  order  terms  in  Eq.  (5.9b)  are  negligible  except  for  the  first  two.  We  then  have 
a  good  approximation 
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^fm(0  ^  ^  [cos  o)ct  -  kfa(t)  sin  coct\  (5.10) 

This  approximation  is  a  linear  modulation  that  has  an  expression  similar  to  that  of  the  AM 
signal  with  message  signal  a(t).  Because  the  bandwidth  of  a(t)  is  B  Hz,  the  bandwidth  of 
™'!  Eqc(5  !0)  1S  28  Hz  accord'ng  to  the  frequency-shifting  property  due  to  the  term 

FM  tNBFM^  ‘  M  ?T  FM  Si8nal  f°r  the  Case  of  M(')l  «  I  is  called  narrowband 

FM  (NBFM).  Similarly,  the  narrowband  PM  (NBPM)  signal  is  approximated  by 

^pm(')  %  A  [cos  coct  -  kpm(t)  sin  (oct ]  (5. ID 

NBPM  also  has  the  approximate  bandwidth  of  2 B. 


*  This  is  because 
Hence,  if  M{f)  is 


band-limited  to be  band-UmS'to  7^'  ‘hr°USh  ‘  fUnCti°n 
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A  comparison  of  NBFM  [Eq.  (5. 1 0)]  with  AM  [Eq.  (5.9a)]  brings  out  clearly  the  similarities 
and  differences  between  the  two  types  of  modulation.  Both  have  the  same  modulated  bandwidth 
IB.  The  sideband  spectrum  for  FM  has  a  phase  shift  of  jt/2  with  respect  to  the  carrier,  whereas 
that  of  AM  is  in  phase  with  the  carrier.  It  must  be  remembered,  however,  that  despite  the 
apparent  similarities,  the  AM  and  FM  signals  have  very  different  waveforms.  In  an  AM  signal, 
the  oscillation  frequency  is  constant  and  the  amplitude  varies  with  time,  whereas  in  an  FM 
signal,  the  amplitude  stays  constant  and  the  frequency  varies  with  time. 


Wideband  FM  (WBFM)  Bandwidth  Analysis:  The  Fallacy  Exposed 

Note  that  an  FM  signal  is  meaningful  only  if  its  frequency  deviation  is  large  enough.  In  other 
words,  practical  FM  chooses  the  constant  kf  large  enough  that  the  condition  \kfa(t)\  «.  1 
is  not  satisfied.  We  call  FM  signals  in  such  cases  wideband  FM  (WBFM).  Thus,  in  ana¬ 
lyzing  the  bandwidth  of  WBFM,  we  cannot  ignore  all  the  higher  order  terms  in  Eq.  (5.9b). 
To  begin,  we  shall  take  here  the  route  of  the  pioneers,  who  by  their  intuitively  simple  rea¬ 
soning  came  to  grief  in  estimating  the  FM  bandwidth.  If  we  could  discover  the  fallacy  in 
their  reasoning,  we  would  have  a  chance  of  obtaining  a  better  estimate  of  the  (wideband)  FM 
bandwidth. 

Consider  a  low-pass  m(t)  with  bandwidth  B  Hz.  This  signal  is  well  approximated  by  a 
staircase  signal  w(r),  as  shown  in  Fig.  5.6a.  The  signal  m(t)  is  now  approximated  by  pulses  of 
constant  amplitude.  For  convenience,  each  of  these  pulses  will  be  called  a  “cell.”  To  ensure 


(c) 
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that  m(t)  has  all  the  information  of  m(t),  the  cell  width  in  m(t)  must  be  no  greater  than  the 
Nyquist  interval  of  1  /2B  second  according  to  the  sampling  theorem  (Chapter  6). 

It  is  relatively  easier  to  analyze  FM  corresponding  to  m(t)  because  its  constant  amplitude 
pulses  (cells)  of  width  T  =  1/2 B  second.  Consider  a  typical  cell  starting  at  t  =  t^.  This  cell 
has  a  constant  amplitude  m(f*).  Hence,  the  FM  signal  corresponding  to  this  cell  is  a  sinusoid  of 
frequency  a>c  +  ik/m(/*)  and  duration  T  =  1  /2 B,  as  shown  in  Fig.  5.6b.  The  FM  signal  for  m(t) 
consists  of  a  sequence  of  such  constant  frequency  sinusoidal  pulses  of  duration  T  =  1  /2 B 
corresponding  to  various  cells  of  rh(t).  The  FM  spectrum  for  m(t)  consists  of  the  sum  of 
the  Fourier  transforms  of  these  sinusoidal  pulses  corresponding  to  all  the  cells.  The  Fourier 
transform  of  a  sinusoidal  pulse  in  Fig.  5.6b  (corresponding  to  the  &th  cell)  is  a  sine  function 
shown  shaded  in  Fig.  5.6c  (see  Example  3.12,  Fig.  3.22d  with  T  =  1/2 B). 


(i>  —  coc  -  kfm(tk) 


ol>  +  (dc  +  kfm(tk) 
4 B 


Note  that  the  spectrum  of  this  pulse  is  spread  out  on  either  side  of  its  center  frequency  o)c  + 
k/m(tk )  by  4  n  B  as  the  main  lobe  of  the  sine  function.  Figure  5.6c  shows  the  spectra  of  sinusoidal 
pulses  corresponding  to  various  cells.  The  minimum  and  the  maximum  amplitudes  of  the  cells 
are  —ttip  and  mp ,  respectively.  Hence,  the  minimum  and  maximum  center  frequencies  of 
the  short  sinusoidal  pulses  corresponding  to  the  FM  signal  for  all  the  cells  are  coc  —  kfitip 
and  coc  4-  kftnp,  respectively.  Consider  the  sine  main  lobe  of  these  frequency  responses  as 
significant  contribution  to  the  FM  bandwidth,  as  shown  in  Fig.  5.6c.  Hence,  the  maximum  and 
the  minimum  significant  frequencies  in  this  spectrum  are (Dc+kfmp+A7TB  anda>c-kfmp-47tB, 
respectively.  The  FM  spectrum  bandwidth  is  approximately 


We  can  now  understand  the  fallacy  in  the  reasoning  of  the  pioneers.  The  maximum  and 
minimum  carrier  frequencies  are  a)c+kfmp  and  c oc—kfmp ,  respectively.  Hence,  it  was  reasoned 
t  at  the  spectral  components  must  also  lie  in  this  range,  resulting  in  the  FM  bandwidth  of  2 kfmp. 
The  implicit  assumption  was  that  a  sinusoid  of  frequency  ro  has  its  entire  roncentrated 


A f  =  kf"^x  ~  mmm  _kmr 
2-2*  J~kf2 n 

The  estimated  FM  bandwidth  (in  hertz)  can  then  be  expressed  as 


fflmax  ~  Wmjn 
2  •  2  K 


expressed  as 


^fm  -  2(A/  +  2 B) 


(5.12) 


which  is  considemhlv  u  -  “FP'oximation  ot  m(t),  not  the  actual 

considerably  smoother.  Hence,  ,he  actual  FM  bandwidth  is  somewhat  smaller  llt»« 
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this  value.  Based  on  Fig.  5.6c,  it  is  clear  that  a  better  FM  bandwidth  approximation  is  between 


[2A/,  2A/  +  4B] 


Therefore,  we  should  readjust  our  bandwidth  estimation.  To  make  this  midcourse  correction, 
we  observe  that  for  the  case  of  NBFM,  kf  is  very  small.  Hence,  given  a  fixed  mp.  A/  is 
very  small  (in  comparison  to  B)  for  NBFM.  In  this  case,  we  can  ignore  the  small  A /  term  in 
Eq.  (5.12)  with  the  result 


#fm  %  4 B 


But  we  showed  earlier  that  for  narrowband,  the  FM  bandwidth  is  approximately  2 B  Hz.  This 
indicates  that  a  better  bandwidth  estimate  is 


(5.13) 


This  is  precisely  the  result  obtained  by  Carson,1  who  investigated  this  problem  rigorously 
for  tone  modulation  [sinusoidal  m{t)].  This  formula  goes  under  the  name  Carson’s  rule 
in  the  literature.  Observe  that  for  a  truly  wideband  case,  where  A Eq.  (5.13)  can  be 
approximated  as 


Bm  *  2  A/  A f  »  B 


(5.14) 


Because  A a>  =  kfmp,  this  formula  is  precisely  what  the  pioneers  had  used  for  FM  bandwidth. 
The  only  mistake  was  in  thinking  that  this  formula  will  hold  for  all  cases,  especially  for  the 
narrowband  case,  where  A f  <gc  B. 

We  define  a  deviation  ratio  P  as 


(5.15) 


Carson’s  rule  can  be  expressed  in  terms  of  the  deviation  ratio  as 


Bfm  =25(0+  1) 


(5.16) 


The  deviation  ratio  controls  the  amount  of  modulation  and,  consequently,  plays  a  role 
similar  to  the  modulation  index  in  AM.  Indeed,  for  the  special  case  of  tone-modulated  FM,  the 
deviation  ratio  p  is  called  the  modulation  index. 

Phase  Modulation 

All  the  results  derived  for  FM  can  be  directly  applied  to  PM.  Thus,  for  PM,  the  instantaneous 
frequency  is  given  by 


ojj  =  coc  +  kpm(t) 


Therefore,  the  peak  frequency  deviation  A /  is  given  by 

A,  ,  [«(')]  max  [w(f)min] 

4 f  =  kP 


2  ■  2n 


(5.17a) 
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It'  we  assume  that 


nip  —  [^KOlrnax  —  |ftt(f)minl 


(5.17b) 


then 


(5.17c) 


Therefore,* 


Bpm  =  2(A f  +  B) 


(5.18a) 


=  2 


(5.18b) 


One  very  interesting  aspect  of  FM  is  that  Aco  =  kftnp  depends  only  on  the  peak  value  of  rn(t). 
It  is  independent  of  the  spectrum  of  m(t).  On  the  other  hand,  in  PM,  Aco  =  kpmp  depends  on  the 
peak  value  of  m(t).  But  m(t )  depends  strongly  on  the  spectral  composition  of  m(t).  The  presence 
of  higher  trequency  components  in  m(r)  implies  rapid  time  variations,  resulting  in  a  higher 
value  ol  rhp.  Conversely,  predominance  of  lower  frequency  components  will  result  in  a  lower 
value  of  mp.  Hence,  whereas  the  FM  signal  bandwidth  [Eq.  (5.13)]  is  practically  independent 
of  the  spectral  shape  of  m(f),  the  PM  signal  bandwidth  [Eq.  (5. 1 8)]  is  strongly  affected  by  the 
spectral  shape  of  wi (r).  For  rti(t)  with  a  spectrum  concentrated  at  lower  frequencies.  /ip\i  will 
be  smaller  than  when  the  spectrum  of  m(t)  is  concentrated  at  higher  frequencies. 

Spectral  Analysis  of  Tone  Frequency  Modulation 

Foi  an  1  M  earner  with  a  generic  message  signal  the  spectral  analysis  requires  the  use  of 
staircase  signal  approximation.  Tone  modulation  is  a  special  case  for  which  a  precise  spectral 
analysis  is  possible,  that  is,  when  m(t)  is  a  sinusoid.  We  use  this  special  case  to  verify  the  FM 
bandwidth  approximation.  Let 


m(r)  =  a  cos  ojmt 

From  Eq.  (5.7),  with  the  assumption  that  initially  a(-oo)  =  0,  we  have 

(X 

a(0  =  —  sin  co,„t 


Thus,  from  Eq.  (5.8a),  we  have 


VfmU)  =  Ae^a>ct+kfa/to" ’  sin  (omt) 


Moreover 


Aw  =  kfmp  =  akf 


*  Equation  (5. 1 7a)  can  be  applied  only  if 
derivative  does  not  exist.  In  such  a  case. 
tPpM  W  anJ  'hen  determine  Aw  from  v,,M 
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and  the  bandwidth  ofm(t)  is  2 nB  =  com  rad/s.  The  deviation  ratio  (or  in  this  case,  the  modulation 


index)  is 


A/  _  Am  _  akf 
B  2  nB  a>„, 


Hence, 


<pm  (/)  =  Aei{a>c,+Psin  Wmt) 
=  Ae^a>cl(e'^sin  a>ml) 


(5.19) 


Note  that  ^ sm  is  a  periodic  signal  with  period  2n/com  and  can  be  expanded  by  the 
exponential  Fourier  series,  as  usual. 


where 


The  integral  on  the  right-hand  side  cannot  be  evaluated  in  a  closed  form  but  must  be  integrated 
by  expanding  the  integrand  in  infinite  series.  This  integral  has  been  extensively  tabulated  and 
is  denoted  by  the  Bessel  function  of  the  first  kind  and  the  nth  order.  These  functions  are 
plotted  in  Fig.  5.7a  as  a  function  of  n  for  various  values  of  Thus, 


OO 


(5.20) 


Substituting  Eq.  (5.20)  into  Eq.  (5.19),  we  get 


OO 


¥>fmU)  =  A  J2  Jn(P)<J<c°c'+nMm,) 


and 


OO 


The  tone-modulated  FM  signal  has  a  carrier  component  and  an  infinite  number  of  sidebands 
of  frequencies  cdc  ±  com,  coc  ±  2 com, . . . ,  coc±  na)m, . . . ,  as  shown  in  Fig.  5.7b.  This  is  in  stark 
contrast  to  the  DSB-SC  spectrum  of  only  one  sideband  on  either  side  of  the  carrier  frequency. 
The  strength  of  the  /7th  sideband  at  co  =  Q)c+na)m  is*  Jn(/3).  From  the  plots  of  Jn(/3)  in  Fig.  5.7a, 
it  can  be  seen  that  for  a  given  /?,  Jn(fi)  decreases  with  /7,  and  there  are  only  a  finite  number 


*  Also  J-n{p)  =  (—  1  )nJn(P )•  Hence,  the  magnitude  of  the  LSB  at  co  =  coc  -  ncom  is  the  same  as  that  of  the  USB  at 
co  =  coc  -f  ncom. 
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Figure  5.7 
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Example  5.C 


of  significant  sideband  spectral  lines.  It  can  be  seen  from  Fig.  5.7a  that  is  negligible  for 
n  >  f}+  1 .  Hence,  the  number  of  significant  sideband  impulses  is  f)  +  1 .  The  bandwidth  of  the 
FM  carrier  is  given  by 


sfm  =  2(/3  +  1  )fm 
=  2(A f+B) 

which  corroborates  our  previous  result  [Eqs.  (5.13)].  When  fi  «;  1  (NBFM),  there  is  only  one 
significant  sideband  and  the  bandwidth  BFM  =  2/m  =  2 B.  It  is  important  to  note  that  this  tone 
modulation  case  analysis  is  a  verification,  not  a  proof,  of  Carson’s  formula. 

In  the  literature,  tone  modulation  in  FM  is  often  discussed  in  great  detail.  Since,  however, 
angle  modulation  is  a  nonlinear  modulation,  the  results  derived  for  tone  modulation  may 
have  little  connection  to  practical  situations.  Indeed,  these  results  are  meaningless  at  best  and 
misleading  at  worst  when  generalized  to  practical  signals.*  As  authors  and  instructors,  we  feel 
that  too  much  emphasis  on  tone  modulation  can  be  misleading.  For  this  reason  we  have  omitted 
further  such  discussion  here. 

The  method  for  finding  the  spectrum  of  a  tone-modulated  FM  wave  can  be  used  for  finding 
the  spectrum  of  an  FM  wave  when  m(t)  is  a  general  periodic  signal.  In  this  case, 


<Pm(t)  =  Aei<°c,[eikfa^)] 
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Because  a(t)  is  a  periodic  signal,  e>kfaM  is  also  a  periodic  signal,  which  can  be  expressed  as  an 
exponential  Fourier  series  in  the  preceding  expression.  After  this,  it  is  relatively  straightforward 
to  write  (t)  in  terms  of  the  carrier  and  the  sidebands. 


(a)  Estimate  fiFM  and  fiPM  for  the  modulating  signal  m(t)  in  Fig.  5.4a  for  kf  =  In  x  105 
and  kp  5jt.  Assume  the  essential  bandwidth  of  the  periodic  m(t)  as  the  frequency  of 
its  third  harmonic. 

(b)  Repeat  the  problem  if  the  amplitude  of  m(t)  is  doubled  [if  m(t)  is  multiplied  by  2]. 


1(a)  The  peak  amplitude  of  m(t)  is  unity.  Hence,  mp  =  1.  We  now  determine  the 
essential  bandwidth  B  of  m(t).  It  is  left  as  an  exercise  for  the  reader  to  show  that  the 
Fourier  series  for  this  periodic  signal  is  given  by 


n 


where 


0  n  even 


It  can  be  seen  that  the  harmonic  amplitudes  decrease  rapidly  with  n.  The  third  harmonic 
is  only  1 1%  of  the  fundamental,  and  the  fifth  harmonic  is  only  4%  of  the  fundamental. 
This  means  the  third  and  fifth  harmonic  powers  are  1.21  and  0.16%,  respectively,  of 
the  fundamental  component  power.  Hence,  we  are  justified  in  assuming  the  essential 
bandwidth  of  m(t)  as  the  frequency  of  its  third  harmonic,  that  is. 


B  =  3  x  ——  =  15  kHz 
2 


For  FM: 


A f  =  —kftnp  =  —(2 71  x  105)(1)  =  100 
2tt  2n 


and 


Bfu  =  2(A f  +  B)  =  230  kHz 


Alternatively,  the  deviation  ratio  fi  is  given  by 


and 


/100  \ 

£fm  =  2£(£+  1)  =  30  l  —  +  1  1  =  230kHz 
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For  PM:  The  peak  amplitude  of  m(t)  is  20,000  and 


A/  =  — kpmp  =  50  kHz 
2tt 


Hence, 

fipM  =  2(A/  +  S)  =  130  kHz 
Alternately,  the  deviation  ratio  fi  is  given  by 


_  A/  _  50 
~B  ~  15 


and 


Bpm  =  2«(/J+1)  =  30^+  l)  =  130  kHz. 

(b)  Doubling  m(t)  doubles  its  peak  value.  Hence,  mp  =  2.  But  its  bandwidth  is  unchanged 
so  that  B  =  15  kHz. 

For  FM: 

A/  =  2. kfmp  =  2-(2tt  x  105)(2)  =  200  kHz 

171  2  7T 

and 


flFM  =  2(A/  +  fl)  =  430  kHz 
Alternately,  the  deviation  ratio  ft  is  given  by 


a/;  =  200 

P  B  15 


and 


Bm  =  2B(p+\)  =  30^~  +  \^j  =  430kHz 


For  PM:  Doubling  m(f)  doubles  its  derivative  so  that  now  mp  =  40,000,  and 

A f  =  ~ kpmp  =  100  kHz 
and 


fipM  =  2(A/  +  fi)  =  230kHz 
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Alternately,  the  deviation  ratio  ft  is  given  by 

a/;=  100 

p  B  15 
and 

flpM  =  2B(f)  +  1)  =  30  +  l  j  =  230kHz 

Observe  that  doubling  the  signal  amplitude  [doubling  m(t )]  roughly  doubles  frequency 
deviation  A f  of  both  FM  and  PM  waveforms. 


Example  5.4  Repeat  Example  5.1  if  m{t)  is  time-expanded  by  a  factor  of  2:  that  is,  if  the  period  of  m(t)  is 
4  x  10-4. 

Recall  that  expansion  of  a  signal  by  a  factor  of  2  reduces  the  signal  spectral  width 
(bandwidth)  a  factor  of  2.  We  can  verify  this  by  observing  that  the  fundamental  fre¬ 
quency  is  2.5  kHz,  and  its  third  harmonic  is  7.5  kHz.  Hence,  B  =  7.5  kHz,  which  is 
half  the  previous  bandwidth.  Moreover,  time  expansion  does  not  affect  the  peak  amplitude 
and  thus  tnp  =  1 .  However,  mp  is  halved,  that  is,  mp  =  1 0, 000. 

For  FM: 

A f  =  -^-kfmp  =  100  kHz 
2n 

fiFM  =  2(A/  +  B)  =  2(100  +  7.5)  =  215  kHz 

For  PM: 


A f  =  kpinp  =  25  kHz 

2  7T 

flPM  =  2(A f  +  B)  —  65  kHz 

Note  that  time  expansion  of  m(t)  has  very  little  effect  on  the  FM  bandwidth,  but  it  halves  the 
PM  bandwidth.  This  verifies  our  observation  that  the  PM  spectrum  is  strongly  dependent 
on  the  spectrum  of  m(t). 


Sample  5.5 


An  angle-modulated  signal  with  carrier  frequency  coc  = 
<pEM  (t)  =  10  cos  (coct  +  5  sin  3000/ 


2tt  x 

+  10 


105  is  described  by  the  equation 
sin  2000;r/) 


(a)  Find  the  power  of  the  modulated  signal. 

(b)  Find  the  frequency  deviation  A/. 

(c)  Find  the  deviation  ratio  /3. 
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(d)  Find  the  phase  deviation  A </>. 

(e)  Estimate  the  bandwidth  of  <fHM  (/). 


Edwin  H. 

Armstrong. 

(Reproduced 


The  signal  bandwidth  is  the  highest  frequency  in  m(l)  (or  its  derivative).  In  this  case 
B  =  2000jr/27r  =  1000  Hz. 


with  permission 
from  Armstrong 
Family  Archives.) 


(a)  The  carrier  amplitude  is  10,  and  the  power  is 


(b)  To  find  the  frequency  deviation  A/,  we  find  the  instantaneous  frequency  cu„ 
given  by 


d 

(Oi  =  —6>(/)  =  o)c  +  15,000  cos  3000/  +  20,  000tt  cos  2000tt/ 


The  carrier  deviation  is  15,000  cos  3000/  +  20, 0007T  cos  20007r/.The  two  sinusoids  will 
add  in  phase  at  some  point,  and  the  maximum  value  of  this  expression  is  1 5, 000+20, 000 ;r. 
This  is  the  maximum  carrier  deviation  Aw.  Hence, 


A/  =  —  =  12,387.32  Hz 


(d)  The  angle  9c(t)  =  (ot  +  (5  sin  3000/  +  10  sin  2000 jr/).  The  phase  deviation  is 
the  maximum  value  of  the  angle  inside  the  parentheses,  and  is  given  by  A</>  =  15  rad. 

(e)  fiEM  =  2(A/  +  B)  =  26,774.65  Hz 

Observe  the  generality  of  this  method  of  estimating  the  bandwidth  of  an  angle-modulated 
waveform.  We  need  not  know  whether  it  is  FM,  PM,  or  some  other  kind  of  angle 
modulation.  It  is  applicable  to  any  angle-modulated  signal 


A  Historical  Note:  Edwin  H.  Armstrong  (ISQO-io^dt 
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Edwin  H. 
Armstrong. 
(Reproduced 
with  permission 
from  Armstrong 
Family  Archives.) 


“In  fact,  as  more  and  more  schemes  are  analyzed  and  tested,  and  as  the  essential  nature  of 
the  problem  is  more  clearly  perceivable,  we  are  unavoidably  forced  to  the  conclusion  that 
static  (noise),  like  the  poor,  will  always  be  with  us.”2  The  opinion  of  one  of  the  most  able 
mathematicians  of  the  day  in  the  communication  field,  thus,  set  back  the  development  of  FM 
by  more  than  a  decade.  The  noise-suppressing  advantage  of  FM  was  later  proved  by  Major 
Edwin  H.  Armstrong,3  a  brilliant  engineer  whose  contributions  to  the  field  of  radio  systems 
are  comparable  to  those  of  Hertz  and  Marconi.  It  was  largely  the  work  of  Armstrong  that  was 
responsible  for  rekindling  the  interest  in  FM. 

Although  Armstrong  did  not  invent  the  concept,  he  has  been  considered  the  father  of 
modem  FM.  Born  on  December  18,  1890,  in  New  York  City,  Edwin  H.  Armstrong  is  widely 
regarded  as  one  of  the  foremost  contributors  to  radio  electronics  of  the  twentieth  century. 
Armstrong  was  credited  with  the  invention  of  the  regenerative  circuit  (U.S.  Patent  1,1 13,149 
issued  in  1912,  while  he  was  a  junior  at  Columbia  University),  the  superheterodyne  circuit 
(U.S.  Patent  1,342,885  issued  in  1918,  while  serving  in  the  U.S.  Army  stationed  in  Paris,  during 
World  War  I),  the  super-regenerative  circuit  (U.S.  Patent  1,424,065,  issued  in  1922),  and  the 
complete  FM  radio  broadcasting  system  (U.S.  Patent  1,941,066,  1933).  All  are  breakthrough 
contributions  to  the  radio  field.  Fortune  magazine  in  1 939  declared:  Wideband  frequency  mod¬ 
ulation  is  the  fourth,  and  perhaps  the  greatest,  in  a  line  of  Armstrong  inventions  that  have  made 
most  of  modern  broadcasting  what  it  is.  Major  Armstrong  is  the  acknowledged  inventor  of  the 
regenerative  ‘feedback’  circuit,  which  brought  radio  art  out  of  the  crystal-detector  headphone 
stage  and  made  the  amplification  of  broadcasting  possible;  the  superheterodyne  circuit,  which 
is  the  basis  of  practically  all  modem  radio;  and  the  super-regenerative  circuit  now  in  wide  use 
in  . . .  shortwave  systems.4 

Armstrong  was  the  last  of  the  breed  of  the  lone  attic  inventors.  After  receiving  his  FM 
patents  in  1933,  he  gave  his  now  famous  paper  (which  later  appeared  in  print  as  in  the  pro¬ 
ceedings  of  the  IRE3),  accompanied  by  the  first  public  demonstration  of  FM  broadcasting 
on  November  5,  1935,  at  the  New  York  section  meeting  of  the  Institute  of  Radio  Engineers 
(IRE,  a  predecessor  of  the  IEEE).  His  success  in  dramatically  reducing  static  noise  using  FM 
was  not  fully  embraced  by  the  broadcast  establishment,  which  perceived  FM  as  a  threat  to  its 
vast  commercial  investment  in  AM  radio.  To  establish  FM  broadcasting,  Armstrong  fought  a 
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long  and  costly  battle  with  the  radio  broadcast  establishment,  which,  abetted  by  the  Federal 
Communications  Commission  (FCC),  fought  tooth  and  nail  to  resist  FM.  Still,  by  December 
1941 , 67  commercial  FM  stations  had  been  authorized  with  as  many  as  half  a  million  receivers 
in  use  and  43  applications  were  pending.  In  fact,  the  Radio  Technical  Planning  Board  (RTPB) 
made  its  final  recommendation  during  the  September  1944  FCC  hearing  that  FM  be  given  75 
channels  in  the  band  from  41  to  56  MHz. 

Despite  the  recommendation  of  the  RTPB,  which  was  supposed  to  be  the  best  advice 
available  from  the  radio  engineering  community,  strong  lobbying  for  the  FCC  to  shift  the  FM 
band  persisted,  mainly  by  those  who  propagated  the  concern  that  strong  radio  interferences  in 
the  40  MHz  band  might  be  possible  as  a  result  ionospheric  reflection.  Then  in  June  1945,  the 
FCC,  on  the  basis  of  erroneous  testimony  of  a  technical  expert,  abruptly  shifted  the  allocated 
bandwidth  of  FM  from  the  42-  to  50-MHz  range  to  the  88-  to  108-MHz.  This  dealt  a  crippling 
blow  to  FM  by  making  obsolete  more  than  half  a  million  receivers  and  equipment  (transmitters, 
antennas,  etc.)  that  had  been  built  and  sold  by  the  FM  industry  to  50  FM  stations  since  1941 
for  the  42  to  50  MHz  band.  Armstrong  fought  the  decision,  and  later  succeeded  in  getting  the 
technical  expert  to  admit  his  error.  In  spite  of  all  this,  the  FCC  allocations  remained  unchanged. 
Armstrong  spent  the  sizable  fortune  he  had  made  from  his  inventions  in  legal  struggles.  The 
broadcast  giants,  which  had  so  strongly  resisted  FM,  turned  around  and  used  his  inventions 
without  paying  him  royalties.  Armstrong  spent  much  of  his  time  in  court  in  some  of  the  longest, 
most  notable,  and  acrimonious  patent  suits  of  the  era.*'  In  the  end,  with  his  funds  depleted,  his 
energy  drained,  and  his  family  life  shattered,  a  despondent  Armstrong  committed  suicide:  (in 
1954)  he  walked  out  of  a  window  of  his  thirteenth  floor  apartment  in  New  York’s  River  House. 

Armstrong’s  widow  continued  the  legal  battles  and  won.  By  the  1960s,  FM  was  clearly 
established  as  the  superior  radio  system/'  and  Edwin  H.  Armstrong  was  fully  recognized  as  the 
inventor  of  frequency  modulation.  In  1955  the  ITU  added  him  to  its  roster  of  great  inventors. 
In  1980  Edwin  H.  Armstrong  was  inducted  into  the  U.S.  National  Inventors  Hall  of  Fame,  and 
his  picture  was  put  on  a  U.S.  postage  stamp  in  1983.7 
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5.3  GENERATING  FM  WAVES 

Basically,  there  are  two  ways  ot  generating  FM  waves:  indirect  and  direct.  We  first  describe 

the  narrowband  FM  generator  that  is  utilized  in  the  indirect  FM  generation  of  wideband  angle 
modulation  signals. 

NBFM  Generation 

NBPM,  Sif“ls-  w'  haw  *»™  earlier  ,ha,  because  \kfa( ,)|  «l 
|V«>I  «  I.  respectively,  the  modulated  signals  can  be  approximated  by 


^NBFM  U)  —  ^[COS  (Oct  —  kfCl(t)  sin  COct] 
^nbpm  V)  —  4 1  cos  (0ct  —  kpm(t)  sin  a>ct ] 


(5.21a) 

(5.21b) 


”d  “  '“lar  ,o  the  expression  of  ,he  AM  wave.  In  fad. 
DSB-SC  modulators  The 'hi'  TI'"'  °f  genera,mS  narrowband  FM  and  PM  signals  by  using 
I,  is  ^  ;agT  of  such  systems  appeal  in  Fig.  5*. 

necaus:TrZxiS„T»r(5vr  rr by  f*  5  8b  £ «« 

amplitude  v  ariations  A  nnniin  a  ■  °i' Th  put  of  this  NBFM  modulator  also  has  some 
can  remove  most  of  this  distortion.  ^ '**  eS'gned  l°  hmit  the  amPlitude  of  a  bandpass  signal 
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Bandpass  Limiter 

The  amplitude  variations  of  an  angle-modulated  carrier  can  be  eliminated  by  what  is  known  as 
a  bandpass  limiter,  which  consists  of  a  hard  limiter  followed  by  a  bandpass  filter  (Fig.  5.9a). 
The  input-output  characteristic  of  a  hard  limiter  is  shown  in  Fig.  5.9b.  Observe  that  the  bandpass 
limiter  output  to  a  sinusoid  will  be  a  square  wave  of  unit  amplitude  regardless  of  the  incoming 
sinusoidal  amplitude.  Moreover,  the  zero  crossings  of  the  incoming  sinusoid  are  preserved 
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in  the  output  because  when  the  input  is  zero,  the  output  is  also  zero  (Fig.  5.9b).  Thus  an 
angle-modulated  sinusoidal  input  v/(f)  =  A(t)  cos  0(t)  results  in  a  constant  amplitude,  angle- 
modulated  square  wave  v0(t ),  as  shown  in  Fig.  5.9c.  As  we  have  seen,  such  a  nonlinear  operation 
preserves  the  angle  modulation  information.  When  v0(t)  is  passed  through  a  bandpass  filter 
centered  at  coc ,  the  output  is  a  angle-modulated  wave,  of  constant  amplitude.  To  show  this, 
consider  the  incoming  angle-modulated  wave 


v/(f)  =  A(t)  cos  G(t) 


where 


The  output  v()(t)  of  the  hard  limiter  is  +1  or  —1,  depending  on  whether  v/(r)  =  A(t)  cos  6(t) 
is  positive  or  negative  (Fig.  5.9c).  Because  A(t)  >  0,  va(t)  can  be  expressed  as  a  function  of#: 


Hence,  v()  as  a  function  of  0  is  a  periodic  square  wave  function  with  period  2jt  (Fig.  5.9d), 
which  can  be  expanded  by  a  Fourier  series  (Chapter  2) 


At  any  instant  /,  0  a)ct  +  kf  J  m(a)  da.  Hence,  the  output  v0  as  a  function  of  time  is 


given  by 


filter  output  ea(t)  is  the  desired  angle-modulated 


carrier  with  a  constant  amplitude. 


noise  when  the  noise  is  small.8 
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Indirect  Method  of  Armstrong 

In  Armstrong’s  indirect  method,  NBFM  is  generated  as  shown  in  Fig.  5.8b  |or  Eq.  (5.10)].  The 
NBFM  is  then  converted  to  WBFM  by  using  additional  frequency  multipliers. 

A  frequency  multiplier  can  be  realized  by  a  nonlinear  device  followed  by  a  bandpass  tiler. 
First  consider  a  nonlinear  device  whose  output  signal  y(/)  to  an  input  x (/)  is  given  by 

y(t)  =  a2x2(t) 

If  an  FM  signal  passes  through  this  device,  then  the  output  signal  will  be 

y(r)  =  a2  cos2  j)ct  +  kf  J  m(a )  da j 

=  0.5a2  +  0.5a2  cos  +  2kf  J  m(or)  c/«j  (5.22) 

Thus,  a  bandpass  filter  centered  at  2a)c  would  recover  an  FM  signal  with  twice  the  original 
instantaneous  frequency.  To  generalize,  a  nonlinear  device  may  have  the  characteristic  of 

y(t)  =  ao  +  a\x(t)  +  a2x2(t)  -\ - +  a„.r"(0  (5.23) 


If  x(t )  =  A  cos  \o)ct  +  kf  f  m(a)da\ ,  then  by  using  trigonometric  identities,  we  can  readily 
show  that  y{t)  is  of  the  form 


y(t)  =  c0  +c i  cos  |<y ct  +  kf  J  w(o')<fo'j  +  c2  cos  j^2 coct  +  2 kf  J  m(a )  da rj 
3 - \-c„  cos  |  na)ct  +  nkj  J  m(a)  Jaj 


(5.24) 


Hence,  the  output  will  have  spectra  at  ioc,  2u)c, . . . ,  ncoc,  with  frequency  deviations 
A/,  2 A/, . . . ,  n A/,  respectively.  Each  one  of  these  components  is  an  FM  signal  separated 
from  the  others.  Thus,  a  bandpass  filter  centering  at  ncoc  can  recover  an  FM  signal  whose 
instantaneous  frequency  has  been  multiplied  by  a  factor  of  n.  These  devices,  consisting  of 
nonlinearity  and  bandpass  filters,  are  known  as  frequency  multipliers.  In  fact,  a  frequency 
multiplier  can  increase  both  the  carrier  frequency  and  the  frequency  deviation  by  an  integer//. 
Thus,  if  we  want  a  twelfth-fold  increase  in  the  frequency  deviation,  we  can  use  a  twelfth-order 
nonlinear  device  or  two  second-order  and  one  third-order  devices  in  cascade.  The  output  has 
a  bandpass  filter  centered  at  12 a)c,  so  that  it  selects  only  the  appropriate  term,  whose  carrier 
frequency  as  well  as  the  frequency  deviation  A f  are  12  times  the  original  values. 

This  forms  the  basis  of  the  Armstrong  indirect  frequency  modulator.  First,  generate  an 
NBFM  approximately.  Then  multiply  the  NBFM  frequency  and  limit  its  amplitude  variation. 
Generally,  we  require  to  increase  A f  by  a  very  large  factor  n.  This  increases  the  carrier  fre¬ 
quency  also  by  //.  Such  a  large  increase  in  the  carrier  frequency  may  not  be  needed.  In  this  case 
we  can  apply  frequency  mixing  (see  Example  4.2,  Fig.  4.7)  to  shift  down  the  carrier  frequency 
to  the  desired  value. 

A  simplified  diagram  of  a  commercial  FM  transmitter  using  Armstrong’s  method  is  shown 
in  Fig.  5.10.  The  final  output  is  required  to  have  a  carrier  frequency  of  91.2  MHz  and  A f  =  75 
kHz.  We  begin  with  NBFM  with  a  carrier  frequency  fC]  =  200  kHz  generated  by  a  crystal 
oscillator.  This  frequency  is  chosen  because  it  is  easy  to  construct  stable  crystal  oscillators  as 
well  as  balanced  modulators  at  this  frequency.  To  maintain  p  «;  1,  as  required  in  NBPM,  the 
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Figure  5.10 

Block  diagram  of 
the  Armstrong 
indirect  FM 
transmitter. 


deviation  A/  is  chosen  to  be  25  Hz.  For  tone  modulation,  =  A/ The  baseband  spectrum 
(required  for  high-fidelity  purposes)  ranges  from  50  Hz  to  15  kHz.  The  choice  of  A/  =  25  Hz 
is  reasonable  because  it  gives  =  0.5  for  the  worst  possible  case  (fm  =  50). 

To  achieve  A /  =  75  kHz,  we  need  a  multiplication  of  75,000/25  =  3000.  This  can  be 
done  by  two  multiplier  stages,  of  64  and  48,  as  shown  in  Fig.  5.10,  giving  a  total  multiplication 
of  64  x  48  =  3072,  and  A f  =  76.8  kHz.*  The  multiplication  is  effected  by  using  frequency 
doublers  and  triplers  in  cascade,  as  needed.  Thus,  a  multiplication  of  64  can  be  obtained  by  six 
doublers  in  cascade,  and  a  multiplication  of  48  can  be  obtained  by  four  doublers  and  a  tripler 
in  cascade.  Multiplication  of  fc  =  200  kHz  by  3072,  however,  would  yield  a  final  carrier  of 
about  600  MHz.  This  problem  is  solved  by  using  a  frequency  translation,  or  conversion,  after 
the  first  multiplier  (Fig.  5.10).  The  first  multiplication  by  64  results  in  the  carrier  frequency 
fc2  =  200  kHz  x  64  =  12.8  MHz,  and  the  carrier  deviation  A f2  =  25  x  64  =  1.6  kHz.  We 
now  use  a  frequency  converter  (or  mixer)  with  carrier  frequency  10.9  MHz  to  shift  the  entire 
spectrum.  This  results  in  a  new  carrier  frequency  fC}  =  1 2.8  -  10.9  =  1 .9  MHz.  The  frequency 
converter  shifts  the  entire  spectrum  without  altering  A/.  Hence,  A/3  =  1.6  kHz.  Further 
multiplication,  by  48,  yields/C4  =  1.9  x  48  =  91.2  MHz  and  A/4  =  1.6  x  48  =  76.8  kHz. 

This  scheme  has  an  advantage  of  frequency  stability,  but  it  suffers  from  inherent  noise 
caused  by  excessive  multiplication  and  distortion  at  lower  modulating  frequencies,  where 
A f/fm  is  not  small  enough. 


Example  5.6  Discuss  the  nature  of  distortion  inherent  in  the  Armstrong  indirect  FM  generator. 

Two  kinds  of  distortion  arise  in  this  scheme:  amplitude  distortion  and  frequency  distortion. 
The  NBFM  wave  is  given  by  [Eq.  (5.10)] 

^fm^)  =  ^ [cos  (oct  —  kfa{t)  sin  coct] 

=  AE(t)  cos  [coct  +0(t)] 

where 

£<,)  =  /'  +kfa2(t)  and  6>(r)  =  tan-1[Jfc/a(/)] 


25(75/76.8)  =  24°4!l  ^  'nS,ead  °f?6  8  kHz'  we  musl  red^e  the  narrowband  A f  from  25  Hz  to 
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Amplitude  distortion  occurs  because  the  amplitude  AE(t)  of  the  modulated  waveform  is 
not  constant.  This  is  not  a  serious  problem  because  amplitude  variations  can  be  eliminated 
by  a  bandpass  limiter,  as  discussed  earlier  in  the  section  (see  also  Fig.  5.9).  Ideally,  0(t) 
should  be  kfa(t).  Instead,  the  phase  0(t)  in  the  preceding  equation  is 


9(t)  =  tan  '[fya(r)] 
and  the  instantaneous  frequency  <«,•(/)  is 


coj(t)  =  0{t)  = 


kfajt) 

1  +  kja2(t) 


kftn(t) 

1  +  kja2{t) 


=  kfm(t)[  1  -  kja\t)  +  kfa\t) - ] 

Ideally,  the  instantaneous  frequency  should  be  kfin(t).  The  remaining  terms  in  this  equation 
are  the  distortion. 

Let  us  investigate  the  effect  of  this  distortion  in  tone  modulation  where  m(t)  = 
a  cos  comt ,  a(t)  =  or  sin  c omt/(om ,  and  the  modulation  index  p  =  akf/a>m : 


a)i(t)  =  ficOfti  cos  (omt(  1  —  f}2  sin2  comt  +  0*  sin4  a)mt - ) 


It  is  evident  from  this  equation  that  the  scheme  has  odd-harmonic  distortion,  the  most 
important  term  being  the  third  harmonic.  Ignoring  the  remaining  terms,  this  equation 
becomes 


<w/(0  —  flcom  cos  comt(  1  -  fi2  sin2  comt) 


=  fioj, 


K) 


2X  ,  - 

cos  comt  H - - —  cos  3 d)mt 


desired 


distortion 


The  ratio  of  the  third-harmonic  distortion  to  the  desired  signal  can  be  found  for  the 
generator  in  Fig.  5. 10.  For  the  NBFM  stage. 


PB  =  A/i  =  25  Hz 


Hence,  the  worst  possible  case  occurs  at  the  lower  modulation  frequency.  For  example,  if 
the  tone  frequency  is  only  50  Hz.  then  p  =  0.5.  In  this  case  the  third-harmonic  distortion 
is  1/15,  or  6.67%. 


Direct  Generation 

In  a  voltage-controlled  oscillator  (VCO),  the  frequency  is  controlled  by  an  external  voltage. 
The  oscillation  frequency  varies  linearly  with  the  control  voltage.  We  can  generate  an  FM 
wave  by  using  the  modulating  signal  m(t)  as  a  control  signal.  This  gives 


a)j(t)  =  cjc  +  kjrn(t) 
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One  can  construct  a  VCO  using  an  operational  amplifier  and  a  hysteretic  comparator,9  (such 
as  a  Schmitt  trigger  circuit).  Another  way  of  accomplishing  the  same  goal  is  to  vary  one  of  the 
reactive  parameters  (C  or  L)  of  the  resonant  circuit  of  an  oscillator.  A  reverse-biased  semicon¬ 
ductor  diode  acts  as  a  capacitor  whose  capacitance  varies  with  the  bias  voltage.  The  capacitance 
of  these  diodes,  known  under  several  trade  names  (e.g.,  Varicap,  Varactor,  Voltacap),  can  be 
approximated  as  a  linear  function  of  the  bias  voltage  m(t)  over  a  limited  range.  In  Hartley  or 
Colpitt  oscillators,  for  instance,  the  frequency  of  oscillation  is  given  by 

1 

(*>o  =  ~i= 

Vlc 

If  the  capacitance  C  is  varied  by  the  modulating  signal  m(f),  that  is,  if 

C  =  Co  -  km(t) 

then 

_ 1 _ 

ppm 

«FW 

Here  we  have  applied  the  binomial  approximation 

( 1  +  x)n  %  1  +  tvc 


kmjt) 

Co 


«  1 


M  «  i 


with  n—\/2.  Thus, 


Example  5.7 


Figure  5.11 

Designing  an 
Armstrong 
indirect 
modulator. 


too  =  toc 


i  +  ^l 

2C„  J 


where  coc  = 


s/LCo 


—  toc  +  kfm(t)  with  kr  =  — - 

’  2C0 

Because  C  =  C0  -  km{t),  the  maximum  capacitance  deviation  is 


Hence. 


AC  =  kmp  =  y°mP 

(Dc 


AC  _  2 kfmp  _  2^V 
Co  ojc 


S  k,m*  °f  c»- helps  lin”' 

es  060,11186  of  the  approximation  used  in  this  derivation. 
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We  may  also  generate  direct  FM  by  using  a  saturable  core  reactor,  where  the  inductance 
of  a  coil  is  varied  by  a  current  through  a  second  coil  (also  wound  around  the  same  core).  This 
results  in  a  variable  inductor  whose  inductance  is  proportional  to  the  current  in  the  second  coil. 

Direct  FM  generation  generally  produces  sufficient  frequency  deviation  and  requires  little 
frequency  multiplication.  But  this  method  has  poor  frequency  stability.  In  practice,  feedback 
is  used  to  stabilize  the  frequency.  The  output  frequency  is  compared  with  a  constant  frequency 
generated  by  a  stable  crystal  oscillator.  An  error  signal  (error  in  frequency)  is  detected  and  fed 
back  to  the  oscillator  to  correct  the  error. 

Features  of  Angle  Modulation 

FM  (like  angle  modulation  in  general)  has  a  number  of  unique  features  that  recommend  it 
for  various  radio  systems.  The  transmission  bandwidth  of  AM  systems  cannot  be  changed. 
Because  of  this,  AM  systems  do  not  have  the  feature  of  exchanging  signal  power  for  trans¬ 
mission  bandwidth.  Pulse-coded  modulation  (PCM)  systems  (Chapter  6)  have  such  a  feature, 
and  so  do  angle-modulated  systems.  In  angle  modulation,  the  transmission  bandwidth  can  be 
adjusted  by  adjusting  A/.  It  is  shown  in  Chapter  10  that  for  angle-modulated  systems,  the  SNR 
is  roughly  proportional  to  the  square  of  the  transmission  bandwidth  Bj .  Recall  that  in  PCM, 
the  SNR  varies  exponentially  with  Bj  and  is,  therefore,  superior  to  angle  modulation. 


Example  5.7  Design  an  Armstrong  indirect  FM  modulator  to  generate  an  FM  signal  with  carrier  frequency 
97.3  MHz  and  A /  =  10.24  kHz.  A  NBFM  generator  of/C)  =  20  kHz  and  A/  =  5  Hz  is 
available.  Only  frequency  doublers  can  be  used  as  multipliers.  Additionally,  a  local  oscillator 
(LO)  with  adjustable  frequency  between  400  and  500  kHz  is  readily  available  for  frequency 
mixing. 


Fjgure  5.11 

Designing  an 
Armstrong 
indirect 
modulator. 


The  modulator  is  shown  in  Fig.  5.11.  We  need  to  determine  M j,  A/2,  and/LG.  First,  the 
NBFM  generator  generates 

fcx  =20,000  and  A/i  =  5 


The  final  WBFM  should  have 

fC4  =  97.3  x  106  A/4=  10,240 

We  first  find  the  total  factor  of  frequency  multiplication  needed  as 
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Because  only  frequency  doublers  can  be  used,  we  have  three  equations: 

M\  =  2”' 

M2  =  2"2 
n\  +n2  =  11 


It  is  also  clear  that 


M  =  2  "'/c  and  /C4  =  2"2/,3 

To  find/LO,  there  are  three  possible  relationships: 

fC)  =fc2  ±/L0  and  fC3  =/L0  -/C2 

Each  should  be  tested  to  determine  the  one  that  will  fall  in 

400,000  </L0  <  500,000 

(a)  First,  we  test/,,  =/Q  -/L0.  This  case  leads  to 


97.3  x  106  =  2"2  (2"'/Cl  -/lo) 

=  2"1+"2/.|  -  2"2/lo 
=  2"  20  x  103  -  2"2/lo 


Thus,  we  have 


A o  =  2“"!  (4.096  x  I07  -  9.73  x  I07)  <  0 


This  is  outside  the  local  oscillator  frequency  range. 

(b)  Next,  we  test/,,  =/Q  +/  This  case  leads  to 


'  LO 
^6 


97.3  x  106  =  2”2  (2 ”'/„  +/LO) 


=  2n20x  103  +  2"2/. 


LO 


Thus,  we  have 


/lo=2-"2  (5.634  x  107) 

"  ni(c)  7  If  wic°h7  44/()  kHz;  whlch  1S  within  the  realizable  range  of  the  local  oscillator, 

(c)  If  we  choose/,,  =/L0  -/Q,  ,hen  we  have 


97.3  x  106  =/LQ  _  2"22,,|/c 


Cl 


~  ~'2fto  ~  2u(20  x  103) 
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Thus,  we  have 


/lo  =  2_"2  ( 13.826  x  107) 


No  integer  hi  will  lead  to  a  realizable/L0. 

Thus,  the  final  design  is  M\  =  16.  M2  =  128,  and /L0  =  440  kHz. 


5.4  DEMODULATION  OF  FM  SIGNALS 

The  information  in  an  FM  signal  resides  in  the  instantaneous  frequency  cu,  =  coc  4-  kfm(t). 
Hence,  a  frequency-selective  network  with  a  transfer  function  of  the  form  \H(f)\  =  2anf  -f  b 
over  the  FM  band  would  yield  an  output  proportional  to  the  instantaneous  frequency 
(Fig.  5.12a).*  There  are  several  possible  circuits  with  such  characteristics.  The  simplest  among 
them  is  an  ideal  differentiator  with  the  transfer  function  jlrcf. 


(a)  FM 
demodulate 
frequency 
response. 

(b)  Output  i 
differentiate 
tbe  input  F/ 
wave,  (c)  F 
demodulati 
direct 

differentiati 


local  oscillator. 


d 

Envelope 

A  \utc  +  I 

di 

detector 

(c) 


Provided  the  variations  of  a)j  are  slow  in  comparison  to  the  time  constant  of  the  network. 
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If  we  apply  (pm(t)  to  an  ideal  differentiator,  the  output  is 


—  00 


=  A  [coc  +  kftn(t)\  sin  coct  +  kf  m(a)d(a)  —  tt  (5.26) 


Figure  5.13 

(a)  RC  high-pass 
filter. 

(b)  Segment  of 
positive  slope  in 
amplitude 
response. 


Both  the  amplitude  and  the  frequency  of  the  signal  <pm  (t)  are  modulated  (Fig.  5.12b),  the 
envelope  being  A[coc  +  kjm(t) ].  Because  A co  =  kfinp  <  a>c ,  we  have  coc  +  kfm{t)  >  0  for 
all  t%  and  m(t)  can  be  obtained  by  envelope  detection  of  (pm(t)  (Fig.  5.12c). 

The  amplitude  A  of  the  incoming  FM  carrier  must  be  constant.  If  the  amplitude  A  were 
not  constant,  but  a  function  of  time,  there  would  be  an  additional  term  containing  dA/dt  on  the 
right-hand  side  ot  Eq.  (5.26).  Even  if  this  term  were  neglected,  the  envelope  of  <pFM  (t)  would 
be  A(0[cu(.  +  kfm(t)]f  and  the  envelope-detector  output  would  be  proportional  to  m(t)A(t ), 
still  leading  to  distortions.  Hence,  it  is  essential  to  maintain  A  constant.  Several  factors,  such 
as  channel  noise  and  fading,  cause  A  to  vary.  This  variation  in  A  should  be  suppressed  via  the 
bandpass  limiter  (discussed  earlier  in  Sec.  5.3)  before  the  signal  is  applied  to  the  FM  detector. 

Practical  Frequency  Demodulators 

The  dilterentiator  is  only  one  way  to  convert  frequency  variation  of  FM  signals  into  amplitude 
variation  that  subsequently  can  be  detected  by  means  of  envelope  detectors.  One  can  use 
an  operational  amplifier  differentiator  at  the  FM  receiver.  On  the  other  hand,  the  role  of  the 
differentiator  can  be  replaced  by  any  linear  system  whose  frequency  response  contains  a  linear 
segment  ot  positive  slope.  By  approximating  the  ideal  linear  slope  in  Fig.  5.12(a),  this  method 
is  known  as  slope  detection. 


One  simple  device  would  be  an  RC  high-pass  filter  of  Fig.  5. 1 3.  The  RC  frequency  response 


Figure  5.13 

(a)  RC  high-pass 
filter. 

(b)  Segment  of 
positive  slope  in 
amplitude 
response. 


is  simply 


H(f)  =  J}71^ 

1  +j2nJRC 


^  jin fRC  if  2nfRC  «  1 


(Oc  <0)o  =  — _ 

■JlC 


(a) 


(b) 
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Because  the  operation  is  on  the  slope  of  \H(f)\,  this  method  is  also  called  slope  detection. 
Since,  however,  the  slope  of  \H(f)\  is  linear  over  only  a  small  band,  there  is  considerable 
distortion  in  the  output.  This  fault  can  be  partially  corrected  by  a  balanced  discriminator 
formed  by  two  slope  detectors.  Another  balanced  demodulator,  the  ratio  detector,  also  widely 
used  in  the  past,  offers  better  protection  against  carrier  amplitude  variations  than  does  the 
discriminator.  For  many  years  ratio  detectors  were  standard  in  almost  all  FM  receivers. 10 

Zero-crossing  detectors  are  also  used  because  of  advances  in  digital  integrated  circuits. 
The  first  step  is  to  use  the  amplitude  limiter  of  Fig.  5.9(a)  to  generate  the  rectangular  pulse 
output  of  Fig.  5.9(c).  The  resulting  rectangular  pulse  train  of  varying  width  can  then  be  applied 
to  trigger  a  digital  counter.  These  are  the  frequency  counters  designed  to  measure  the  instan¬ 
taneous  frequency  from  the  number  of  zero  crossings.  The  rate  of  zero  crossings  is  equal  to 
the  instantaneous  frequency  of  the  input  signal. 


FM  Demodulation  via  PLL 

Consider  a  PLL  that  is  in  lock  with  input  signal  sin  [u>ct  +  0,(/)]  and  output  error  signal  ea(t). 
When  the  input  signal  is  an  FM  signal. 


Oi(t)  = 


m(a )  da  + 


1 T 
2 


(5.27) 


then. 


Q0(t)  =  kf  f  m(a)da  +  0.5n  —  0e(t) 

J-0 O 

With  PLL  in  lock  we  can  assume  a  small  frequency  error  9e(t)  %  0.  Thus,  the  loop  filter  output 
signal  is 


e„(t)  =  ~0O(  0  =  -~r\kf  f  m(a)da  +  0.5jt  -  0e(f)l  ~  —  m(t)  (5.28) 
c  cdt  [  7-oo  J  c 


Thus,  the  PLL  acts  as  an  FM  demodulator.  If  the  incoming  signal  is  a  PM  wave,  then 
e„(t)  =  knm(t)/c.  In  this  case  we  need  to  integrate  e„(t)  to  obtain  the  desired  signal  m(/). 

To  more  precisely  analyze  PLL  behavior  as  an  FM  demodulator,  we  consider  the  case  of 
a  small  error  (linear  model  of  the  PLL)  with  H(s)  =  1 .  For  this  case,  feedback  analysis  of  the 
small -error  PLL  in  Chapter  4  becomes 


«„(*)  = 


AKH(s) 

s+AKH(s) 


®i(s) 


AK 

7+AK 


®i(s) 


If  E„(s)  and  M  (.r)  are  Laplace  transforms  of  e„{t)  and  m(t),  respectively,  then  from  Eqs.  (5.27) 
and  (5.28)  we  have 


0,(s)  = 


kfMjs) 


s 


and  s@0(s)  =  cE„(s ) 
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Hence, 


(kf\  AK 

Thus,  the  PLL  output  e0(t)  is  a  distorted  version  of  m(t)  and  is  equivalent  to  the  output  of  a 
single-pole  circuit  (such  as  a  simple  RC  circuit)  with  transfer  function  kfAK/c(s  4 -  AK)  to 
which  m(t)  as  the  input.  To  reduce  distortion,  we  must  choose  AK  well  above  the  bandwidth 
of  w(f),  so  that  eQ(t)  kfm(t)/c. 

In  the  presence  of  small  noise,  the  behavior  of  the  PLL  is  comparable  to  that  of  a  frequency 
discriminator.  The  advantage  of  the  PLL  over  a  frequency  discriminator  appears  only  when 
the  noise  is  large. 


5.5  EFFECTS  OF  NONLINEAR  DISTORTION 
AND  INTERFERENCE 

Immunity  of  Angle  Modulation  to  Nonlinearities 

A  very  useful  feature  ot  angle  modulation  is  its  constant  amplitude,  which  makes  it  less  sus¬ 
ceptible  to  nonlinearities.  Consider,  for  instance,  an  amplifier  with  second-order  nonlinear 
distortion  whose  input  ,t(r)  and  output  >>(/)  are  related  by 

y(r)  =  ao  +  ai*(/)  +  aix2(t)  -\ +  a„xn (t) 


C  I  early,  the  first  term  is  the  desired  signal  amplification  term,  while  the  remaining  terms  are 
the  unwanted  nonlinear  distortion.  For  the  angle  modulated  signal 

x(t)  =  A  cos  [t oct  +  VKO] 

trigonometric  identities  can  be  applied  to  rewrite  the  nonideal  system  output  y(t)  as 

yd)  =  Co  +  c,  cos  lcoct  +  xHt)]  +  C2  cos  [2(0ct  +  2^(f)] 

+  •  •  •  -I-  c„  cos  [ncoct  +  n\jr(t )] 

Because  sufficiently  large  coc  makes  each  component  of  y(t)  separable  in  frequency  domain, 
a  bandpass  filter  centered  at  with  bandwidth  equaling  to  Bm  (or  BPM)  can  extract  the 
desired  FM  signal  component  c,  cos  [a >ct  + +«)}  without  any  distortion.  This  shows  that 
angle-modulated  signals  are  immune  to  nonlinear  distortions. 

A  similar  nonlinearity  in  AM  not  only  causes  unwanted  modulation  with  carrier  frequen¬ 
cies  nav  but  also  causes  distortion  of  the  desired  signal.  For  instance,  if  a  DSB-SC  signal 
m(i)  cos  coct  passes  through  a  nonlinearity  y(r)  =  a x(t)  +  bx\t),  the  output  is 


){t)  am(t)  cos  (Dct  +  bni^(t)  cos**  coct 
=  |am(/)  +  ^m3(r)J  b 


cos  (Dct  -j-  cos  3(oct 
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Immunity  from  nonlinearity  is  the  primary  reason  for  the  use  of  angle  modulation  in 
microwave  radio  relay  systems,  where  power  levels  are  high.  This  requires  highly  efficient 
nonlinear  class  C  amplifiers.  In  addition,  the  constant  amplitude  of  FM  gives  it  a  kind  of 
immunity  to  rapid  fading.  The  effect  of  amplitude  variations  caused  by  rapid  fading  can  be 
eliminated  by  using  automatic  gain  control  and  bandpass  limiting.  These  advantages  made  FM 
attractive  as  the  technology  behind  the  first-generation  (1G)  cellular  phone  system. 

The  same  advantages  of  FM  also  make  it  attractive  for  microwave  radio  relay  systems.  In 
the  legacy  analog  long-haul  telephone  systems,  several  channels  are  multiplexed  by  means  of 
SSB  signals  to  form  L-carrier  signals.  The  multiplexed  signals  are  frequency-modulated  and 
transmitted  over  a  microwave  radio  relay  system  with  many  links  in  tandem.  In  this  application, 
however,  FM  is  used  not  to  reduce  noise  effects  but  to  realize  other  advantages  of  constant 
amplitude,  and.  hence,  NBFM  rather  than  WBFM  is  used. 


Interference  Effect 

Angle  modulation  is  also  less  vulnerable  than  AM  to  small-signal  interference  from  adjacent 
channels. 

Let  us  consider  the  simple  case  of  the  interference  of  an  unmodulated  carrier  A  cos  coct 
with  another  sinusoid  /  cos  (coc  -I-  co)t.  The  received  signal  r(t)  is 

r(t)  =  A  cos  coct  + 1  cos  (coc  +  co)t 

=  (A  +  /  cos  cot)  cos  coct  —  /  sin  col  sin  coct 
=  Er(t)  cos  [coct  +  fait)] 


where 


When  the  interfering  signal 


xj/d(t)  =  tan 


-l 


/  sin  cot 
A- \- 1  cos  cot 


is  small  in  comparison  to  the  carrier  (/  A), 


/ 

\jjd(t)  ~  -  sin  cot 
A 


(5.29) 


The  phase  of  Er{t)  cos  [coct  +  tMOl  is  fd(t),  and  its  instantaneous  frequency  is  coc  +  xj/d(t). 
If  the  signal  Er(t)  cos  [coct  +  ifj(t)]  is  applied  to  an  ideal  phase  demodulator,  the  output  yd(t) 
would  be  fait).  Similarly,  the  output  yd(t)  of  an  ideal  frequency  demodulator  would  be  \frd(t). 
Hence, 


for  PM  (5.30) 

forFM  (5.31) 

Observe  that  in  either  case,  the  interference  output  is  inversely  proportional  to  the  carrier 
amplitude  A.  Thus,  the  larger  the  carrier  amplitude  A,  the  smaller  the  interference  effect.  This 
behavior  is  very  different  from  that  in  AM  signals,  where  the  interference  output  is  independent 


/  . 

yd(0  =  j  sin 
loo 

yd (t)  =  —  cos  cot 


286  ANGLE  MODULATION  AND  DEMODULATION 


Figure  5.14 

Effect  of 
interference 
in  PM,  FM,  and 
FM  with 
preemphasis- 
deemphasis 

(PDE). 


of  the  carrier  amplitude.*  Hence,  angle-modulated  systems  are  much  better  than  AM  systems 
at  suppressing  weak  interference  (/  <K  A). 

Because  of  the  suppression  of  weak  interference  in  FM,  we  observe  what  is  known  as  the 
capture  effect  when  listening  to  FM  radios.  For  two  transmitters  with  carrier  frequency  sep¬ 
aration  less  than  the  audio  range,  instead  of  getting  interference,  we  observe  that  the  stronger 
carrier  effectively  suppresses  (captures)  the  weaker  carrier.  Subjective  tests  show  that  an  inter¬ 
ference  level  as  low  as  35  dB  in  the  audio  signals  can  cause  objectionable  effects.  Hence,  in 
AM,  the  interference  level  should  be  kept  below  35  dB.  On  the  other  hand,  for  FM,  because 
of  the  capture  effect,  the  interference  level  need  only  be  below  6  dB. 

The  interference  amplitude  (l /A  for  PM  and  Iw/A  for  FM)  vs.  co  at  the  receiver  output 
is  shown  in  Fig.  5.14.  The  interference  amplitude  is  constant  for  all  co  in  PM  but  increases 
linearly  with  co  in  FM.* 


Interference  due  to  Channel  Noise 

The  channel  noise  acts  as  interference  in  an  angle-modulated  signal.  We  shall  consider  the 
most  common  form  of  noise,  white  noise,  which  has  a  constant  power  spectral  density.  Such  a 
noise  may  be  considered  as  a  sum  of  sinusoids  of  all  frequencies  in  the  band.  All  components 
ave  the  same  amplitudes  (because  of  uniform  density).  This  means  /  is  constant  for  all  co,  and 
the  amplitude  spectrum  of  the  interference  at  the  receiver  output  is  as  shown  in  Fig.  5.14.  The 
interference  amplitude  spectrum  is  constant  for  PM,  and  increases  linearly  with  to  for  FM. 


Preemphasis  and  Deemphasis  in  FM  Broadcasting 

Figure  5.14  shows  that  in  FM,  the  interference  (the  noise)  increases  linearly  with  frequenc; 

S?  ,h' POT :he„ T 0U,pU' is  a.  higher  frequencies.  A  glam* . 

PSD  of  “  signal  »(,)  is  concentrated  a.  lower  frequencit 
Us’  1  e  no,se  *s  concentrated  at  higher  frequencies,  where  m(t) 1 


*  For  instance,  an  AM  signal  with  an  interfering  sinusoid  /  cos  (<oc  +  «,),  is  given  by 
r(t)  =  [A  +  m(/)]  cos  coc t  +  /  cos  (a)c  +  co)t 

=  lA  +  m(/)  + 1  cos  w'l  cos  (oct  -  /  sin  cot  sin  ojct 

The  envelope  of  this  signal  is 


~*J  ~  ^  t  myi)  +  /  cos  (ot  I  A 

I.  -CO.  «.  .hid.  i.  Inam^^n.  „d.«li 
systems.  SynChr°nous  demodulation  is  used.  We  come  to  aVimilar  conclusion  for  AM- 

behaves  linearly^br  multipl^mlrferingTin^so'ids '!>y  -°  m°re  than  one  in,erferin8  sinusoid.  The  system 

carrier  amplitude.  8  S'nUSOlds  Provided  lh<='r  amplitudes  are  very  small  in  comparison  to  the 


Figure  5.15 

Preemphasis- 
deemphasis  in 
an  FM  system. 


Figure  5.16 

(a)  Preemphasis 
filter  and  (b)  its 
frequency 
response. 

(c)  Deemphasis 
filter  and  (d)  its 
frequency 
response. 
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Figure  5.15 

Preemphasis- 
deemphasis  in 
an  FM  system. 


Figure  5.16 

(a)  Preemphasis 
filter  and  (b)  its 
frequency 
response. 

(c)  Deemphasis 
filter  and  (d)  its 
frequency 
response. 
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the  weakest.  This  may  seem  like  a  disaster.  But  actually,  in  this  very  situation  there  is  a 
hidden  opportunity  to  reduce  noise  greatly.  The  process,  shown  in  Fig.  5. 15,  works  as  follows. 
At  the  transmitter,  the  weaker  high-frequency  components  (beyond  2.1  kHz)  of  the  audio 
signal  m(t)  are  boosted  before  modulation  by  a  preemphasis  filter  of  transfer  function  Hp(f). 
At  the  receiver,  the  demodulator  output  is  passed  through  a  deemphasis  filter  of  transfer 
function  Hd(f)  =  1  /Hp(f).  Thus,  the  deemphasis  filter  undoes  the  preemphasis  by  attenuating 
(deemphasizing)  the  higher  frequency  components  (beyond  2. 1  kHz),  and  thereby  restores  the 
original  signal  m(t).  The  noise,  however,  enters  at  the  channel,  and  therefore  has  not  been 
preemphasized  (boosted).  However,  it  passes  through  the  deemphasis  filter,  which  attenuates 
its  higher  frequency  components,  where  most  of  the  noise  power  is  concentrated  (Fig.  5.14). 
Thus,  the  process  of  preemphasis-deemphasis  (PDE)  leaves  the  desired  signal  untouched  but 
reduces  the  noise  power  considerably. 


Preemphasis  and  Deemphasis  Filters 

Figure  5. 14  provides  an  opportunity  to  preemphasis.  The  FM  has  smaller  interference  than  PM 
at  lower  frequencies,  while  the  opposite  is  true  at  higher  frequencies.  If  we  can  make  our  system 
behave  like  FM  at  lower  frequencies  and  behave  like  PM  at  higher  frequencies,  we  will  have 
the  best  of  both  worlds.  This  is  accomplished  by  a  system  used  in  commercial  broadcasting 
(Fig.  5.15)  with  the  preemphasis  (before  modulation)  and  deemphasis  (after  demodulation) 
filters  Hp(f)  and  Hd(f)  shown  in  Fig.  5.16.  The  frequency f\  is  2.1  kHz,  and/2  is  typically 
30  kHz  or  more  (well  beyond  audio  range),  so  that  /2  does  not  even  enter  into  the  picture. 
These  filters  can  be  realized  by  simple  RC  circuits  (Fig.  5.16).  The  choice  of/i  =  2.1  kHz 
was  apparently  made  on  an  experimental  basis.  It  was  found  that  this  choice  of f\  maintained 


288 


ANGLE  MODULATION  AND  DEMODULATION 


the  same  peak  amplitude  nip  with  or  without  preemphasis.1*  This  satisfied  the  constraint  of  a 
fixed  transmission  bandwidth. 

The  preemphasis  transfer  function  is 


where  K ,  the  gain,  is  set  at  a  value  of  0*2/ co\ .  Thus, 


(5.32b) 


For  2j if  co  1, 


(5.32c) 


For  frequencies  w\  <£  2nf  <&  (02, 


(5.32d) 


Thus,  the  preemphasizer  acts  as  a  differentiator  at  intermediate  frequencies  (2.1—15  kHz), 
which  effectively  makes  the  scheme  PM  over  these  frequencies.  This  means  that  FM  with 
PDE  is  FM  over  the  modulating-signal  frequency  range  of  0  to  2.1  kHz  and  is  nearly  PM  over 
the  range  of  2. 1  to  15  kHz,  as  desired. 

The  deemphasis  filter  Hj(f)  is  given  by 


Note  that  for  2nf  «  a>2,  Hp(f )  ~  (J2nf +a)0/(O\.  Hence,  Hp(f)Hd(f)  ~  1  over  the  baseband 


Optimum  PDE  filters  are  discussed  in  Chapter  10.  For  historical  and  practical  reasons, 
optimum  PDE  filters  are  not  used  in  practice.  It  can  be  shown  that  the  PDE  enhances  the  SNR 
by  13.27  dB  (a  power  ratio  of  21.25). 
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2. 1  kHz).  In  the  Dolby-A  system,  designed  for  commercial  use,  the  bands  are  divided  into  four 
subbands  (below  80  Hz,  80—3  kHz,  3—9  kHz,  and  above  9  kHz).  The  amount  of  preemphasis 
is  optimized  for  each  band. 

We  could  also  use  PDE  in  AM  broadcasting  to  improve  the  output  SNR.  In  practice, 
however,  this  is  not  done  for  several  reasons.  First,  the  output  noise  amplitude  in  AM  is 
constant  with  frequency  and  does  not  increase  linearly  as  in  FM.  Hence,  the  deemphasis  does 
not  yield  such  a  dramatic  improvement  in  AM  as  it  does  in  FM.  Second,  introduction  of  PDE 
would  necessitate  modifications  of  receivers  already  in  use.  Third,  increasing  high-frequency 
component  amplitudes  (preemphasis)  would  increase  interference  with  adjacent  stations  (no 
such  problem  arises  in  FM).  Moreover,  an  increase  in  the  frequency  deviation  ratio  (i  at  high 
frequencies  would  make  detector  design  more  difficult. 


5.6  SUPERHETERODYNE  ANALOG  AM/FM 
RECEIVERS 

The  radio  receiver  used  in  broadcast  AM  and  FM  systems,  is  called  the  superheterodyne 
receiver  (Fig.  5.17).  It  consists  of  an  RF  (radio-frequency)  section,  a  frequency  converter 
(Example  4.2),  an  intermediate-frequency  (IF)  amplifier,  an  envelope  detector,  and  an  audio 
amplifier. 

The  RF  section  consists  basically  of  a  tunable  filter  and  an  amplifier  that  picks  up  the 
desired  station  by  tuning  the  filter  to  the  right  frequency  band.  The  next  section,  the  frequency 
mixer  (converter),  translates  the  carrier  from  coc  to  a  fixed  IF  frequency  of  co^  (see  Example 
4.2  for  frequency  conversion).  For  this  purpose,  the  receiver  uses  a  local  oscillator  whose 
frequency  /lo  is  exactly /if  above  the  incoming  carrier  frequency  fc\  that  is, 

/lo  =fc  +/if 

The  simultaneous  tuning  of  the  local  oscillator  and  the  RF  tunable  filter  is  done  by  one  joint 
knob.  Tuning  capacitors  in  both  circuits  are  ganged  together  and  are  designed  so  that  the  tuning 


Figure  5.17 

Superheterodyne 

receiver. 
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frequency  of  the  local  oscillator  is  always  /if  Hz  above  the  tuning  frequency  fc  of  the  RF  filter. 
This  means  every  station  that  is  tuned  in  is  translated  to  a  fixed  carrier  frequency  of /if  Hz  by 
the  frequency  converter  for  subsequent  processing  at  IF. 

This  superheterodyne  receiver  structure  is  broadly  utilized  in  most  broadcast  systems.  The 
intermediate  frequencies  are  chosen  to  be  455  kHz  (AM  radio),  10.7  MHz  (FM  radio),  and 
38  MHz  (TV  reception). 

As  discovered  by  Armstrong  for  AM  signals,  the  translation  of  all  stations  to  a  fixed 
intermediate  frequency  (/if  =  455  kHz  for  AM)  allows  us  to  obtain  adequate  selectivity.  It  is 
difficult  to  design  precise  bandpass  filters  of  bandwidth  10  kHz  (the  modulated  audio  spectrum) 
if  the  center  frequency  fc  is  very  high.  This  is  particularly  true  in  the  case  of  tunable  filters. 
Hence,  the  RF  filter  cannot  provide  adequate  selectivity  against  adjacent  channels.  But  when 
this  signal  is  translated  to  an  IF  frequency  by  a  converter,  it  is  further  amplified  by  an  IF 
amplifier  (usually  a  three-stage  amplifier),  which  does  have  good  selectivity.  This  is  because 
the  IF  frequency  is  reasonably  low;  moreover,  its  center  frequency  is  fixed  and  factory-tuned. 
Hence,  the  IF  section  can  effectively  suppress  adjacent-channel  interference  because  of  its 
high  selectivity.  It  also  amplifies  the  signal  for  envelope  detection. 

In  reality,  the  entire  selectivity  is  practically  realized  in  the  IF  section;  the  RF  section 
plays  a  negligible  role.  The  main  function  of  the  RF  section  is  image  frequency  sup¬ 
pression.  As  observed  in  Example  4.2,  the  output  of  the  mixer,  or  converter,  consists  of 
components  ot  the  difference  between  the  incoming  (fc )  and  the  local  oscillator  frequen¬ 
cies  (/Lo)  (i  e.,/ip  =  |/Lo  —/cl)-  Now,  consider  the  AM  example.  If  the  incoming  carrier 
frequency  fc  =  1000  kHz,  then/LO  =fc  +/rf  =  1000  4-455  =  1455  kHz.  But  another  carrier, 
with/.'  =  M55  +  455  =1910  kHz,  will  also  be  picked  up  because  the  difference/.'  — /lo  *s 
also  455  kHz.  The  station  at  1910  kHz  is  said  to  be  the  image  of  the  station  of  1000  kHz. 
AM  stations  that  are  2/if  =  910  kHz  apart  are  called  image  stations  and  both  would  appear 
simultaneously  at  the  IF  output,  were  it  not  for  the  RF  filter  at  receiver  input.  The  RF  filter 
may  provide  poor  selectivity  against  adjacent  stations  separated  by  10  kHz,  but  it  can  provide 
reasonable  selectivity  against  a  station  separated  by  910  kHz.  Thus,  when  we  wish  to  tune  in 
a  station  at  1000  kHz,  the  RF  filter,  tuned  to  1000  kHz,  provides  adequate  suppression  of  the 
image  station  at  1910  kHz. 


'  a  i  v/vw 


I  he  receiver  (Fig.  5.17)  converts  the  incoming  carrier  frequency  to  the  IF  by  using  i 
oscillator  of  Irequency/Lo  higher  than  the  incoming  carrier  frequency  and,  hence,  is  called 
a  superheterodyne  receiver.  We  pick/LO  higher  than  fc  because  this  leads  to  a  smaller  tuning 
ratio  of  the  maximum  to  minimum  tuning  frequency  for  the  local  oscillator  The  AM  broadcast- 
band  frequencies  range  from  530  to  1710  kHz.  The  superheterodyne /LO  ranges  from  1005  to 
2055  kHz  (ratio  of  2.045),  whereas  the  subheterodyne  range  of/LO  would  be  95  to  1145  kHz 

( ratio  ol  1 2.05).  It  is  much  easier  to  design  an  oscillator  that  is  tunable  over  a  smaller  frequency 
ratio. 

The  importance  ot  the  superheterodyne  principle  in  radio  and  television  broadcasting  can¬ 
not  be  overstressed.  In  the  early  days  (before  1919),  the  entire  selectivity  against  adjacent 
stations  was  realized  in  the  RF  filter.  Because  this  filter  often  had  poor  selectivity,  it  was  nec¬ 
essary  to  use  several  stages  (several  resonant  circuits)  in  cascade  for  adequate  selectivity.  In 
the  earlier  receivers  each  filter  was  tuned  individually.  It  was  very  time-consuming  and  cum¬ 
bersome  to  tune  in  a  station  by  bringing  all  resonant  circuits  into  synchronism.  This  task  was 
made  easier  as  var.able  capacitors  were  ganged  together  by  mounting  them  on  the  same  shaft 

c  m  he  Capac,tors  are  bulkV-  and  there  is  a  limit  to  the  number  that 

rnnb  e  tr*  f  These  faCt0rS’ ,n  tUm’ limited  the  selectivity  available  from  receivers. 
G  nsequently.  adjacent  carrier  frequencies  had  to  be  separated  widely,  resulting  in  fewer  fte- 

morTi radio  stations" '  &  SUperheterodyne  receiver  that  made  it  possible  to  accommodate  man) 
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5.7  FM  BROADCASTING  SYSTEM 

The  FCC  has  assigned  a  frequency  range  of  88  to  108  MHz  for  FM  broadcasting, 
with  a  separation  of  200  kHz  between  adjacent  stations  and  a  peak  frequency  deviation 
A f  =  75  kHz. 

A  monophonic  FM  receiver  is  identical  to  the  superheterodyne  AM  receiver  in  Fig.  5.17, 
except  that  the  intermediate  frequency  is  10.7  MHz  and  the  envelope  detector  is  replaced  by 
a  PLL  or  a  frequency  discriminator  followed  by  a  deemphasizer. 

Earlier  FM  broadcasts  were  monophonic.  Stereophonic  FM  broadcasting,  in  which  two 
audio  signals,  L  (left  microphone)  and  R  (right  microphone),  are  used  for  a  more  natural  effect, 
was  proposed  later.  The  FCC  ruled  that  the  stereophonic  system  had  to  be  compatible  with 
the  original  monophonic  system.  This  meant  that  the  older  monophonic  receivers  should  be 
able  to  receive  the  signal  L  +  R,  and  the  total  transmission  bandwidth  for  the  two  signals  (L 
and  R)  should  still  be  200  kHz,  with  A/  =  75  kHz  for  the  two  combined  signals.  This  would 
ensure  that  the  older  receivers  could  continue  to  receive  monophonic  as  well  as  stereophonic 
broadcasts,  although  the  stereo  effect  would  be  absent. 

A  transmitter  and  a  receiver  for  a  stereo  broadcast  are  shown  in  Fig.  5.18a  and  c.  At  the 
transmitter,  the  two  signals  L  and  R  are  added  and  subtracted  to  obtain  L  +  R  and  L-  R.  These 
signals  are  preemphasized.  The  preemphasized  signal  (L  -  R)'  DSB-SC  modulates  a  carrier 
of  38  kHz  obtained  by  doubling  the  frequency  of  a  19-kHz  signal  that  is  used  as  a  pilot.  The 
signal  ( L  +  R)'  is  used  directly.  All  three  signals  (the  third  being  the  pilot)  form  a  composite 
baseband  signal  m(t )  (Fig.  5.18b), 


/  /  COrt 

m(t)  =  (L  +  R)  +  (L-  R)  cos  coct  +  acos  —  (5.33) 

The  reason  for  using  a  pilot  of  19  kHz  rather  than  38  kHz  is  that  it  is  easier  to  sep¬ 
arate  the  pilot  at  19  kHz  because  there  are  no  signal  components  within  4  kHz  of  that 
frequency. 

The  receiver  operation  (Fig.  5. 1 8c)  is  self-explanatory.  A  monophonic  receiver  consists  of 
only  the  upper  branch  of  the  stereo  receiver  and,  hence,  receives  only  L  +  R.  This  is  of  course 
the  complete  audio  signal  without  the  stereo  effect.  Hence,  the  system  is  compatible.  The  pilot 
is  extracted,  and  (after  doubling  its  frequency)  it  is  used  to  demodulate  coherently  the  signal 
(L  —  R)f  cos  coct . 

An  interesting  aspect  of  stereo  transmission  is  that  the  peak  amplitude  of  the  composite 
signal  m(t)  in  Eq.  (5.33)  is  practically  the  same  as  that  of  the  monophonic  signal  (if  we  ignore 
the  pilot),  and,  hence.  A/ — which  is  proportional  to  the  peak  signal  amplitude  for  stereophonic 
transmission — remains  practically  the  same  as  for  the  monophonic  case.  This  can  be  explained 
by  the  so-called  interleaving  effect  as  follows. 

The  U  and  R'  signals  are  very  similar  in  general.  Hence,  we  can  assume  their  peak  ampli¬ 
tudes  to  be  equal  to  Ap.  Under  the  worst  possible  conditions,  L'  and  R'  will  reach  their  peaks 
at  the  same  time,  yielding  [Eq.  (5.33)] 

MOImax  =2 Ap+Ot 

In  the  monophonic  case,  the  peak  amplitude  of  the  baseband  signal  (L  +  R)'  is  2 Ap.  Hence,  the 
peak  amplitudes  in  the  two  cases  differ  only  by  a,  the  pilot  amplitude.  To  account  for  this,  the 
peak  sound  amplitude  in  the  stereo  case  is  reduced  to  90%  of  its  full  value.  This  amounts  to  a 
reduction  in  the  signal  power  by  a  ratio  of  (0.9)*-  =  0.81,  or  1  dB.  Thus,  the  effective  SNR  is 
reduced  by  1  dB  because  of  the  inclusion  of  the  pilot. 
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Figure  5.1 8 

(a)  FM  stereo 
transmitter,  (b) 
Spectrum  of  a 
baseband  stereo 
signal,  (c)  FM 
stereo  receiver. 


(a) 


Figure  5.19 

FM  and  PM 
signals  in  the 
time  and 
frequency 
domains. 


(c) 


5.8  MATLAB  EXERCISES 

In  this  section,  we  use  MATLAB  to  build  an  FM  modulation  and  demodulation  example.  The 
MATLAB  program  is  given  by  ExampleFM.m.  Once  again  use  apply  the  same  message 
signal  m2(t).  The  FM  coefficient  is  kf  =  80  and  the  PM  coefficient  is  k„  =  n.  The  carrier 
frequency  remains  300  Hz.  The  resulting  FM  and  PM  signals  in  the  time  domain  are  shown  in 

lg..  .  19.  The  corresponding  frequency  responses  are  also  shown  in  Fig  5  19  The  frequency 

domain  responses  clearly  show  the  much  higher  bandwidths  of  the  FM*and  PM  signals  when 
compared  with  amplitude  modulations. 


%  (ExampleFM.m) 


%  This  program  uses  triangl 
%  and  demodulation 


m  to  illustrate  frequency  modulation 
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Figure  5.19 

FM  and  PM 
signals  in  the 
time  and 
frequency 
domains. 
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t S=1 . e-4 ; 


t=-0.04:tS:0.04; 

Ta=0 . 01 ; 

m_sig=triangl ( (t+0 . 01) /Ta) -triangl ( (t-0 . 01) /Ta) ; 

Lf  f  t  =  length  (t)  ;  Lf  f t=2/sceil  (log2  (Lfft)  )  ; 

M_fre=f  ftshift  (f  ft  (m_sig/  Lf  ft) )  ; 
f reqm= ( -Lf ft/2 :Lf ft/2-1) / (Lf ft*ts)  ; 

B_m= 100;  %Bandwidth  of  the  signal  is  B_m  Hz. 

%  Design  a  simple  lowpass  filter  with  bandwidth  B__m  Hz. 
h=f irl (80, [B_m*ts] ) ; 

% 

kf =160*pi ; 

m_intg=kf *ts*cumsum(m_sig) ; 
s_fm=cos (2*pi*300*t+m_intg) ; 
s_pm=cos (2*pi*300*t+pi*m_sig) ; 

Lf f t=length  (t)  ;  Lf f t=2~ceil  (log2  (Lfft)  +1); 

S_fm=f ftshift (f f t (s_fm, Lf ft) ) ; 

S_j)m=f  ftshift  (f  ft  (s_pm,  Lf  f  t )  )  ; 
f reqs= (-Lf ft/2 :Lf ft/2-1) / (Lfft*ts) ; 

s_fmdem=dif f ( [s_fm(l)  s_fm] )/ts/kf; 
s_fmrec=s_fmdem. * (s_fmdem>0) ; 
s_dec=filter (h, l,s_fmrec) ; 


%  Demodulation 

%  Using  an  ideal  LPF  with  bandwidth  200  Hz 


Trangel=  [-0.04  0.04  -1.2  1.2]; 


figure (1) 

subplot (211) ;ml=plot (t ,m_sig) ; 

axis (Trangel) ;  set (ml, ' Linewidth' ,2) ; 

xlabel ( ' {\it  t}  (sec)');  ylabel('{\it  m}({\it  t})'); 

title ( 'Message  signal'); 

subplot (212) ;m2=plot (t.sdec) ; 

set (m2, 'Linewidth' ,2) ; 

xlabel ('{\it  t}  (sec)');  ylabel('{\it  m}_d({\it  t})') 
title ('demodulated  FM  signal'); 


figure (2) 


subplot (211) ;tdl=plot (t, s_fm) ; 
axis (Trangel) ;  set (tdl, ' Linewidth' , 2) 
xlabel ('{\it  t}  (sec)');  ylabel('{\it 
title ('FM  signal'); 


s}_{\rm  FM} ( (\it 


subplot (212) ;td2=plot(t,s _pm) ; 
axis (Trangel) ;  set (td2 ,' Linewidth' , 2) 
xlabel ('{\it  t}  (sec)');  ylabel('{\it 


s}_{\rm  PM} ( {\it 


t } )  ' ) 


t})  ') 
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title ('PM  signal' ) ; 
figure (3) 

subplot (211) ; fpl=plot (t , s_fmdem) / 
set (f pi, ' Linewidth' , 2) ; 

xlabel('{\it  t}  (sec)');  ylabel('{\it  d  s}_{\rm  FM}({\it  t})/dt') 

title ( ' FM  derivative'); 

subplot (212) ; fp2=plot (t , s_fmrec) ; 

set (fp2, 'Linewidth' ,  2)  ; 

xlabel ( ' { \it  t}  (sec) ' ) ; 

title (' rectified  FM  derivative'); 

Frange=  [-600  600  0  300]; 
figure (4) 

subplot (211 ) ; fdl=plot (f reqs , abs (S_fm) ) ; 
axis (Frange) ;  set (fdl, 'Linewidth' ,2)  ; 

xlabel  ( '  { \it  f}  (Hz)');  ylabel('{\it  S}_{\rm  FM}({\it  f})') 
title ('FM  amplitude  spectrum'); 
subplot (212) ; f d2=plot (f reqs, abs (S_pm) ) ; 
axis (Frange) ;  set (fd2 , ' Linewidth' , 2 ) ; 

xlabel ( ' { \it  f}  (Hz)');  ylabel('{\it  S}_{\rm  PM}({\it  f})') 
title ('PM  amplitude  spectrum'); 


figure  5.20 

Signals  at  the 
demodulator: 

(°)  after 
differentiator; 

W  rectifier. 


To  obtain  the  demodulation  results  (Fig.  5.20),  a  differentiator  is  first  applied  to  change  the 
frequency-modulated  signal  into  an  amplitude-  and  frequency-modulated  signal  (Fig.  5.20). 
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Figure  5.21 

FM  modulation 
and  1 

demodulation: 

(a)  original  0  5 

message; 

(b)  recovered  g 

signal.  's  0 
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Demodulated  FM  signal 


PROBLEM 


Figure  P.5.1-1 


Upon  applying  the  rectifier  for  envelope  detection,  we  see  that  the  message  signal  follows 
closely  to  the  envelope  variation  of  the  rectifier  output. 

Finally,  the  rectifier  output  signal  is  passed  through  a  low-pass  filter  with  bandwidth 
100  Hz.  We  used  the  finite  impulse  response  low-pass  filter  of  order  80  this  time  because  of 
the  tighter  filter  constraint  in  this  example.  The  FM  detector  output  is  then  compared  with  the 
original  message  signal  in  Fig.  5.21. 

The  FM  demodulation  results  clearly  show  some  noticeable  distortions.  First,  the  higher 
order  low-pass  filter  has  a  much  longer  response  time  and  delay.  Second,  the  distortion  dur¬ 
ing  the  negative  halt  ot  the  message  is  more  severe  because  the  rectifier  generates  very  few 
cycles  of  the  halt-sinusoid.  This  happens  because  when  the  message  signal  is  negative,  the 
instantaneous  frequency  of  the  FM  signal  is  low.  Because  we  used  a  carrier  f  requency  of  only 
300  Hz,  the  effect  ot  low  instantaneous  frequency  is  much  more  pronounced.  If  a  practical 
carrier  frequency  of  100  MHz  were  applied,  this  kind  of  distortion  would  be  completely 
negligible. 
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PROBLEMS 

5.1-1  Sketch  <pm(t)  and  <pm(t)  for  the  modulating  signal  m(t)  shown  in  Fig.  P5.1-1,  given  cac  = 
108,  kf  =  105,andLp  =  25. 


5.1-2  A  baseband  signal  m(t)  is  the  periodic  sawtooth  signal  shown  in  Fig.  P5. 1  -2. 

(a)  Sketch  (pm  (/)  and  (pPM  (/)  for  this  signal  m(t)  if  coc  =  2;r  x  1 06,  kf  =  20007z\  and  kp  =  n/2. 

(b)  Show  that  this  PM  signal  is  equivalent  to  another  PM  signal  modulated  by  a  periodic  rect¬ 
angular  message  signal.  Explain  why  it  is  necessary  to  use  kp  <  tt  in  this  case.  [Note  that 
the  PM  signal  has  a  constant  frequency  but  has  phase  discontinuities  corresponding  to  the 
discontinuities  of 


5.1-3  A  periodic  message  signal  m(t)  as  shown  in  Figure  P5. 1-3  is  transmitted  as  an  angle-modulated 
signal. 

The  modulation  system  has  coc  =  27T  x  1 0^  rad/s.  Let  the  signal  bandwidth  of  m(t)  be  approximated 
by  its  own  fifth-harmonic  frequency. 

(a)  To  generate  an  FM  signal  with  kf  =  20 jr,  sketch  the  frequency-modulated  signal  sFM(r)  in 
the  time  domain. 
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(b)  If  a  PM  signal  is  generated  for  m(t)  with  kp  =  n/2,  sketch  the  phase-modulated  signal  .?pmW 
in  the  time  domain. 

5.1- 4  Over  an  interval  |/|  <  1,  an  angle-modulated  signal  is  given  by 

^>EM(/)  =  10  cos  13,0007/7 
It  is  known  that  the  carrier  frequency  o)c  =  10,0007r. 

(a)  Assuming  a  PM  signal  with  kp  =  1000,  determine  m(t)  over  the  interval  |f  |  <  1. 

(b)  Assuming  an  FM  signal  with  kf  =  1000,  determine  m(t)  over  the  interval  |f|  <  1. 

5.1- 5  A  periodic  message  signal  m(t)  as  shown  in  Figure  P5.1-5  is  to  be  transmitted  by  using  angle 

modulation.  Its  bandwidth  is  approximated  by  200  Hz.  The  modulation  system  has  coc  =  4/r  x 
10**  rad/s. 


(a)  If  an  FM  signal  with  kf  =  500tt  is  generated,  sketch  in  the  FM  signal  in  the  time  domain. 

(b)  It  a  PM  signal  with  kp  =  0.257T  is  to  be  generated,  sketch  the  PM  signal  waveform  in  the 
time  domain. 

5.2- 1  For  the  modulated  signals  in  Prob.  5.1-3, 

(a)  Determine  the  approximate  bandwidth  of  the  FM  signal. 

(b)  Determine  the  approximate  bandwidth  of  the  PM  signal. 

5.2- 2  Repeat  Prob.  5.2-1  for  the  modulated  signals  in  Prob.  5.1-5.  Assume  the  bandwidth  of  m(t)  to  be 

its  fifth-harmonic  frequency. 

5.2- 3  For  a  message  signal 


m(/)  =  2  cos  1000/  +  9  cos  2000tt/ 

(a)  Write  expressions  (do  not  sketch)  for  VpM(,)  and  Vfm(,)  when  A  =  10.  cuc  =  106,  */  ' 
lOOOTr  and  =  1 .  For  determining  (/),  use  the  indefinite  integral  of  m(/);  that  is.  take 
the  value  of  the  integral  at  t  =  -oo  to  be  0. 
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(b)  Estimate  the  bandwidths  of  ^  (/)  and  (pPM  (r). 

5.2- 4  An  angle-modulated  signal  with  carrier  frequency  a*  =  2jt  x  106  is  described  by  the  equation 

^em  (0  =  10  cos  ( (oct  +  0. 1  sin  2(XX)7r/) 

(a)  Find  the  power  of  the  modulated  signal. 

(b)  Find  the  frequency  deviation  A/. 

(c)  Find  the  phase  deviation  A<p. 

(d)  Estimate  the  bandwidth  of  (pEM  (t). 

5.2- 5  Repeat  Prob.  5.2-4  if 

<Pem  (0  =  5  cos  +  20  sin  1 000;r/  H-  10  sin  2000tt/) 

5.2- 6  Estimate  the  bandwidth  for  (pPM(t)  and  (p^it )  in  Prob.  5.1-1.  Assume  the  bandwidth  of  m(t)  in 

Fig.  P5.1-1  to  be  the  third-harmonic  frequency  of  m(t). 

5.2- 7  Given  m(t)  =  sin  20007T/,  kf  =  200, 000;r,  and  kp  =  10. 

(a)  Estimate  the  bandwidths  of  (p m  (t)  and  <pPM  (/). 

(b)  Repeat  part  (a)  if  the  message  signal  amplitude  is  doubled. 

(c)  Repeat  part  (a)  if  the  message  signal  frequency  is  doubled. 

(d)  Comment  on  the  sensitivity  of  FM  and  PM  bandwidths  to  the  spectrum  of  m(t). 

5.2- 8  Given  m(t)  —  £-r/|0°,  fc  =  104  Hz,  kf  =  600077%  and  kp  =  80007T. 

(a)  Find  A/,  the  frequency  deviation  for  FM  and  PM. 

(b)  Estimate  the  bandwidths  of  the  FM  and  PM  waves. 

Hint :  Find  M  if)  and  find  its  3  dB  bandwidth. 

5.2- 9  (a)  Applying  Parseval’s  theorem,  show  that 

oo 

E  = 1 

n=— oo 


(b)  Prove  that 


Hint:  Show  first  that  f*n  e^Psin  =  2  /qT  cos  (^sin  x  -  nx)dx. 

5.3- 1  Design  (the  block  diagram  of)  an  Armstrong  indirect  FM  modulator  to  generate  an  FM  carrier 

with  a  carrier  frequency  of  96  MHz  and  A/  =  20  kHz.  A  narrowband  FM  generator  with  fc  =  200 
kHz  and  adjustable  A/  in  the  range  of  9  to  1 0  Hz  is  available.  The  stockroom  also  has  an  oscillator 
with  adjustable  frequency  in  the  range  of  9  to  10  MHz.  There  are  bandpass  filters  with  any  center 
frequency,  and  only  frequency  doublers  are  available. 

5.3- 2  (a)  Design  (the  block  diagram  of)  an  Armstrong  indirect  FM  modulator  to  generate  an  FM  carrier 

with  a  carrier  frequency  of  98. 1  MHz  and  A/  =  75  kHz.  A  narrowband  FM  generator  is  available 
at  a  carrier  frequency  of  1 00  kHz  and  a  frequency  deviation  A/  =  1 0  Hz.  The  stockroom  also 
has  an  oscillator  with  an  adjustable  frequency  in  the  range  of  10  to  11  MHz.  There  are  also  plenty 
of  frequency  doublers,  triplers,  and  quintuplers. 
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Figure  P.5.4-1 


Figure  P.5.4-2 


(b)  Determine  the  tunable  range  of  the  carrier  frequency  in  the  design  of  part  (a). 

5.3- 3  Design  an  Armstrong  indirect  FM  modulator  in  block  diagram  to  generate  an  FM  signal  with 

carrier  96.3  MHz  and  A/  =  20.48  kHz.  A  narrowband  FM  generator  with/*-  =  150  kHz  and 
A/  =  10  Hz  is  available.  Only  a  limited  number  of  frequency  doublers  are  available  as  frequency 
multipliers.  In  addition,  an  oscillator  with  adjustable  frequency  from  1 3  to  1 4  MHz  is  also  available 
for  mixing,  along  with  bandpass  filters  of  any  specification. 

5.4-  1  (a)  Show  that  when  m(t)  has  no  jump  discontinuities,  an  FM  demodulator  fol  lowed  by  an  integrator 

(Fig.  P5.4-la)  forms  a  PM  demodulator.  Explain  why  it  is  necessary  for  the  FM  demodulator  to 
remove  any  dc  offset  before  the  integrator. 

(b)  Show  that  a  PM  demodulator  followed  by  a  differentiator  (Fig.  P5.4- lb)  serves  as  an  FM 
demodulator  even  if  m(t)  has  jump  discontinuities  or  the  PM  demodulator  output  has  dc  offset. 


(a)  PM  demodulator 


(b)  FM  demodulator 


5.4-2  A  periodic  square  wave  m(t)  (Fig.  P5.4-2a)  frequency-modulates  a  carrier  of  frequency  fc  = 
10  kHz  with  A f  =  I  kHz.  The  carrier  amplitude  is  A.  The  resulting  FM  signal  is  demodulated, 
as  shown  in  Fig.  P5.4-2b  by  the  method  discussed  in  Sec.  5.4  (Fig.  5.12).  Sketch  the  waveforms 
at  points  b.  c,  d,  and  e. 


m(t) 

« - 7 

1 

~1 

(a) 


«<o 

FM 

modulator 

d 

Envelope 

Dc 

dt 

© 

detector 

© 

blocking 

© 

(b) 


Demodulator 
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5.4- 3  Let  s(t)  be  an  angle-modulated  signal  that  a  receiver  obtains, 

s(t)  =  2  cos  [  107;n  +  2  sin  (20007T/  +  0.3jt)  -  3^  cos  ( 1 00/)]. 

(a)  Find  the  bandwidth  of  this  FM  signal. 

(b)  If  .y(f)  is  sent  to  an  (ideal)  envelope  detector,  find  the  detector  output  signal. 

(c)  If  .y(r)  is  first  differentiated  before  the  envelope  detector,  find  the  detector  output  signal. 

(d)  Explain  which  detector  output  can  be  processed  to  yield  the  message  signal  m(t)  and  find  the 
message  signal  m(t)  if  kj  =  2007T. 

5.4- 4  Use  small-error  analysis  of  PLL  to  show  that  a  first-order  loop  [//(s)  =  1]  cannot  track  an 

incoming  signal  whose  instantaneous  frequency  varies  linearly  with  time  [0;(t)  =  kt2].  This 
signal  can  be  tracked  within  a  constant  phase  if  H(s)  =  (s  +  a)/s.  It  can  be  tracked  with  zero 
phase  error  if  H(s)  =  (s2  4-  as  +  b)/s2. 

5.5- 1  In  an  extreme  case  of  preemphasis  filter  design,  we  can  choose 

Hpif)  =j2nf 

This  is  simply  a  differentiator  prior  to  frequency  modulation. 

Find  the  corresponding  deemphasis  filter  and  show  that  the  overall  modulation  and  demodulation 
system  using  this  particular  pair  of  preemphasis  and  deemphasis  filters  is  effectively  a  PM  system. 

5.6-  1  A  transmitter  transmits  an  AM  signal  with  a  carrier  frequency  of  1 530  kHz.  When  an  inexpensive 

radio  receiver  (which  has  a  poor  selectivity  in  its  RF-stage  bandpass  filter)  is  tuned  to  1530  kHz, 
the  signal  is  heard  loud  and  clear.  This  same  signal  is  also  heard  (not  as  strongly)  at  another  dial 
setting.  State,  with  reasons,  at  what  frequency  you  will  hear  this  station.  The  IF  frequency  is 
455  kHz. 

5.6- 2  Consider  a  superheterodyne  FM  receiver  designed  to  receive  the  frequency  band  of  88  to  108  MHz 

with  IF  frequency  10.7  MHz.  What  is  the  range  of  frequencies  generated  by  the  local  oscillator 
for  this  receiver?  Analyze  and  explain  whether  it  is  possible  for  an  FM  receiver  to  receive  both  a 
desired  FM  station  and  an  image  FM  station. 

5.6- 3  In  shortwave  AM  radio,  the  IF  is  also  455  kHz.  A  receiver  is  designed  to  receive  shortwave 

broadcasting  of  31  meter  band  between  9.4  and  9.9  MHz. 

(a)  Determine  the  frequency  range  of  the  local  oscillator  for  this  receiver. 

(b)  Analyze  and  explain  whether  it  is  possible  for  this  receiver  to  receive  both  a  desired  AM 
station  and  an  image  station  within  the  same  31 -meter  band. 
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Figure  6.1 

Sampled  signal 
and  its  Fourier 
spectra. 


As  briefly  discussed  in  Chapter  1,  analog  signals  can  be  digitized  through  sampling  and 
quantization.  This  analog-to-digital  (A/D)  conversion  sets  the  foundation  of  modem 
digital  communication  systems.  In  the  A/D  converter,  the  sampling  rate  must  be  large 
enough  to  permit  the  analog  signal  to  be  reconstructed  from  the  samples  with  sufficient  accu¬ 
racy.  The  sampling  theorem,  which  is  the  basis  for  determining  the  proper  (lossless)  sampling 
rate  tor  a  given  signal,  has  played  a  huge  role  in  signal  processing,  communication  theory,  and 
A/D  circuit  design. 


6.1  SAMPLING  THEOREM 


We  first  show  that  a  signal  g(i)  whose  spectrum  is  band-limited  to  5  Hz,  that  is. 


G(/)  =  0  for  |/|  >  B 


can  be  reconstructed  exactly  (without  any  error)  from  its  discrete  time  samples  taken  uniformly 
at  a  rate  of  5  samples  per  second.  The  condition  is  that  R  >  26.  In  other  words,  the  minimum 
sampling  frequency  for  perfect  signal  recovery  is  R  =  25  H? 


The  spectrum  G(/)  in  Fig.  6. .  b  is  shown  as  real,  for  convenience.  Our  arguments  are  valid  for  complex  Gif)- 
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Figure  6.1 

Sampled  signal 
and  its  Fourier 
spectra. 
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sampled  signal  g(t)  and  the  original  analog  signal  g(t)  is 


g(t )  =  g(t)STs(t)  =  J2z(nTsW  -  nT *)  (6.!) 

n 


Because  the  impulse  train  Srs(t)  is  a  periodic  signal  of  period  Ts ,  it  can  be  expressed  as 
an  exponential  Fourier  series,  already  found  in  Example  3.13  as 

1  00  2tt 

&t,«)  =  —  J2  e’nWs'  “>s=y  =  2 jtfs  (6.2) 

&  n —  —  ** 


Therefore, 


g(t)  =  g(t)h,U) 

oo 

=  y  E 

i  g 

*  n=-oo 


(6.3) 


To  find  G(f),  the  Fourier  transform  of  g(t),  we  take  the  Fourier  transform  of  the  summation 
in  Eq.  (6.3).  Based  on  the  frequency-shifting  property,  the  transform  of  the  nth  term  is  shifted 
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by  nfs.  Therefore, 


1 

G(f)  =  —  G(/  —  nfs) 

*  S  . 


(6.4) 


This  means  that  the  spectrum  G(f)  consists  of  G(/),  scaled  by  a  constant  \/TSy  repeating 
periodically  with  period  /*  =  1  //,  Hz,  as  shown  in  Fig.  6.1e. 

After  uniform  sampling  that  generates  a  set  of  signal  samples  {g  (/:/,)},  the  vital  question 
becomes:  Can  g(t )  be  reconstructed  from  g(t)  without  any  loss  or  distortion?  If  we  are  to 
reconstruct  g(t)  from  g(t ),  equivalently  in  the  frequency  domain  we  should  be  able  to  recover 
G(f )  from  G(/).  Graphically  from  Fig.  6. 1 ,  perfect  recovery  is  possible  if  there  is  no  overlap 
among  the  replicas  in  G(/).  Figure  6.1e  clearly  shows  that  this  requires 


(6.5) 


fs>  2B 


Also,  the  sampling  interval  Ts  =  1  //*.  Therefore, 


Thus,  asjong  as  the  sampling  frequency  /,  is  greater  than  twice  the  signal  bandwidth  B  (in 
hertz),  G(/)  will  consist  of  nonoverlapping  repetitions  of  G(/).  When  this  is  true,  Fig.  6.1e 
shows  that  g(t)  can  be  recovered  from  its  samples  g(t)  by  passing  the  sampled  signal  g(t) 
through  an  ideal  low-pass  filter  of  bandwidth  B  Hz.  The  minimum  sampling  rate  /,  =  2# 
required  to  recover  g(t)  from  its  samples  g(t)  is  called  the  Nyquist  rate  for  g(t ),  and  the 
corresponding  sampling  interval  Ts  =  1/2 B  is  called  the  Nyquist  interval  for  the  low-pass 
signal  g(t).* 

We  need  to  stress  one  important  point  regarding  the  possibility  of /,  =  2 B  and  a  particular 
class  ot  low-pass  signals.  For  a  general  signal  spectrum,  we  have  proved  that  the  sampling 
rat e./i  >  2 B.  However,  it  the  spectrum  G(f)  has  no  impulse  (or  its  derivatives)  at  the  highest 
tiequency  By  then  the  overlap  is  still  zero  as  long  as  the  sampling  rate  is  greater  than  or  equal 
to  the  Nyquist  rate,  that  is. 


fs>2B 


If,  on  the  other  hand,  G(f)  contains  an  impulse  at  the  highest  frequency  then  the  equality 
must  be  removed  or  else  overlap  will  occur.  In  such  case,  the  sampling  rate/*  must  be  greater 
than  2 B  Hz.  A  well-known  example  is  a  sinusoid  g(t)  =  sin  2nB(t  -  to).  This  signal  is  band- 
limited  to  B  Hz,  but  all  its  samples  are  zero  when  uniformly  taken  at  a  rate/,  =  2 B  (starting 
at  t  =  /o),  and  g(t)  cannot  be  recovered  from  its  Nyquist  samples.  Thus,  for  sinusoids,  the 
condition  of/,  >  2 B  must  be  satisfied. 


6.1.1  Signal  Reconstruction  from  Uniform  Samples 

I  he  process  ot  reconstructing  a  continuous  time  signal  g(t)  from  its  samples  is  also  known  as 
interpolation.  In  Fig.  6. 1 ,  we  used  a  constructive  proof  to  show  that  a  signal  g(t)  band-limited 
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to  B  Hz  can  be  reconstructed  (interpolated)  exactly  from  its  samples.  This  means  not  only 
that  uniform  sampling  at  above  the  Nyquist  rate  preserves  all  the  signal  information,  but  also 
that  simply  passing  the  sampled  signal  through  an  ideal  low-pass  filter  of  bandwidth  B  Hz 
will  reconstruct  the  original  message.  As  seen  from  Eq.  (6.3),  the  sampled  signal  contains  a 
component  (l/Ts)g(t),  and  to  recover g(t)  [or  G(/)],  the  sampled  signal 

*(')  =  £*(«  Ts)S(t-nTs) 

must  be  sent  through  an  ideal  low-pass  filter  of  bandwidth  B  Hz  and  gain  Ts.  Such  an  ideal 
filter  response  has  the  transfer  function 


"</)  =  r-n(4 ^)  =  7'-n(4) 


(6.7) 


Ideal  Reconstruction 

To  recover  the  analog  signal  from  its  uniform  samples,  the  ideal  interpolation  filter  transfer 
function  found  in  Eq.  (6.7)  is  shown  in  Fig.  6.2a.  The  impulse  response  of  this  filter,  the  inverse 
Fourier  transform  of  //(/),  is 

h{t)  —  2 BTS  sine  (2 JtBt)  (6.8) 

Assuming  the  use  of  Nyquist  sampling  rate,  that  is,  2 BTS  =  1.  then 

h(t)  =  sine  {In Bt)  (6.9) 

This  h{t)  is  shown  in  Fig.  6.2b.  Observe  the  very  interesting  fact  that  h(t)  =  0  at  all  Nyquist 
sampling  instants  (/  =  ±n/2B)  except  t  =  0.  When  the  sampled  signal  g(/)  is  applied  at 
the  input  of  this  filter,  the  output  is  g{t).  Each  sample  in  g(t),  being  an  impulse,  generates  a 
sine  pulse  of  height  equal  to  the  strength  of  the  sample,  as  shown  in  Fig.  6.2c.  The  process  is 
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identical  to  that  shown  in  Fig.  6.6.  except  that  h(t)  is  a  sine  pulse  instead  of  a  rectangular  pulse. 
Addition  of  the  sine  pulses  generated  by  all  the  samples  results  in  g(t).  The  Ath  sample  of  the 
input  g(t)  is  the  impulse  g(kTs)S(t-  k Ts);  the  filter  output  of  this  impulse  is  g(kTs)h(t  —  kTs). 
Hence,  the  filter  output  to  g(t),  which  is  g(t),  can  now  be  expressed  as  a  sum, 

g(0  =  ^2g(kTs)h(t  -  kTs ) 

=  J^g(kTs)  sine  [2nB(t  -  kTs)]  (6.10a) 

k 

=  Y^g(kTs)  sine  (2^ Bl  -  kn)  (6.10b) 

k 

Equation  (6. 10)  is  the  interpolation  formula,  which  yields  values  of  g(t)  between  samples  as 
a  weighted  sum  of  all  the  sample  values. 


Figure  6.4 

Practical 

reconstruction 

(interpolation) 

pulse. 


Example  6. 1  Find  a  signal  g(t)  that  is  band-limited  to  B  Hz  and  whose  samples  are 

g(0)  =  1  and  *(±7*,)  =  g(±2Ts)  =  g(±37s)  =  •  •  •  =  0 

where  the  sampling  interval  Ts  is  the  Nyquist  interval  for  g(r),  that  is,  Ts  =  1  /2 B. 

We  use  the  interpolation  formula  (6.10b)  to  construct  g(t)  from  its  samples.  Since  all 
but  one  ot  the  Nyquist  samples  are  zero,  only  one  term  (corresponding  to  it  =0)  in  the 
summation  on  the  right-hand  side  of  Eq.  (6. 10b)  survives.  Thus, 


g(t)  =  sinc(27rBr) 


(6.11) 


This  signal  is  shown  in  Fig.  6.3.  Observe  that  this  is  the  only  signal  that  has  a  bandwidth 

B  Hz  and  sample  values  g(0)  =  I  and  g(nTs)  =  0  („  jk  0).  No  other  signal  satisfies  these 
conditions. 


Figure  6.3 

Signal  recon¬ 
structed  from  the 
Nyquist  samples 
in  Example  6.1 . 


Practical  Signal  Reconstruction  (Interpolation) 

3  5 1“ “ "«l  end unrealizable. Tl* - 

iS  retold  1,  °7u  km*  “Ure  °<  «  reconstruction  pulse  used  * 

reconstruction  of  Eq.  (6. 1 0).  For  practical  application  of  signal  reconstruction  (e.g.  » 
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Figure  6.4 

Practical 

reconstruction 

(interpolation) 

pulse. 


CD  player),  we  need  to  implement  realizable  signal  reconstruction  systems  from  the  uniform 
signal  samples. 

For  practical  implementation,  this  reconstruction  pulse  p(t)  must  be  easy  to  generate.  For 
example,  we  may  apply  the  reconstruction  pulse  p(t)  as  shown  in  Fig.  6.4.  However,  we  must 
first  use  the  nonideal  interpolation  pulse  p(t)  to  analyze  the  accuracy  of  the  reconstructed 
signal.  Let  us  denote  the  new  signal  from  reconstruction  as 


git)  -  ^2 g(nTs)p(t  -  nTs ) 


(6.12) 


n 


To  determine  its  relation  to  the  original  analog  signal  g(/),  we  can  see  from  the  properties  of 
convolution  and  Eq.(6.1)  that 


git)  =  J2s(nTs)p(t  -  nTs )  =  Pit)  *  J2  8(nT,Wt  -  nTs) 


n 


n 


(6.13a) 


=  pit)*  git) 


In  the  frequency  domain,  the  relationship  between  the  reconstruction  and  the  original  analog 
signal  can  rely  on  Eq.  (6.4) 


(6.13b) 


This  means  that  the  reconstructed  signal  git)  using  pulse  p(t)  consists  of  multiple  replicas  of 
Gif)  shifted  to  the  frequency  center  nf,  and  filtered  by  P(f).  To  fully  recover  g(t),  further 
filtering  of  g(t)  becomes  necessary.  Such  filters  are  often  referred  to  as  equalizers. 

Denote  the  equalizer  transfer  function  as  £(/).  Distortionless  reconstruction  requires  that 


Gif )  =  Eif)G(f) 

=  Eif)Pif)  ^r£C(/-«/i) 

/  t 

•*  I! 


This  relationship  clearly  illustrates  that  the  equalizer  must  remove  all  the  shifted  replicas 
G(f  —  nfs)  in  the  summation  except  for  the  low-pass  term  with  n  =  0,  that  is, 


Figure  6.4 

Practical 

reconstruction 

(interpolation) 

pulse. 


Eif)Pif)  =  0  l/l  >fs  —  & 


(6.14a) 
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Figure  6.5 

Practical  signal 
reconstruction. 


Figure  6.6 

Simple  interpo¬ 
lation  by  means 
of  simple 
rectangular 
pulses. 


Additionally,  distortionless  reconstruction  requires  that 

E(f)P(f)  =  Ts  |/|  <  B  (6.14b) 

The  equalizer  filter  £(/)  must  be  low-pass  in  nature  to  stop  all  frequency  content  above 
/,•  -  B  Hz.  and  it  should  be  the  inverse  of  />(/)  within  the  signal  bandwidth  of  B  Hz.  Figure  6.5 
demonstrates  the  diagram  of  a  practical  signal  reconstruction  system  utilizing  such  an  equalizer. 

Let  us  now  consider  a  very  simple  interpolating  pulse  generator  that  generates  short 
(zero-order  hold)  pulses.  As  shown  in  Fig.  6.6, 


This  is  a  gate  pulse  of  unit  height  with  pulse  duration  Tp.  The  reconstruction  will  first  generate 

8(0  =  £  g(nTs)  n  ( — n7v  ~°  5M 
n  \  Tp  ) 

The  transfer  funchon  of  Biter  />(/)  is  ,he  Fourier  transform  „f  n  (,/r„)  shifted  by  0.57,: 

P(f)  =  Tp  sine  (irfTp)  e'Wp  (6.15) 

As  a  result,  the  equalizer  frequency  response  should  satisfy 


T,/P(f)  |/|  <  B 

Flexible  A<  |/|  <  (\/Ts-B) 

0  I/I  >  (1/Tj  -  fi) 


f(/)  = 
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It  is  important  tor  us  to  ascertain  that  the  equalizer  passband  response  is  realizable.  First 
of  all,  we  can  add  another  time  delay  to  the  reconstruction  such  that 

£(/)  =  Ts  •  —  |/|  <  B  (6.1 6) 

sm  ( nJTp ) 

For  the  passband  gain  of  £(/)  to  be  well  defined,  it  is  imperative  for  us  to  choose  a  short 
pulse  width  Tp  such  that 


sin  (j xfTp) 


I/I  5  B 


This  means  that  the  equalizer  £(/)  does  not  need  to  achieve  infinite  gain.  Otherwise  the 
equalizer  would  become  unrealizable.  Equivalently,  this  requires  that 


TP  <  1  /B 

Hence,  as  long  as  the  rectangular  reconstruction  pulse  width  is  shorter  than  1  /B,  it  may  be 
possible  to  design  an  analog  equalizer  filter  to  recover  the  original  analog  signal  g(t)  from 
the  nonideal  reconstruction  pulse  train.  Of  course,  this  is  a  requirement  for  a  rectangular 
reconstruction  pulse  generator.  In  practice,  Tp  can  be  chosen  very  small,  to  yield  the  following 
equalizer  passband  response: 


£(/)  =  Ts  ■ 


nf 

sin  (7 tJTp) 


\f\<B 


(6.17) 


This  means  that  very  little  distortion  remains  when  very  short  rectangular  pulses  are  used  in 
signal  reconstruction.  Such  cases  make  the  design  of  the  equalizer  either  unnecessary  or  very 
simple.  An  illustrative  example  is  given  as  a  MATLAB  exercise  in  Sec.  6.9. 

We  can  improve  on  the  zero-order-hold  filter  by  using  the  first-order-hold  filter,  which 
results  in  a  linear  interpolation  instead  of  the  staircase  interpolation.  The  linear  interpolator, 
whose  impulse  response  is  a  triangle  pulse  A(r/2TV),  results  in  an  interpolation  in  which 
successive  sample  tops  are  connected  by  straight-line  segments  (Prob.  6.1-7). 

6.1.2  Practical  Issues  in  Signal  Sampling 
and  Reconstruction 


Realizability  of  Reconstruction  Filters  _ 

If  a  signal  is  sampled  at  the  Nyquist  rate/  =  2 B  Hz.  the  spectrum  G(/)  consists  of  repetitions 
of  G(  f)  without  any  gap  between  successive  cycles,  as  shown  in  Fig.  6.7a.  To  recover  g(t) 
from  g(t ),  we  need  to  pass  the  sampled  signal  g(t)  through  an  ideal  low-pass  filter  (dotted 
area  in  Fig.  6.7a).  As  seen  in  Sec.  3.5,  such  a  filter  is  unrealizable  in  practice;  it  can  be  closely 
approximated  only  with  infinite  time  delay  in  the  response.  This  means  that  we  can  recover 
the  signal  g(t)  from  its  samples  with  infinite  time  delay. 

A  practical  solution  to  this  problem  is_to  sample  the  signal  at  a  rate  higher  than  the  Nyquist 
rate  (/  >  2 B  or  cos  >  4; tB).  This  yields  G(/),  consisting  of  repetitions  of  G(/)  with  a  finite 
band  gap  between  successive  cycles,  as  shown  in  Fig.  6.7b.  We  can  now  recover  G(/)  from 
G(/)  |or  from  G(/)]  by  using  a  low-pass  filter  with  a  gradual  cutoff  characteristic  (dotted  area 
in  Fig.  6.7b).  But  even  in  this  case,  the  filter  gain  is  required  to  be  zero  beyond  the  first  cycle 
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Figure  6.7 

Spectra  of  a 
sampled  signal: 
(a)  at  the  Nyquist 
rate;  (b)  above 
the  Nyquist  rate. 


(b) 


Figure  6.8 

Aliasing  effect. 

(a)  Spectrum  of  a 
practical  signal 

gin 

(b)  Spectrum  of 
sampled  #(/). 

1(c)  Reconstructed 
signal  spectrum, 
(d)  Sampling 
|  scheme  using 

antialiasing  filter, 
(e)  Sampled 
signal  spectrum 
(dotted)  and  the 
reconstructed 
signal  spectrum 
(solid)  when 
antialiasing  filter 
is  used. 


ol  G(  / )  (Fig.  6.7b).  According  to  the  Paley-Wiener  criterion,  it  is  impossible  to  realize  even 
this  filter.  The  only  advantage  in  this  case  is  that  the  required  filter  can  be  better  approximated 
with  a  smaller  time  delay.  This  shows  that  it  is  impossible  in  practice  to  recover  a  band-limited 
signal  g(t)  exactly  from  its  samples,  even  if  the  sampling  rate  is  higher  than  the  Nyquist  rate. 

However,  as  the  sampling  rate  increases,  the  recovered  signal  approaches  the  desired  signal 
more  closely. 


The  Treachery  of  Aliasing 

There  is  another  fundamental  practical  difficulty  in  reconstructing  a  signal  from  its  samples. 
The  sampling  theorem  was  proved  on  the  assumption  that  the  signal  g(t)  is  band-limited. 
Alt  practical  signals  are  time-limited ;  that  is,  they  are  of  finite  duration  or  width.  We  can 
demonstrate  (Prob.  6. 1  -8)  that  a  signal  cannot  be  time-limited  and  band-limited  simultaneously. 
At,  me-hmued  signal  cannot  be  band-limited,  and  vice  versa  (but  a  signal  can  be  simultaneously 
non-time-hmited  and  non-band-limited).  Clearly,  all  practical  signals,  which  are  necessarily 
ime-hmited,  are  non-band-limited,  as  shown  in  Fig.  6.8a;  they  have  infinite  bandwidth,  and 

,r:‘7  \[]  C°ns!sts°f  overlaPPing  cycles  of  C(/)  repeating  every  fs  Hz  (the  sampling 
frequency),  as  dlustrated  m  Fig  6.8b.  Because  of  the  infinite  bandwidth  in  \L  case,  the  spectral 

Z  Z  IP™', dable’|regardless  of  the  samPl>ng  rate.  Sampling  a.  a  higher  rate  reduces  but 
tlh  r  '  i  °T  PP'"g  between  repeating  spectral  cycles.  Because  of  the  overlapping 
theolSI llv  ti°n8er  TPlCte  mformation  about  Gif),  and  it  is  no  longer  possible,  even 

passed  through  an  hTi*  0  fr°m  the  Sampled  s'gnal  SU)-  If  the  sampled  signal  is 

G  (  f)  (Fie  6  8  ■>  '°W'pass  hlter°f  cutoff  frequency  fs/2  Hz,  the  output  is  not  Gif)  but 
Gaif)  (Fig.  6.8c),  which  ,s  a  version  of  G(/)  distorted  as  a  result  of  two  separate  causes: 

1.  The  loss  of  the  tail  of  Gif)  beyond  |/|  >/j/2  Hz. 

2.  The  reappearance  of  this  tail  inverted  or  folded  back  onto  the  spectrum. 

frequency.  The  spectmimn^hT  ^  lr^Uency  -^/2  ~  ,/2T  Hz,  which  is  called  the  folding 

frequency!  For  instance,  a  com^nemOfreq  uencyS(//2) °nI° '*Sel^all"e*<^';^ 

‘ ppear  dS  comPonents  of  frequencies  below  fs/2.  This  tail  inversion. 


6.1  Sampling  Theorem  311 


Figure  6.8 

Aliasing  effect. 

(a)  Spectrum  of  a 
practical  signal 

g(t). 

(b)  Spectrum  of 
sampled  #(/). 

(c)  Reconstructed 
signal  spectrum. 

(d)  Sampling 
scheme  using 
antialiasing  filter. 

(e)  Sampled 
signal  spectrum 
(dotted)  and  the 
reconstructed 
signal  spectrum 
(solid)  when 
antialiasing  filter 
is  used. 
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known  as  spectral  folding  or  aliasing,  is  shown  shaded  in  Fig.  6.8b  and  also  in  Fig.  6.8c.  In 
the  process  of  aliasing,  not  only  are  we  losing  all  the  components  of  frequencies  above  the 
folding  frequency  fs/2  Hz,  but  these  very  components  reappear  (aliased)  as  lower  frequency 
components  in  Fig.  6.8b  or  c.  Such  aliasing  destroys  the  integrity  of  the  frequency  components 
below  the  folding  frequency /v/2,  as  depicted  in  Fig.  6.8c. 

The  problem  of  aliasing  is  analogous  to  that  of  an  army  when  a  certain  platoon  has  secretly 
defected  to  the  enemy  side  but  remains  nominally  loyal  to  their  army.  The  army  is  in  double 
jeopardy.  First,  it  has  lost  the  defecting  platoon  as  an  effective  fighting  force.  In  addition,  during 
actual  fighting,  the  army  will  have  to  contend  with  sabotage  caused  by  the  defectors  and  will 
have  to  use  loyal  platoon  to  neutralize  the  defectors.  Thus,  the  army  has  lost  two  platoons  to 
nonproductive  activity. 


Defectors  Eliminated:  The  Antialiasing  Filter 

It  you  were  the  commander  ot  the  betrayed  army,  the  solution  to  the  problem  would  be  obvious. 
As  soon  as  you  got  wind  of  the  defection,  you  would  incapacitate,  by  whatever  means,  the 
detecting  platoon.  By  taking  this  action  before  the  fighting  begins ,  you  lose  only  one  (the 
detecting)  platoon.  This  is  a  partial  solution  to  the  double  jeopardy  of  betrayal  and  sabotage, 
a  solution  that  partly  rectifies  the  problem  and  cuts  the  losses  in  half. 

We  follow  exactly  the  same  procedure.  The  potential  defectors  are  all  the  frequency  com¬ 
ponents  beyond  the  folding  frequency  fs/2  =  1/2T  Hz.  We  should  eliminate  (suppress)  these 
components  from  g(t)  before  sampling  g(t).  Such  suppression  of  higher  frequencies  can  be 
accomplished  by  an  ideal  low-pass  filter  of  cutoff/, /2  Hz,  as  shown  in  Fig.  6.8d.  This  is  called 
the  antialiasing  filter.  Figure  6.8d  also  shows  that  antialiasing  filtering  is  performed  before 
sampling.  Figure  6.8e  shows  the  sampled  signal  spectrum  and  the  reconstructed  signal  Gaa(f ) 
when  the  antialiasing  scheme  is  used.  An  antialiasing  filter  essentially  band-limits  the  signal 
s  ~  z‘  way*  we  l°se  only  the  components  beyond  the  folding  frequency  fs/ 2  Hz. 
ese  suppressed  components  now  cannot  reappear,  corrupting  the  components  of  frequencies 
c  c  \v  t  e  o  int  frequency.  Clearly,  use  of  an  antialiasing  filter  results  in  the  reconstructed 
S'gnal  specm,m  C.(/)  =  Gif)  for |/|  <1.1 2.  Thus,  although  we  lost  the  spectrum  beyoud 
f-  spec™  tor  all  the  frequencies  below /,/2  remains  intact.  The  effective  aliasing 

tstottion  is  cut  in  hall  owing  to  elimination  of  folding.  We  stress  again  that  the  antialiasing 
operation  must  be  performed  before  the  signal  is  sampled 

and  whho!!,a'r,inr  “'f  ^ '°  noise  Noi»-  totally,  has  a  wideband  spectrum, 

trr  “"I™"8; 'h7"aSi-  phenome"°"  taelf  will  cause  the  noise  components 

noise  spectrum' beyond  frequency 'n  S'8"al  bal,d'  Anlial'asinS  suPPresses  lhe  e",ire 

hlte/which'S118  *1*  l*'"8  il"al  *"“• is  “"liable.  In  practice  we  use  a  steep-cutoff 
hlter.  which  leaves  a  sharply  attenuated  residual  spectrum  beyond  the  folding  frequency/,/! 

F,rePb^shlremesr;Band‘,  i"'UWl  *** '» *PPear  Band-Limited 

means  that  glr)  are  sub-Nyquis,' ampkTof  g(!J  ^ZT  °'  °Ver'apping  cycles  of  C(/) 

Fig.  6.8b  as  the  soectrum  C  ( f  up-  PA  o  x  ^  '  However'  we  may  also  view  the  spectrum  m 

The  spectrum  Ga(f)  is  band-limited  to  ically  every^  Hz  without  overlap. 

fs/  Hz.  Hence,  these  (sub-Nyquist)  samples  of  gif) 

This  is  a  somewhat  simplified  picture.^  Cycl.es'  °nly.the  neighboring  spectral  cycles  overlap, 

of  the  infinite  width  of  all  practical  signal  soectra  Fort  '  <  i  *  r  aP  anc*  interact  with  every  other  cycle  because 
frequencies.  This  results  in  an  imig£S 8,1  Pra‘'ica'  »«**»  al«>  -ustdecay  at  I  higher 
When  such  an  assumption  is  not  justified,  aliasing  computationr^omeMrmwrintXed  immed‘ate  nelghb0rS 
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are  actually  the  Nyquist  samples  for  signal  g„(f).  In  conclusion,  sampling  a  non-band-limited 
signal  g(t)  at  a  rate/,  Hz  makes  the  samples  appear  to  be  the  Nyquist  samples  of  some  signal 
ga(t),  band-limited  tofs/2  Hz.  In  other  words,  sampling  makes  a  non-band-limited  signal 
appear  to  be  a  band-limited  signal  ga(t)  with  bandwidth/v/2  Hz.  A  similar  conclusion  applies 
if  g(r)  is  band-limited  but  sampled  at  a  sub-Nyquist  rate. 

6.1 .3  Maximum  Information  Rate:  Two  Pieces  of 
Information  per  Second  per  Hertz 

A  knowledge  of  the  maximum  rate  at  which  information  can  be  transmitted  over  a  channel  of 
bandwidth  B  Hz  is  of  fundamental  importance  in  digital  communication.  We  now  derive  one 
of  the  basic  relationships  in  communication,  which  states  that  a  maximum  of2B  independent 
pieces  of  information  per  second  can  be  transmitted ,  error  free ,  over  a  noiseless  channel  of 
bandwidth  B  Hz.  The  result  follows  from  the  sampling  theorem. 

First,  the  sampling  theorem  shows  that  a  low-pass  signal  of  bandwidth  B  Hz  can  be  fully 
recovered  from  samples  uniformly  taken  at  the  rate  of  2 B  samples  per  second.  Conversely, 
we  need  to  show  that  any  sequence  of  independent  data  at  the  rate  of  2 B  Hz  can  come  from 
uniform  samples  of  a  low-pass  signal  with  bandwidth  B.  Moreover,  we  can  construct  this 
low-pass  signal  from  the  independent  data  sequence. 

Suppose  a  sequence  of  independent  data  samples  is  denoted  as  {g,7}.  Its  rate  is  IB  samples 
per  second.  Then  there  always  exists  a  (not  necessarily  band-limited)  signal  g(f)  such  that 

gn  =  g(nTs)  Ts  =  2^ 

In  Figure  6.9a  we  illustrate  again  the  effect  of  sampling  the  non-band-limited  signal  g(t)  at 
sampling  rate/,  =  2 B  Hz.  Because  of  aliasing,  the  ideal  sampled  signal 

g(t)  =  ^  g(nTs)S(t  -  nTs ) 

n 

=  y',ga(nTs)8(t  -  nTs) 

n 

where  ga(t)  is  the  aliased  low-pass  signal  whose  samples  ga(nTs )  equal  to  the  samples  of 
g(nTs).  In  other  words,  sub-Nyquist  sampling  of  a  signal  g(f)  generates  samples  that  can 
be  equally  well  obtained  by  Nyquist  sampling  of  a  band-limited  signal  ga(t).  Thus,  through 
Figure  6.9,  we  demonstrate  that  sampling  g(r)  and  ga(t)  at  the  rate  of  2 B  Hz  will  generate  the 
same  independent  information  sequence  {g,,}: 

gn  =  g(nTs)  =  ga(nTs)  Ts  =  —  (6. 1 8) 

Also  from  the  sampling  theorem,  a  low-pass  signal  &,(/)  with  bandwidth  B  can  be  reconstructed 
from  its  uniform  samples  |Eq.  (6.10)] 

ga(t)  =  sinc  ( 2nBt  ~  kn ) 

n 

Assuminu  no  noise,  this  signal  can  be  transmitted  over  a  distortionless  channel  of  bandwidth 
B  Hz,  error  free.  At  the  receiver,  the  data  sequence  {g,,}  can  be  recovered  from  the  Nyquist 
samples  of  the  distortionless  channel  output  ga(t)  as  the  desired  information  data. 
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Figure  6.9 

(a)  Non-band- 
limited  signal 
spectrum  and  its 
sampled 
spectrum  G(f). 

(b)  Equivalent 
low-pass  signal 
spectrum  Ga(f) 
constructed  from 
uniform  samples 
of  git)  at 
sampling 

rate  2 B. 


This  theoretical  rate  of  communication  assumes  a  noise-free  channel.  In  practice,  chan¬ 
nel  noise  is  unavoidable,  and  consequently,  this  rate  will  cause  some  detection  errors.  In 
Chapter  14,  we  shall  present  the  Shannon  capacity  which  determines  the  theoretical  error-free 
communication  rate  in  the  presence  of  noise. 


Figure  6. 1 0 

Illustration  of 

practical 

sampling. 


6. 1 .4  Nonideal  Practical  Sampling  Analysis 

Thus  tar.  we  have  mainly  focused  on  ideal  uniform  sampling  that  can  use  an  ideal  impulse 
sampling  pulse  train  to  precisely  extract  the  signal  value  g(kTs)  at  the  precise  instant  of  t  = 
kTs.  In  practice,  no  physical  device  can  carry  out  such  a  task.  Consequently,  we  need  to 
consider  the  more  practical  implementation  of  sampling.  This  analysis  is  important  to  the 
better  understanding  of  errors  that  typically  occur  during  practical  A/D  conversion  and  their 
effects  on  signal  reconstruction. 

Practical  samplers  take  each  signal  sample  over  a  short  time  interval  T„  around  t  =  kTs. 
n  ot  lor  words,  every  T,  seconds,  the  sampling  device  takes  a  short  snapshot  of  duration  Tp 
rom  the  signal  g(t)  being  sampled.  This  is  just  like  taking  a  sequence  of  still  photographs 
ot  a  sprinter  during  an  100-meter  Olympic  race.  Much  like  a  regular  camera  that  generates  a 
still  picture  by  averaging  the  picture  scene  over  the  window  T„,  the  practical  sampler  would 
generate  a  sample  value  at  t  =  kTs  by  averaging  the  values  of  signal  g(t)  over  the  window  Tp, 


1  rV 2 

gl  (kTs)  =  —  /  g(kTs  +  /)  dt 

1 P J—T  “ 


•P  J—Tp/2 


(6.19a) 


'hiS  aVmg'"8  ^  ‘  device-dependent 

(6.19b) 


1  fTp/2 

gl(kTs)  =  —  q(t)g(kTs  +  t)dt 

‘p  J-Tp/2 
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Figure  6.10 

Illustration  of 

practical 

sampling. 


Thus  we  have  used  the  camera  analogy  to  establish  that  practical  samplers  in  fact  generate 
sampled  signal  of  the  form 


g(t)  =  £gl  (kTs)S(t  -  kTs )  (6.20) 

We  will  now  show  the  relationship  between  the  practically  sampled  signal  g(t)  and  the  original 
low-pass  analog  signal  g(t)  in  the  frequency  domain. 

We  will  use  Fig.  6.10  to  illustrate  the  relationship  between  g(t)  and  g(t)  for  the  special 
case  of  uniform  weighting.  This  means  that 


<7(0  = 


1 

0 


|/|  <  0.57}, 
|7|  >  0.57}, 


As  shown  in  Fig.  6.10.  gi(t)  can  be  equivalently  obtained  by  first  using  “natural  gating”  to 
generate  the  signal  snapshots 


g(t)  =  g(t)  ■  qTs(t) 


(6.21) 
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where 

OO 

^(0=  Y  ^U-nTt) 

tl— — OO 


Figure  6.10b  illustrates  the  snapshot  signal  g(f).  We  can  then  define  an  averaging  filter  with 
impulse  response 


MO  = 


— f-  <  t  <  — 


elsewhere 


or  transfer  function 


Ha(f)  =  sine  (nJTp) 

Sending  the  naturally  gated  snapshot  signal  g(/)  into  the  averaging  filter  generates  the 
output  signal 


£i(0  =  MO  *?(0 

As  illustrated  in  Fig.  6.10c,  the  practical  sampler  generate  a  sampled  signal  g(t)  by  sampling 
the  averaging  filter  output  g\(kTs).  Thus  we  have  used  Fig.  6.10c  to  establish  the  equivalent 
process  ot  taking  snapshots,  averaging,  and  sampling  in  generating  practical  samples  of  g(t). 
Now  we  can  examine  the  frequency  domain  relationships  to  analyze  the  distortion  generated 
by  practical  samplers. 

In  the  following  analysis,  we  will  consider  a  general  weighting  function  q(t)  whose  only 
constraint  is  that 


q(t)  =0,  t  #  (-0.57},,  0.57},) 

To  be^in,  note  that  qTs(t)  is  periodic.  Therefore,  its  Fourier  series  can  be  written  as 


00 

<M')=  Y 

n=~oo 

where 


q(t)e  iw°stdt 


Thus,  the  averaging  filter  output  signal  is 


oo 

=  M0*  Y  QngiOei™’' 

n=— oo 


(6.22) 
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In  the  frequency  domain,  we  have 

oo 

G\{f)  =  H(f)  £  Qn  G(f  —  nfs) 

n=—oo 

00 

=  sine  (nJTp)  £  QnG(f-nfs)  (6.23) 

n=—o o 

Because 

g(t)  =  Y/gl(kTs)8(t  -  lcTs) 

k 

we  can  apply  the  sampling  theorem  to  show  that 

G(f)  =  ^rTGi(f  +  mfs) 

=  f  E — [  <**+f*v’  1 E  a  c</ + -»/,  - »/.) 

5  m  1-  “  n 

=e(^e  (2n  sine  [(7r/  +  (n  +  €)7r/s)rp]^  C(/  +  £/j)  (6.24) 

The  last  equality  came  from  the  change  of  the  summation  index  l  =  m-  n. 

We  can  define  frequency  responses 

Fdf)  =  7T  J2  Q"  sinc  + (n  +  tWsWp] 

Is  n 

This  definition  allows  us  to  conveniently  write 

G(f)  =  £  Fe(f)Gi  (/  +  */,)  (6.25) 


For  the  low-pass  signal  G(f )  with  bandwidth  B  Hz,  applying  an  ideal  low-pass  (interpolation) 
filter  will  generate  a  distorted  signal 


Fo(f)G(f ) 


(6.26a) 


in  which 


Fo(f)  =  sinc  +  nfs)Tp\ 

Is 


(6.26b) 


It  can  be  seen  from  Eqs.  (6.25)  and  (6.26)  that  the  practically  sampled  signal  already  contains 
a  known  distortion  Fo(f). 


318  SAMPLING  AND  ANALOG-TO-DIGITAL  CONVERSION 


Moreover,  the  use  of  a  practical  reconstruction  pulse  p(t)  as  in  Eq.  (6.12)  will  generate 
additional  distortion.  Let  us  reconstruct  g(t)  by  using  the  practical  samples  to  generate 


g(t)  =  J2s\(nTs)p(t  -  nTx) 


n 


Then  from  Eq.  (6.13)  we  obtain  the  relationship  between  the  spectrum  of  the  reconstruction 
and  the  original  message  G(/)  as 


C(/)  =  P(/)  X>n(/)G(/ +  n/?) 


(6.27) 


n 


Since  G(f)  has  bandwidth  B  Hz,  we  will  need  to  design  a  new  equalizer  with  transfer  function 
£(/)  such  that  the  reconstruction  is  distortionless  within  the  bandwidth  B ,  that  is, 


1  \f\<B 

E(f)P(f)Fo(f)  =  Flexible  B  <  |/|  <fs  -  B 

0  \f\>fs~B 


(6.28) 


This  single  equalizer  can  be  designed  to  compensate  for  two  sources  of  distortion:  nonideal 
sampling  effect  in  F0(f)  and  nonideal  reconstruction  effect  in  P(f).  The  equalizer  design  is 
made  practically  possible  because  both  distortions  are  known  in  advance. 


6. 1 .5  Some  Applications  of  the  Sampling  Theorem 

The  sampling  theorem  is  very  important  in  signal  analysis,  processing,  and  transmission 
because  it  allows  us  to  replace  a  continuous  time  signal  by  a  discrete  sequence  of  numbers. 
Processing  a  continuous  time  signal  is  therefore  equivalent  to  processing  a  discrete  sequence  of 
numbers.  This  leads  us  directly  into  the  area  of  digital  filtering.  In  the  field  of  communication, 
the  transmission  of  a  continuous  time  message  reduces  to  the  transmission  of  a  sequence  of 
numbers.  This  opens  doors  to  many  new  techniques  of  communicating  continuous  time  sig¬ 
nals  by  pulse  trams.  The  continuous  time  signal  *(*)  is  samnled.  and  samnle  values  are  used  to 
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Figure  6.1 1 

PuTse-modulated 
signals,  (a)  The 
unmodulated 
signal,  (b)  The 
PAM  signal. 

(c)  The  PWM 
(PDM)  signal. 

(d)  The  PPM 
signal. 


Inn  i  ii  1 1 1 


Pulse  widths  are 
^  the  same  but 
their  locations 
change. 


(d) 


figure  6.12 

Time  division 
multiplexing  of 
two  signals. 


p'9ure  6.13 

PCM  system 
d'agram. 
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Sampler 

Quantizer 

Bit- 

encoder 

- ► 

10  11 

6-2  PULSE  CODE  MODULATION  (PCM) 

PCM  is  the  most  useful  and  widely  used  of  all  the  pulse  modulations  mentioned.  As  shown  in 
Fig.  6.13,  PCM  basically  is  a  tool  for  converting  an  analog  signal  into  a  digital  signal  (A/D 
conversion).  An  analog  signal  is  characterized  by  an  amplitude  that  can  take  on  any  value  over 
a  continuous  range.  This  means  that  it  can  take  on  an  infinite  number  of  values.  On  the  other 
hand,  digital  signal  amplitude  can  take  on  only  a  finite  number  of  values.  An  analog  signal  can 
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Figure  6.14 

Quantization  of 
a  sampled 
analog  signal. 


a* 

•o 

* 

o 


Quantized  samples  of  m(t) 


be  converted  into  a  digital  signal  by  means  of  sampling  and  quantizing,  that  is,  rounding  off  its 
value  to  one  of  the  closest  permissible  numbers  (or  quantized  levels),  as  shown  in  Fig.  6.14. 
The  amplitudes  of  the  analog  signal  m(t)  lie  in  the  range  (—mp<  mp),  which  is  partitioned  intoL 
subintervals,  each  ot  magnitude  Av  =  2 mp/L.  Next,  each  sample  amplitude  is  approximated 
by  the  midpoint  value  of  the  subinterval  in  which  the  sample  falls  (see  Fig.  6. 14  for  L  =  16). 
Each  sample  is  now  approximated  to  one  of  the  L  numbers.  Thus,  the  signal  is  digitized,  with 
quantized  samples  taking  on  any  one  of  the  L  values.  Such  a  signal  is  known  as  an  L-ary 
digital  signal. 


From  practical  viewpoint,  a  binary  digital  signal  (a  signal  that  can  take  on  only  two  values) 
is  very  desirable  because  of  its  simplicity,  economy,  and  ease  of  engineering.  We  can  convert 
an  L-ary  signal  into  a  binary  signal  by  using  pulse  coding.  Such  a  coding  for  the  case  of  L  =  16 
was  shown  in  Fig.  1.5.  This  code,  formed  by  binary  representation  of  the  16  decimal  digits 
from  0  to  1 5,  is  known  as  the  natural  binary  code  (NBC).  Other  possible  ways  of  assigning 
a  binary  code  will  be  discussed  later.  Each  of  the  16  levels  to  be  transmitted  is  assigned  one 
binary  code  ot  four  digits.  The  analog  signal  m(t)  is  now  converted  to  a  (binary)  digital  signal. 
A  binary  digit  is  called  a  bit  for  convenience.  This  contraction  of  “binary  digit”  to  “bit”  has 
become  an  industry  standard  abbreviation  and  is  used  throughout  the  book. 

Thus,  each  sample  in  this  example  is  encoded  by  four  bits.  To  transmit  this  binary  data, 
we  need  to  assign  a  distinct  pulse  shape  to  each  of  the  two  bits.  One  possible  way  is  to  assign  a 
negative  pulse  to  a  binary  0  and  a  positive  pulse  to  a  binary  1  (Fig.  1 .5)  so  that  each  sample  is 

signal^118111111601  ^  3  gmUP  0t  f°Ur  binary  PUlSCS  (PU|SC  C°de)' The  resultin£  si£nal  is  a  binary 

The  audio  signal  bandwidth  is  about  15  kHz.  However,  for  speech,  subjective  tests  show 
that  signal  articulation  (intelligibility)  is  not  affected  if  all  the  components  above  3400  Hz 
^hSUPP^SCd*  ’  Smce  the  obJective  in  telephone  communication  is  intelligibility  rather  than 
J  fn  a  TPOnentS  ab°Ve  3400  Hz  are  eliminated  by  a  low-pass  filter.  The  resulting 

kLhii:rT  Ut  a  Fate  °f  8000  Samp,es  Per  sec°nd  (8  kHz).  This  rate  is  intentionally 
for  sign  il  re  *  ^  ^  •  ^Ul*{  samP^ng  rate  ot  6.8  kHz  so  that  realizable  filters  can  be  applied 

7 7,  K  S“mple  iS  •*•"»  1““-^  imo  256  levels  (L  -  256).  which 

sign.,  requires  8  X  MOO  ="«, ^  ^ 


*  Components  below  300  Hz  may  also  be 


suppressed  without  affecting  the 


articulation. 
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The  compact  disc  (CD)  is  a  more  recent  application  of  PCM.  This  is  a  high-fidelity  situation 
requiring  the  audio  signal  bandwidth  to  be  20  kHz.  Although  the  Nyquist  sampling  rate  is  only 
40  kHz,  the  actual  sampling  rate  of  44. 1  kHz  is  used  for  the  reason  mentioned  earlier.  The 
signal  is  quantized  into  a  rather  large  number  (L  =  65,536)  of  quantization  levels,  each  of 
which  is  represented  by  16  bits  to  reduce  the  quantizing  error.  The  binary-coded  samples  ( 1 .4 
million  bit/s)  are  then  recorded  on  the  compact  disc. 


6.2.1  Advantages  of  Digital  Communication 

Here  are  some  of  the  advantages  of  digital  communication  over  analog  communication. 

1 .  Digital  communication,  which  can  withstand  channel  noise  and  distortion  much  better 
than  analog  as  long  as  the  noise  and  the  distortion  are  within  limits,  is  more  rugged  than  analog 
communication.  With  analog  messages,  on  the  other  hand,  any  distortion  or  noise,  no  matter 
how  small,  will  distort  the  received  signal. 

2.  The  greatest  advantage  of  digital  communication  over  analog  communication,  how¬ 
ever,  is  the  viability  of  regenerative  repeaters  in  the  former.  In  an  analog  communication  system, 
a  message  signal  becomes  progressively  weaker  as  it  travels  along  the  channel,  whereas  the 
cumulative  channel  noise  and  the  signal  distortion  grow  progressively  stronger.  Ultimately 
the  signal  is  overwhelmed  by  noise  and  distortion.  Amplification  offers  little  help  because  it 
enhances  the  signal  and  the  noise  by  the  same  proportion.  Consequently,  the  distance  over 
which  an  analog  message  can  be  transmitted  is  limited  by  the  initial  transmission  power.  For 
digital  communications,  a  long  transmission  path  may  also  lead  to  overwhelming  noise  and 
interferences.  The  trick,  however,  is  to  set  up  repeater  stations  along  the  transmission  path  at 
distances  short  enough  to  be  able  to  detect  signal  pulses  before  the  noise  and  distortion  have 
a  chance  to  accumulate  sufficiently.  At  each  repeater  station  the  pulses  are  detected,  and  new, 
clean  pulses  are  transmitted  to  the  next  repeater  station,  which,  in  turn,  duplicates  the  same  pro¬ 
cess.  If  the  noise  and  distortion  are  within  limits  (which  is  possible  because  of  the  closely  spaced 
repeaters),  pulses  can  be  detected  correctly.*  This  way  the  digital  messages  can  be  transmitted 
over  longer  distances  with  greater  reliability.  The  most  significant  error  in  PCM  comes  from 
quantizing.  This  error  can  be  reduced  as  much  as  desired  by  increasing  the  number  of  quan¬ 
tizing  levels,  the  price  of  which  is  paid  in  an  increased  bandwidth  of  the  transmission  medium 
(channel). 

3.  Digital  hardware  implementation  is  flexible  and  permits  the  use  of  microprocessors, 
digital  switching,  and  large-scale  integrated  circuits. 

4.  Digital  signals  can  be  coded  to  yield  extremely  low  error  rates  and  high  fidelity  as  well 
as  for  privacy. 

5.  It  is  easier  and  more  efficient  to  multiplex  several  digital  signals. 

6.  Digital  communication  is  inherently  more  efficient  than  analog  in  exchanging  SNR  for 
bandwidth. 

7.  Digital  signal  storage  is  relatively  easy  and  inexpensive.  It  also  has  the  ability  to  search 
and  select  information  from  distant  electronic  database. 

8.  Reproduction  with  digital  messages  can  be  extremely  reliable  without  deterioration. 
Analog  messages  such  as  photocopies  and  films,  for  example,  lose  quality  at  each  successive 
stage  of  reproduction  and  must  be  transported  physically  from  one  distant  place  to  another, 
often  at  relatively  high  cost. 


The  error  in  pulse  detection  can  be  made  negligible. 
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9.  The  cost  of  digital  hardware  continues  to  halve  every  two  or  three  years,  while 
performance  or  capacity  doubles  over  the  same  time  period.  And  there  is  no  end  in  sight 
yet  to  this  breathtaking  and  relentless  exponential  progress  in  digital  technology.  As  a 
result,  digital  technologies  today  dominate  in  any  given  area  of  communication  or  storage 
technologies. 

A  Historical  Note 

The  ancient  Indian  writer  Pingala  applied  what  turns  out  to  be  advanced  mathematical  concepts 
for  describing  prosody,  and  in  doing  so  presented  the  first  known  description  of  a  binary  numeral 
system,  possibly  as  early  as  the  eighth  century  BCE.6  Others,  like  R.  Hall  in  Mathematics  of 
Poetry  place  him  later,  circa  200  BCE.  Gottfried  Wilhelm  Leibniz  (1646-1716)  was  the  first 
mathematician  in  the  West  to  work  out  systematically  the  binary  representation  (using  1  s  and  Os) 
for  any  number.  He  felt  a  spiritual  significance  in  this  discovery,  believing  that  1,  representing 
unity,  was  clearly  a  symbol  for  God,  while  0  represented  nothingness.  He  reasoned  that  if  all 
numbers  can  be  represented  merely  by  the  use  of  1  and  0,  this  surely  proves  that  God  created 
the  universe  out  of  nothing! 


6.2.2  Quantizing 

As  mentioned  earlier,  digital  signals  come  from  a  variety  of  sources.  Some  sources  such  as 
computers  are  inherently  digital.  Some  sources  are  analog,  but  are  converted  into  digital  form 
by  a  variety  of  techniques  such  as  PCM  and  delta  modulation  (DM),  which  will  now  be 
analyzed.  The  rest  of  this  section  provides  quantitative  discussion  of  PCM  and  its  various 
aspects,  such  as  quantizing,  encoding,  synchronizing,  the  required  transmission  bandwidth 
and  SNR. 

For  quantization,  we  limit  the  amplitude  of  the  message  signal  m(t)  to  the  range  (—mp,  tnp ), 
as  shown  in  Fig.  6. 1 4.  Note  that  mp  is  not  necessarily  the  peak  amplitude  of  m(t) .  The  amplitudes 
ot  beyond  are  simply  chopped  off.  Thus,  mp  is  not  a  parameter  of  the  signal  m(t)\ 
rather,  it  is  the  limit  of  the  quantizer.  The  amplitude  range  ( —mp,  mp)  is  divided  into  L  uniformly 
spaced  intervals,  each  ot  width  Av  =  2mp/L.  A  sample  value  is  approximated  by  the  midpoint 
ol  the  interval  in  which  it  lies  (Fig.  6.14).  The  quantized  samples  are  coded  and  transmitted 
as  binary  pulses.  At  the  receiver  some  pulses  may  be  detected  incorrectly.  Hence,  there  are 
two  sources  ot  error  in  this  scheme:  quantization  error  and  pulse  detection  error.  In  almost  all 
practical  schemes,  the  pulse  detection  error  is  quite  small  compared  to  the  quantization  error 
and  can  be  ignored.  In  the  present  analysis,  therefore,  we  shall  assume  that  the  error  in  the 
received  signal  is  caused  exclusively  by  quantization.  A  general  analysis  that  includes  errors 
ot  both  types  is  given  in  Sec.  10.4. 

It  m(kTs) iis  the  kth  sample  of  the  signal  m(t),  and  if  m(kTs)  is  the  corresponding  quantized 
sample,  then  from  the  interpolation  formula  in  Eq.  (6. 10), 


m(t)  =  ^m(kTs)  sine  (2nBt  -  kn) 

k 


m(t)  =  ]T  m(kTs)  sine  (2nBt  -  kn) 
* 


and 
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where  fn(t)  is  the  signal  reconstructed  from  quantized  samples.  The  distortion  component  q(t) 
in  the  reconstructed  signal  is  q(t)  =  m(t)  -  m{t).  Thus, 


<7(0  =  ^2lm(kTs)  -  m(kTs )]  sine  (2 nBt  -  kn) 


k 


=  q(kTs)  sine  (2 nBt  -  kn) 


k 


where  q(kTs)  is  the  quantization  error  in  the  *th  sample.  The  signal  q(t)  is  the  undesired  signal, 
and,  hence,  acts  as  noise,  known  as  quantization  noise.  To  calculate  the  power,  or  the  mean 
square  value  of  q(t ),  we  have 


rT/2 

J-T/2 


q2(t)dt 


(6.29a) 


We  can  show  that  (see  Prob.  3.7-3)  the  signals  sine  (2 nBt  -  mn)  and  sine  (2ttBi  —  nn)  are 
orthogonal,  that  is. 


(6.29b) 


Because  of  this  result,  the  integrals  of  the  cross-product  terms  on  the  right-hand  side  of 
Eq.  (6.29a)  vanish,  and  we  obtain 


T^oo  T  J-t/2  * 


From  the  orthogonality  relationship  (6.29b),  it  follows  that 


(6.30) 


Because  the  sampling  rate  is  2 B,  the  total  number  of  samples  over  the  averaging  interval  T  is 
2BT.  Hence,  the  right-hand  side  of  Eq.  (6.30)  represents  the  average,  or  the  mean  of  the  square 
of  the  quantization  error.  The  quantum  levels  are  separated  by  Av  =  2 mp/L.  Since  a  sample 
value  is  approximated  by  the  midpoint  of  the  subinterval  (of  height  Av)  in  which  the  sample 
falls,  the  maximum  quantization  error  is  ±Av/2.  Thus,  the  quantization  error  lies  in  the  range 
(— Av/2,  Av/2),  where 


(6.31) 
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Assuming  that  the  error  is  equally  likely  to  lie  anywhere  in  the  range  (-Av/2,  Av/2),  the 
mean  square  quantizing  error  q 2  is  given  by* 


Av 


(fdq 


 (Av)2 

(6.32) 

12 

ml 

p 

~  3  L2 

(6.33) 

Because  q~(t)  is  the  mean  square  value  or  power  of  the  quantization  noise,  we  shall  denote  it 
by  Nq, 


Assuming  that  the  pulse  detection  error  at  the  receiver  is  negligible,  the  reconstructed  signal 
m(t)  at  the  receiver  output  is 


m(l)  =  m(t)  +  q(t) 

The  desired  signal  at  the  output  is  m(t),  and  the  (quantization)  noise  is  q(t).  Since  the  power 
of  the  message  signal  m(t)  is  m2(t),  then 


S0  =  m2(t ) 


1 


and 


2  «f(t) 


>ni 


(6.34) 


I"  thls  equation>  mP  is  the  Peak  amplitude  value  that  a  quantizer  can  accept,  and  is  therefore 
a  parameter  of  the  quantizer.  This  means  S0/N0,  the  SNR.  is  a  linear  function  of  the  message 

signal  power  nr(t)  (see  Fig.  6.18  with  (i  =  0). 
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6.2.3  Principle  of  Progressive  Taxation:  Nonuniform 
Quantization 


Recall  that  S()/N0 ,  the  SNR,  is  an  indication  of  the  quality  of  the  received  signal.  Ideally  we 
would  like  to  have  a  constant  SNR  (the  same  quality)  for  all  values  of  the  message  signal  power 

Unfortunately,  the  SNR  is  directly  proportional  to  the  signal  power  nr(t),  which  varies 
from  speaker  to  speaker  by  as  much  as  40  dB  (a  power  ratio  of  104).  The  signal  power  can  also 
vary  because  of  the  different  lengths  of  the  connecting  circuits.  This  indicates  that  the  SNR  in 
Eq.  (6.34)  can  vary  widely,  depending  on  the  speaker  and  the  length  of  the  circuit.  Even  for 
the  same  speaker,  the  quality  of  the  received  signal  will  deteriorate  markedly  when  the  person 
speaks  softly.  Statistically,  it  is  found  that  smaller  amplitudes  predominate  in  speech  and  larger 
amplitudes  are  much  less  frequent.  This  means  the  SNR  will  be  low  most  of  the  time. 

The  root  of  this  difficulty  lies  in  the  fact  that  the  quantizing  steps  are  of  uniform  value 
Av  =  2 mp/L.  The  quantization  noise  Nq  =  (Av)2/12  [Eq.  (6.32)]  is  directly  proportional 
to  the  square  of  the  step  size.  The  problem  can  be  solved  by  using  smaller  steps  for  smaller 
amplitudes  (nonuniform  quantizing),  as  shown  in  Fig.  6.15a.  The  same  result  is  obtained  by 
first  compressing  signal  samples  and  then  using  a  uniform  quantization.  The  input-output 
characteristics  of  a  compressor  are  shown  in  Fig.  6.15b.  The  horizontal  axis  is  the  normalized 
input  signal  (i.e.,  the  input  signal  amplitude  m  divided  by  the  signal  peak  value  mp).  The 
vertical  axis  is  the  output  signal  y.  The  compressor  maps  input  signal  increments  A m  into 
larger  increments  Ay  for  small  input  signals,  and  vice  versa  for  large  input  signals.  Hence,  a 
given  interval  Am  contains  a  larger  number  of  steps  (or  smaller  step  size)  when  m  is  small. 
The  quantization  noise  is  lower  for  smaller  input  signal  power.  An  approximately  logarithmic 
compression  characteristic  yields  a  quantization  noise  nearly  proportional  to  the  signal  power 

m2(t ),  thus  making  the  SNR  practically  independent  of  the  input  signal  power  over  a  large 
dynamic  range5  (see  later  Fig.  6.18).  This  approach  of  equalizing  the  SNR  appears  similar  to 
the  use  of  progressive  income  tax  to  equalize  incomes.  The  loud  talkers  and  stronger  signals 
are  penalized  with  higher  noise  steps  Av  to  compensate  the  soft  talkers  and  weaker  signals. 

Among  several  choices,  two  compression  laws  have  been  accepted  as  desirable  standards 
by  the  ITU-T:6  the  //-law  used  in  North  America  and  Japan,  and  the  A-law  used  in  Europe  and 
the  rest  of  the  world  and  on  international  routes.  Both  the  //-law  and  the  A-law  curves  have 
odd  symmetry  about  the  vertical  axis.  The  //-law  (for  positive  amplitudes)  is  given  by 


1 


y  = 


In  (1  +  fi) 

The  A-law  (for  positive  amplitudes)  is 
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(6.35a) 


(6.35b) 


These  characteristics  are  shown  in  Fig.  6.16. 

The  compression  parameter  n  (or  A)  determines  the  degree  of  compression.  To  obtain  a 
nearly  constant  S„/N0  over  a  dynamic  range  of  for  input  signal  power  40  dB,  n  should  be 
greater  than  1 00.  Early  North  American  channel  banks  and  other  digital  terminals  used  a  value 
of  n  =  100,  which  yielded  the  best  results  for  7-bit  (128-level)  encoding.  An  optimum  value 
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Figure  6.15 

Nonuniform 

quantization. 


(a) 


Figure  6.16 

(a)  /i* law 
characteristic. 

(b)  /4-Law 
characteristic. 


(b) 


of  n  =  255  has  been  used  for  all  North  American  8-bit  (256-level)  digital  terminals,  and  the 
earlier  value  of  n  is  now  almost  extinct.  For  the  A-law,  a  value  of  A  =  87.6  gives  comparable 
results  and  has  been  standardized  by  the  ITU-T.6 

The  compressed  samples  must  be  restored  to  their  original  values  at  the  receiver  by  using 
an  expander  with  a  characteristic  complementary  to  that  of  the  compressor.  The  compressor  and 
the  expander  together  are  called  the  compandor.  Figure  6.17  describes  the  use  of  compressor 
and  expander  along  with  a  uniform  quantizer  to  achieve  nonuniform  quantization. 

Generally  speaking,  time  compression  of  a  signal  increases  its  bandwidth.  But  in  PCM- 
we  are  compressing  not  the  signal  m(t)  in  time  but  its  sample  values.  Because  neither  the  time 
scale  not  the  number  of  samples  changes,  the  problem  of  bandwidth  increase  does  not  arise 
here.  It  is  shown  in  Sec.  10.4  that  when  a  M-law  compandor  is  used,  the  output  SNR  is 


So_  _  3 L2 

No  [In  ( 1  +  n)]2 


M2  » 


) 


(6.36) 


figure  6.17 

Utilization  of 
compressor  and 
expander  for 
nonuniform 
quantization. 
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(a) 


(b) 


Figure  6.17 

Utilization  of 
compressor  and 
expander  for 
nonuniform 
quantization. 


The  output  SNR  for  the  cases  of  fx  =  255  and  fx  =  0  (uniform  quantization)  as  a  function  of 
nr{t)  (the  message  signal  power)  is  shown  in  Fig.  6.18. 

The  Compandor 

A  logarithmic  compressor  can  be  realized  by  a  semiconductor  diode,  because  the  V-I 
characteristic  of  such  a  diode  is  of  the  desired  form  in  the  first  quadrant: 


Two  matched  diodes  in  parallel  with  opposite  polarity  provide  the  approximate  characteristic 
in  the  first  and  third  quadrants  (ignoring  the  saturation  current).  In  practice,  adjustable  resistors 
are  placed  in  series  with  each  diode  and  a  third  variable  resistor  is  added  in  parallel.  By  adjusting 
various  resistors,  the  resulting  characteristic  is  made  to  fit  a  finite  number  of  points  (usually 
seven)  on  the  ideal  characteristics. 

An  alternative  approach  is  to  use  a  piecewise  linear  approximation  to  the  logarithmic  char¬ 
acteristics.  A  1 5-segmented  approximation  (Fig.  6. 1 9)  to  the  eighth  bit  (L  =  256)  with  fx  =  255 
law  is  widely  used  in  the  D2  channel  bank  that  is  used  in  conjunction  with  the  T 1  carrier  system. 
The  segmented  approximation  is  only  marginally  inferior  in  terms  of  SNR.s  The  piecewise 
linear  approximation  has  almost  universally  replaced  earlier  logarithmic  approximations  to 
the  true  fx  =  255  characteristic  and  is  the  method  of  choice  in  North  American  standards. 
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Figure  6.18 

Ratio  of  signal  to 
quantization 
noise  in  PCM 
with  and  without 
compression. 


Figure  6.19 

Piecewise  linear 

compressor 

characteristic. 


.  nuugu  a  U  uc  M  compressor  working  with  a  /z  =  255  expander  will  be  superior  to  sun- 
ilar  piecewise  linear  devices,  a  digital  terminal  device  exhibiting  the  true  characteristic  in 
today  s  network  must  work  end-to-end  against  other  network  elements  that  use  the  piecewise 
linear  approximation.  Such  a  combination  of  differing  characteristics  is  inferior  to  either  of 
the  characteristics  obtained  when  the  compressor  and  the  expander  operate  using  the  same 
compression  law. 

In  the  standard  audio  file  format  used  by  Sun,  Unix  and  Java,  the  audio  in  “au 
hies  can  be  pulse-code-modulated  or  compressed  with  the  ITU-T  G71 1  standard  through 
either  the  M-law  or  the  4-law/’  The  /i-law  compressor  (ji  =  255)  converts  14-brt 
signed  linear  PCM  samples  to  logarithmic  8-bit  samples,  leading  to  storage  saving.  The 
/Ulaw  compressor  (4  =  87.6)  converts  13-bit  signed  linear  PCM  samples  to  logarithmic 
8-bit  samples  In  both  cases,  sampling  at  the  rate  of  8000  Hz,  a  G77  encoder  thus  creates  from 
audio  signals  bit  streams  at  64  kilobits  per  second  (kbit/s).  Since  the  4-law  and  the  fi- law  are 

riotd  .hTiTm6’  aud,r recoded  im°  “au” fiies  can  be  dec°ded  jn  edher  f°rmat- u  shouid 

and  4-law  audio  format  has  compression  options  that  use 
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The  PCM  Encoder 

The  multiplexed  PAM  output  is  applied  at  the  input  of  the  encoder,  which  quantizes  and  encodes 
each  sample  into  a  group  of  n  binary  digits.  A  variety  of  encoders  is  available.7- 10  We  shall 
discuss  here  the  digit-at-a-time  encoder,  which  makes  n  sequential  comparisons  to  generate  an 
n-bit  codeword.  The  sample  is  compared  with  a  voltage  obtained  by  a  combination  of  reference 
voltages  proportional  to  27,  26.  25, . . . ,  2°.  The  reference  voltages  are  conveniently  generated 
by  a  bank  of  resistors  /?,  2 R,  2 2R . 2 1 R. 

The  encoding  involves  answering  successive  questions,  beginning  with  whether  the  sam¬ 
ple  is  in  the  upper  or  lower  half  of  the  allowed  range.  The  first  code  digit  1  or  0  is  generated, 
depending  on  whether  the  sample  is  in  the  upper  or  the  lower  half  of  the  range.  In  the  second 
step,  another  digit  I  or  0  is  generated,  depending  on  whether  the  sample  is  in  the  upper  or  the 
lower  half  of  the  subinterval  in  which  it  has  been  located.  This  process  continues  until  the  last 
binary  digit  in  the  code  has  been  generated. 

Decoding  is  the  inverse  of  encoding.  In  this  case,  each  of  the  n  digits  is  applied  to  a  resistor 
of  different  value.  The  fcth  digit  is  applied  to  a  resistor  2 kR.  The  currents  in  all  the  resistors 
are  added.  The  sum  is  proportional  to  the  quantized  sample  value.  For  example,  a  binary  code 
word  10010110  will  give  a  current  proportional  to  27  +  0  +  0  +  24  +  0  +  22  +  21  +  0  =  150. 
This  completes  the  D/A  conversion. 

6.2.4  Transmission  Bandwidth  and  the  Output  SNR 

For  a  binary  PCM,  we  assign  a  distinct  group  of  n  binary  digits  (bits)  to  each  of  the  /.quantization 
levels.  Because  a  sequence  of  n  binary  digits  can  be  arranged  in  2"  distinct  patterns. 


L  =  2n  or  n  =  log2  L 


(6.37) 


each  quantized  sample  is,  thus,  encoded  into  n  bits.  Because  a  signal  m(t)  band-limited  to  B 
Hz  requires  a  minimum  of  2 B  samples  per  second,  we  require  a  total  of  2nB  bit/s,  that  is,  2nB 
pieces  of  information  per  second.  Because  a  unit  bandwidth  ( 1  Hz)  can  transmit  a  maximum  of 
two  pieces  of  information  per  second  (Sec.  6. 1 .3),  we  require  a  minimum  channel  of  bandwidth 
Bj  Hz,  given  by 


Bj  —  nB  Hz 


(6.38) 


This  is  the  theoretical  minimum  transmission  bandwidth  required  to  transmit  the  PCM  signal. 
In  Secs.  7.2  and  7.3,  we  shall  see  that  for  practical  reasons  we  may  use  a  transmission  bandwidth 
higher  than  this  minimum. 


The  maximum  acceptable  error  in  the  sample  amplitude  (the  maximum  quantization  error)  is 
0.5%  of  the  peak  amplitude  mp.  The  quantized  samples  are  binary  coded.  Find  the  minimum 
bandwidth  of  a  channel  required  to  transmit  the  encoded  binary  signal.  If  24  such  signals 
are  time-division-multiplexed,  determine  the  minimum  transmission  bandwidth  required  to 
transmit  the  multiplexed  signal. 

I  The  Nyquist  sampling  rate  is  R/y  =  2  x  3000  =  6000  Hz  (samples  per  second).  The 
actual  sampling  rate  is  Ra  =  6000  x  (1^)  =  8000  Hz. 

The  quantization  step  is  Ai\  and  the  maximum  quantization  error  is  ±Av/2. 


330 


SAMPLING  AND  ANALOG-TO-DIGITAL  CONVERSION 


Therefore,  from  Eq.  (6.31), 


For  binary  coding,  L  must  be  a  power  of  2.  Hence,  the  next  higher  value  of  L  that  is  a 
power  of  2  is  L  =  256. 

From  Eq.  (6.37),  we  need  n  =  log2  256  =  8  bits  per  sample.  We  require  to  transmit 
a  total  of  C  =  8  x  8000  =  64. 000  bit/s.  Because  we  can  transmit  up  to  2  bit/s  per  hertz 
of  bandwidth,  we  require  a  minimum  transmission  bandwidth  Bj  =  C / 2  =  32  kHz. 

The  multiplexed  signal  has  a  total  of  Cm  =  24  x  64,000  =  1.536  Mbit/s,  which 
requires  a  minimum  of  1.536/2  =  0.768  MHz  of  transmission  bandwidth. 


Exponential  Increase  of  the  Output  SNR 

From  Eq.  (6.37),  Lr  =  22",  and  the  output  SNR  in  Eq.  (6.34)  or  Eq.  (6.36)  can  be  expressed  as 


(6.39) 


where 


jln  ^  | '  ^ 2  [compressed  case,  in  Eq.  (6.36)] 


Substitution  of  Eq.  (6.38)  into  Eq.  (6.39)  yields 


TT  =  c(2)2Bt'B 
No 


(6.40) 


From  Eq.  (6.40)  we  observe  that  the  SNR  increases  exponentially  with  the  transmission  band- 
w  idth  Bj .  This  trade  of  SNR  tor  bandwidth  is  attractive  and  comes  close  to  the  upper  theoretical 
limit.  A  small  increase  in  bandwidth  yields  a  large  benefit  in  terms  of  SNR.  This  relationship 
is  clearly  seen  by  using  the  decibel  scale  to  rewrite  Eq.  (6.39)  as 


=  10loglo[c(2)2"] 

=  101og|0c  +  2/ilogl02 
=  (a  +  6n)  dB 


(6.41) 
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Example  6.3  A  signal  m(t )  of  bandwidth  B  =  4  kHz  is  transmitted  using  a  binary  companded  PCM  with 
H  =  1 00.  Compare  the  case  of  L  =  64  with  the  case  of  L  —  256  from  the  point  of  view  of 
transmission  bandwidth  and  the  output  SNR. 

For  L  =  64,  n  =  6,  and  the  transmission  bandwidth  is  nB  =  24  kHz. 

Sq 
N~o 

a 

Hence, 

So 

-f  =  27.49  dB 
No 

For  L  =  256,  n  =  8.  and  the  transmission  bandwidth  is  32  kHz, 

~  =  a  +  6«  =  39.49  dB 
N0 

The  difference  between  the  two  SNRs  is  12  dB,  which  is  a  ratio  of  16.  Thus,  the  SNR 
for  L  =  256  is  16  times  the  SNR  for  L  =  64.  The  former  requires  just  about  33%  more 
bandwidth  compared  to  the  latter. 


Comments  on  Logarithmic  Units 

Logarithmic  units  and  logarithmic  scales  are  very  convenient  when  a  variable  has  a  large 
dynamic  range.  Such  is  the  case  with  frequency  variables  or  SNRs.  A  logarithmic  unit  for  the 
power  ratio  is  the  decibel  (dB).  defined  as  10  log10  (power  ratio).  Thus,  an  SNR  is  x  dB,  where 

S 

x  =  10  logl0  - 

We  use  the  same  unit  to  express  power  gain  or  loss  over  a  certain  transmission  medium.  For 
instance,  if  over  a  certain  cable  the  signal  power  is  attenuated  by  a  factor  of  1 5,  the  cable  gain  is 

G  =  10  log|0 =  -H.76  dB 

or  the  cable  attenuation  (loss)  is  1 1.76  dB. 

Although  the  decibel  is  a  measure  of  power  ratios,  it  is  often  used  as  a  measure  of  power 
itself.  For  instance,  “100  watt”  may  be  considered  to  be  a  power  ratio  of  100  with  respect  to 
1  -watt  power,  and  is  expressed  in  units  of  dB  W  as 


=  (a  +  36)  dB 
3 


=  10  log 


[In  (101  )]2 


=  -8.51 


^dBW  =  10  log  io  100  —  20  dBW 
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Thus,  100- watt  power  is  20  dBW.  Similarly,  power  measured  with  respect  to  1  mW  power  is 
dBm.  For  instance,  100-watt  power  is 


100W 

PdBm  =  10  |og 


=  50  dBm 


6.3  DIGITAL  TELEPHONY:  PCM  IN  T1 
CARRIER  SYSTEMS 

A  Historical  Note 

Because  of  the  unavailability  of  suitable  switching  devices,  more  than  20  years  elapsed  between 
the  invention  of  PCM  and  its  implementation.  Vacuum  tubes,  used  before  the  invention  of  the 
transistor,  were  not  only  bulky,  but  they  were  poor  switches  and  dissipated  a  lot  of  heat.  Systems 
having  vacuum  tubes  as  switches  were  large,  rather  unreliable,  and  tended  to  overheat.  PCM 
was  just  waiting  for  the  invention  of  the  transistor,  which  happens  to  be  a  small  device  that 
consumes  little  power  and  is  a  nearly  ideal  switch. 

Coincidentally,  at  about  the  time  the  transistor  was  invented,  the  demand  for  telephone 
service  had  become  so  heavy  that  the  existing  system  was  overloaded,  particularly  in  large 
cities.  It  was  not  easy  to  install  new  underground  cables  because  space  available  under  the 
streets  in  many  cities  was  already  occupied  by  other  services  (water,  gas,  sewer,  etc.).  Moreover, 
digging  up  streets  and  causing  many  dislocations  was  not  very  attractive.  An  attempt  was  made 
on  a  limited  scale  to  increase  the  capacity  by  frequency-division-multiplexing  several  voice 
channels  through  amplitude  modulation.  Unfortunately,  the  cables  were  primarily  designed 
tor  the  audio  voice  range  (0-4  kHz)  and  suffered  severely  from  noise.  Furthermore,  cross  talk 
between  pairs  of  channels  on  the  same  cable  was  unacceptable  at  high  frequencies.  Ironically 
PCM— requiring  a  bandwidth  several  times  larger  than  that  required  for  FDM  signals— offered 
the  solution.  This  is  because  digital  systems  with  closely  spaced  regenerative  repeaters  can 
work  satisfactorily  on  noisy  lines  that  give  poor  high-frequency  performance.^  The  repeaters, 
spaced  approximately  6000  feet  apart,  clean  up  the  signal  and  regenerate  new  pulses  before  the 
pulses  get  too  distorted  and  noisy.  This  is  the  history  of  the  Bell  System’s  T1  carrier  system. '  10 
A  pair  of  wires  that  used  to  transmit  one  audio  signal  of  bandwidth  4  kHz  is  now  used  to  transmit 
24  time-division-multiplexed  PCM  telephone  signals  with  a  total  bandwidth  of  1 .544  MHz. 

T1  Time  Division  Multiplexing 

A  schematic  of  a  T1  carrier  system  is  shown  in  Fig.  6.20a.  All  24  channels  are  sampled 
in  a  sequence.  The  sampler  output  represents  a  time-division-multiplexed  PAM  signal.  The 
multiplexed  PAM  signal  is  now  applied  to  the  input  of  an  encoder  that  quantizes  each  sample 
and  encodes  it  into  eight  binary  pulses — a  binary  codeword*  (see  Fig  6  20b).  The  signal- 
now  converted  to  digital  form,  is  sent  over  the  transmission  medium.  Regenerative  repeaters 
spaced  approximately  6000  feet  apart  detect  the  pulses  and  retransmit  new  pulses.  At  the 
receiver,  the  decoder  converts  the  binary  pulses  into  samples  (decoding).  The  samples  are 
then  demultiplexed  (i.e.,  distributed  to  each  of  the  24  channels).  The  desired  audio  signal  is 
reconstructed  by  passing  the  samples  through  a  low-pass  filter  in  each  channel. 


In  an  earlier  version,  each  sample  was  encoded  by 


seven  bits.  An  additional  bit  was  added  for  signaling. 
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Figure  6.20 

T1  carrier 
system. 


Channel 


(a) 


(b) 


The  commutators  in  Fig.  6.20  are  not  mechanical  but  are  high-speed  electronic  switching 
circuits.  Several  schemes  are  available  for  this  purpose. 1 1  Sampling  is  done  by  electronic  gates 
(such  as  a  bridge  diode  circuit,  as  shown  in  Fig.  4.5a)  opened  periodically  by  narrow  pulses  of 
2  ns  duration.  The  1.544  Mbit/s  signal  of  the  T1  system,  called  digital  signal  level  1  (DS1), 
is  used  further  to  multiplex  into  progressively  higher  level  signals  DS2,  DS3,  and  DS4,  as 
described  next,  in  Sec.  6.4 

After  the  Bell  System  introduced  the  T1  carrier  system  in  the  United  States,  dozens  of 
variations  were  proposed  or  adopted  elsewhere  before  the  ITU-T  standardized  its  30-channel 
PCM  system  with  a  rate  of  2.048  Mbit/s  (in  contrast  to  T1 .  with  24  channels  and  1 .544  Mbit/s). 
The  30-channel  system  is  used  all  over  the  world,  except  in  North  America  and  Japan.  Because 
of  the  widespread  adoption  of  the  T1  carrier  system  in  the  United  States  and  Japan  before  the 
ITU-T  standardization,  the  two  standards  continue  to  be  used  in  different  parts  of  the  world, 
with  appropriate  interfaces  in  international  connections. 
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Figure  6.21 
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Synchronizing  and  Signaling 

Binary  codewords  corresponding  to  samples  of  each  of  the  24  channels  are  multiplexed  in 
a  sequence,  as  shown  in  Fig.  6.21.  A  segment  containing  one  codeword  (corresponding  to 
one  sample)  from  each  of  the  24  channels  is  called  a  frame.  Each  frame  has  24  x  8  =  192 
information  bits.  Because  the  sampling  rate  is  8000  samples  per  second,  each  frame  takes 
1 25  ms.  To  separate  information  bits  correctly  at  the  receiver,  it  is  necessary  to  be  sure  where 
eac  frame  begins.  Therefore,  a  framing  bit  is  added  at  the  beginning  of  each  frame.  This 
ma  es  a  total  of  1 93  bits  per  frame.  Framing  bits  are  chosen  so  that  a  sequence  of  framing  bits, 

one  at  the  beginning  of  each  frame,  forms  a  special  pattern  that  is  unlikely  to  be  formed  in  a 
speech  signal. 

The  sequence  formed  by  the  first  bit  from  each  frame  is  examined  by  the  logic  of  the 
receiving  terminal.  If  this  sequence  does  not  follow  the  given  code  pattern  (framing  bit  pattern), 
a  synchronization  loss  is  detected,  and  the  next  position  is  examined  to  determine  whether  it 

is  actually  the  framing  bit.  It  takes  about  0.4  to  6  ms  to  detect  and  about  50  ms  (in  the  worst 
possible  case)  to  reframe. 


snonHinc  /  ^  \°  m  °rmatlon  and  fram*ng  bits,  we  need  to  transmit  signaling  bits  coi 
K  f"8  ”  *S'  “  We"  “  ttlePh°"'  On-hook/off-hook  signal  Whan  chan. 
Ihe  swjiche  dR  "  ",l  to  transmi*  signals  between  telephone  switching  systei 

S  m  IO  C™""""CaK  wiIh  cach  other  to  nse  the  channels  effect!*. 

.ersTon  21,"'  T  T* for ' instead  of  the  seven  bits  used  in  Ihe  ear 
rather  low  snecd  "lg1?han“  pr“,ldcd  i>y  the  eighth  bit  is  no  longer  available.  Since  onl 

mlon  r!!'  ™  8"af  8  ChaT  15  req“ired-  ralh"  «<■  extra  time  slots  for  this  ini 
mat, on,  we  use  one  mfomtatton  b„  (the  least  significant  bit)  of  every  sixth  sample  of  a  sig 


*  In  the  earlier  version  of  Tl,  quantizing  levels  L  = 
used  for  signaling. 


1 28  required  only  seven  information  bits.  The  eighth  bit  was 
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to  transmit  this  information.  This  means  that  every  sixth  sample  of  each  voice  signal  will  have 
a  possible  error  corresponding  to  the  least  significant  digit.  Every  sixth  frame,  therefore,  has 
7  x  24  =  168  information  bits,  24  signaling  bits,  and  1  framing  bit.  In  all  the  remaining  frames, 
there  are  192  information  bits  and  1  framing  bit.  This  technique  is  called  7=?  bit  encoding,  and 
the  signaling  channel  so  derived  is  called  robbed-bit  signaling.  The  slight  SNR  degradation 
suffered  by  impairing  one  out  of  six  frames  is  considered  to  be  an  acceptable  penalty.  The  sig¬ 
naling  bits  for  each  signal  occur  at  a  rate  of  8000/6  =  1333  bit/s.  The  frame  format  is  shown 
in  Fig.  6.21. 

The  older  seven-bit  framing  format  required  only  that  frame  boundaries  be  identified 
so  that  the  channels  could  be  located  in  the  bit  stream.  When  signaling  is  superimposed  on 
the  channels  in  every  sixth  frame,  it  is  necessary  to  identify,  at  the  receiver,  which  frames 
are  the  signaling  frames.  A  new  framing  structure,  called  the  superframe,  was  developed 
to  take  care  of  this.  The  framing  bits  are  transmitted  at  8  kbit/s  as  before  and  occupy  the 
first  bit  of  each  frame.  The  framing  bits  form  a  special  pattern,  which  repeats  in  12  frames: 
100011011100.  The  pattern  thus  allows  the  identification  of  frame  boundaries  as  before,  but 
also  allows  the  determination  of  the  locations  of  the  sixth  and  twelfth  frames  within  the  super- 
frame.  Note  that  the  superframe  described  here  is  12  frames  in  length.  Since  two  bits  per 
superframe  are  available  for  signaling  for  each  channel,  it  is  possible  to  provide  four-state 
signaling  for  a  channel  by  using  the  four  possible  patterns  of  the  two  signaling  bits:  00,  01, 
10,  and  11.  Although  most  switch-to-switch  applications  in  the  telephone  network  require 
only  two-state  signaling,  three-  and  four-state  signaling  techniques  are  used  in  certain  special 
applications. 

Advances  in  digital  electronics  and  in  coding  theory  have  made  it  unnecessary  to  use 
the  full  8  kbit/s  of  the  framing  channel  in  a  DS1  signal  to  perform  the  framing  task.  A  new 
superframe  structure,  called  the  extended  superframe  (ESF)  format,  was  introduced  during 
the  1970s  to  take  advantage  of  the  reduced  framing  bandwidth  requirement.  An  ESF  is  24 
frames  in  length  and  carries  signaling  bits  in  the  eighth  bit  of  each  channel  in  frames  6,  1 2,  1 8, 
and  24.  Sixteen-state  signaling  is  thus  possible  and  is  sometimes  used  although,  as  with  the 
superframe  format,  most  applications  require  only  two-state  signaling. 

The  8  kbit/s  overhead  (framing)  capacity  of  the  ESF  signal  is  divided  into  three  channels:  2 
kbit/s  for  framing,  2  kbit/s  for  a  cyclic  redundancy  check  (CRC-6)  error  detection  channel,  and 
4  kbit/s  for  a  data  channel.  The  highly  reliable  error  checking  provided  by  the  CRC-6  pattern 
and  the  use  of  the  data  channel  to  transport  information  on  signal  performance  as  received 
by  the  distant  terminal  make  ESF  much  more  attractive  to  service  providers  than  the  older 
superframe  format.  More  discussions  on  CRC  error  detection  can  be  found  in  Chapter  15. 

The  2  kbit/s  framing  channel  of  the  ESF  format  carries  the  repetitive  pattern  001011 . . . ,  a 
pattern  that  repeats  in  24  frames  and  is  much  less  vulnerable  to  counterfeiting  than  the  patterns 
associated  with  the  earlier  formats. 

For  various  reasons,  including  the  development  of  intelligent  network-switching  nodes, 
the  function  of  signaling  is  being  transferred  out  from  the  channels  that  carry  the  messages 
or  data  signals  to  separate  signaling  networks  called  common  channel  interoffice  signaling 
(CCIS)  systems.  The  universal  deployment  of  such  systems  will  significantly  decrease  the 
importance  of  robbed-bit  signaling,  and  all  eight  bits  of  each  message  (or  sample)  will  be 
transmitted  in  most  applications. 

The  Conference  on  European  Postal  and  Telegraph  Administration  (CEPT)  has  standard¬ 
ized  a  PCM  with  256  time  slots  per  frame.  Each  frame  has  30  x  8  =  240  information  bits, 
corresponding  to  30  speech  channels  (with  eight  bits  each).  The  remaining  16  bits  per  frame 
are  used  for  frame  synchronization  and  signaling.  Therefore,  although  the  bit  rate  is  2.048 
Mbit/s,  corresponding  to  32  voice  channels,  only  30  voice  channels  are  transmitted. 
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6.4  DIGITAL  MULTIPLEXING 

Several  low-bit-rate  signals  can  be  multiplexed,  or  combined,  to  form  one  high-bit-rate  signal, 
to  be  transmitted  over  a  high-frequency  medium.  Because  the  medium  is  time-shared  by  various 
incoming  signals,  this  is  a  case  of  TDM  (time  division  multiplexing).  The  signals  from  various 
incoming  channels,  or  tributaries,  may  be  as  diverse  as  a  digitized  voice  signal  (PCM),  a 
computer  output,  telemetry  data,  and  a  digital  facsimile.  The  bit  rates  of  various  tributaries 
need  not  be  the  same. 

To  begin  with,  consider  the  case  of  all  tributaries  with  identical  bit  rates.  Multiplexing  can 
be  done  on  a  bit-by-bit  basis  (known  as  bit  or  digit  interleaving)  as  shown  in  Fig.  6.22a,  or  on  a 
word-by-word  basis  (known  as  byte  or  word  interleaving).  Figure  6.22b  shows  the  interleaving 
ot  words,  formed  by  tour  bits.  The  North  American  digital  hierarchy  uses  bit  interleaving 
(except  at  the  lowest  level),  where  bits  are  taken  one  at  a  time  from  the  various  signals  to  be 
multiplexed.  Byte  interleaving,  used  in  building  the  DS1  signal  and  SONET-formatted  signals, 
involves  inserting  bytes  in  succession  from  the  channels  to  be  multiplexed. 

The  T1  carrier,  discussed  in  Sec.  6.3,  uses  eight-bit  word  interleaving.  When  the  bit  rates 
of  incoming  channels  are  not  identical,  the  high-bit-rate  channel  is  allocated  proportionately 
more  slots.  Four-channel  multiplexing  consists  of  three  channels  B,  C,  and  D  of  identical  bit 
rate  R  and  one  channel  (channel  A)  with  a  bit  rate  of  3 R.  (Fig.  6.22c, d).  Similar  results  can  be 
attained  by  combining  words  ot  different  lengths.  It  is  evident  that  the  minimum  length  of  the 
multiplex  frame  must  be  a  multiple  of  the  lowest  common  multiple  of  the  incoming  channel 
bit  rates,  and,  hence,  this  type  of  scheme  is  practical  only  when  some  fairly  simple  relation¬ 
ship  exists  among  these  rates.  The  case  of  completely  asynchronous  channels  is  discussed 
later. 

At  the  receiving  terminal,  the  incoming  digit  stream  must  be  divided  and  distributed  to  the 
appropriate  output  channel.  For  this  purpose,  the  receiving  terminal  must  be  able  to  correctly 
identify  each  bit.  This  requires  the  receiving  system  to  uniquely  synchronize  in  time  with  the 
beginning  of  each  frame,  with  each  slot  in  a  frame,  and  with  each  bit  within  a  slot.  This  is 
accomplished  by  adding  framing  and  synchronization  bits  to  the  data  bits.  These  bits  are  part 
of  the  so-called  overhead  bits. 


Figure  6.22  ci 
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6.4.1  Signal  Format 

Figure  6.23  illustrates  a  typical  format,  that  of  the  DM  1/2  multiplexer.  We  have  here  bit-by-bit 
interleaving  of  four  channels  each  at  a  rate  of  1.544  Mbit/s.  The  main  frame  (multiframe) 
consists  of  tour  subframes.  Each  subframe  has  six  overhead  bits:  for  example  the  subframe 

'  (hrhSt  l'nke:n  Flg^-23)  h,as  overhead  bits  Mo.  Ca,  Fo,  Ca.  Ca,  and  F,.  In  between  these 
overhead  bits  are  48  mterleaved  data  bits  from  the  four  channels  (12  data  bits  from  each 
channel).  We  begin  with  overhead  bit  M0.  followed  by  48  multiplexed  data  bits,  then  add 
a  second  overhead  bit  CA  followed  by  the  next  48  multiplexed  bits,  and  so  on.  Thus,  there 

H 76  hi  r  ™  *  4  =  ' 152  d3ta  bits  and  6  x  4  =  24  overhead  bits  making  a  total 
e  l!  f  r  ^e  TCy  ,S  1152/1,76  -  98%-  The  overhead  bits  with  subscript  0 
F  ire  al  l!  Th  p°^  *  SUbSCnpt  '  are  always  *•  Thus,  M0.  F0  are  all  Os  and  M,  and 
which  the  1 U  7  g'tS  ^  Peri°diC  0,0101  '  • '  and  P™ide  the  main  framing  pattern. 
I  6  !h?  mult,Plexer  ases  to  synchronize  on  the  frame.  After  locking  onto  this  pattern,  the 
ultiplexer  searches  for  the  01 11  pattern  formed  by  overhead  bits  M„M ,  M ,  M , .  This  further 
i  entities  the  four  subframes,  each  corresponding  to  a  line  in  Fig  6  23  It  is  possible,  although 
unlikely,  that  signal  bits  will  also  have  a  pattern  101010. ...  The  receiver  could  lock  onto  this 
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wrong  sequence.  The  presence  of  MoMiMjMi  provides  verification  of  the  genuine  FoFjFoFj 
sequence.  The  C  bits  are  used  to  transmit  additional  information  about  bit  stuffing,  as  discussed 
later. 

In  the  majority  of  cases,  not  all  incoming  channels  are  active  all  the  time:  some  transmit 
data,  and  some  are  idle.  This  means  the  system  is  underutilized.  We  can,  therefore,  accept  more 
input  channels  to  take  advantage  of  the  inactivity,  at  any  given  time,  of  at  least  one  channel. 
This  obviously  involves  much  more  complicated  switching  operations,  and  also  rather  careful 
system  planning.  In  any  random  traffic  situation  we  cannot  guarantee  that  the  number  of 
transmission  channels  demanded  will  not  exceed  the  number  available;  but  by  taking  account 
of  the  statistics  of  the  signal  sources,  it  is  possible  to  ensure  an  acceptably  low  probability  of 
this  occurring.  Multiplex  structures  of  this  type  have  been  developed  for  satellite  systems  and 
are  known  as  time  division  multiple-access  (TDMA)  systems. 

In  TDMA  systems  employed  for  telephony,  the  design  parameters  are  chosen  so  that  any 
overload  condition  lasts  only  a  fraction  of  a  second,  which  leads  to  acceptable  performance 
tor  speech  communication.  For  other  types  of  data  and  telegraphy,  transmission  delays  are 
unimportant.  Hence,  in  overload  condition,  the  incoming  data  can  be  stored  and  transmitted 
later. 


6.4.2  Asynchronous  Channels  and  Bit  Stuffing 

In  the  preceding  discussion,  we  assumed  synchronization  between  all  the  incoming  channels 
and  the  multiplexer.  This  is  difficult  even  when  all  the  channels  are  nominally  at  the  same 
rate.  For  example,  consider  a  1000  km  coaxial  cable  carrying  2  x  108  pulses  per  second. 
Assuming  the  nominal  propagation  speed  in  the  cable  to  be  2  x  108  m/s,  it  takes  1/200  second 
ot  transit  time  and  I  million  pulses  will  be  in  transit.  If  the  cable  temperature  increases  by  l°F> 
the  propagation  velocity  will  increase  by  about  0.01%.  This  will  cause  the  pulses  in  transit 
to  arrive  sooner,  thus  producing  a  temporary  increase  in  the  rate  of  pulses  received.  Because 
the  extra  pulses  cannot  be  accommodated  in  the  multiplexer,  they  must  be  temporarily  stored 
at  the  receiver.  II  the  cable  temperature  drops,  the  rate  of  received  pulses  will  drop,  and  the 

multiplexer  will  have  vacant  slots  with  no  data.  These  slots  need  to  be  stuffed  with  dummy 
digits  (pulse  stuffing). 

•  ?S'Sifa'S  'n  thC  North  American  network  are  often  generated  by  crystal  oscillators 
m  individual  channel  banks  or  other  digital  terminal  equipment.  Although  the  oscillators  are 
quite  stable,  they  will  not  oscillate  at  exactly  the  same  frequency,  leading  to  another  cause  of 
asynchronicity  in  the  network.  6 

This  shows  that  even  in  synchronously  multiplexed  systems,  the  data  are  rarely  received 

(also  known"01*'8  Tfi  f-  3  (kn°Wn  38  a"  and  pulse  stuffing 

e^tic ZeZr >  T  ,0  a'COmmodate  such  a"  situation.  Obviously,  this  method  of  an 
var  T  IT  ?  '  ^  ^  Whe"  the  cha""e>8  are  asynchronous. 

pulse  stuffing  and  n°»  vT  g  SCheme  exist:(|)  Positive  pulse  stuffing,  (2)  negative 
P  stuffing,  and  (3)  posit. ve/negative  pulse  stuffing.  In  positive  pulse  stuffing,  the  multiplexer 
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rate  is  higher  than  that  required  to  accommodate  all  incoming  tributaries  at  their  maximum 
rate.  Hence,  the  time  slots  in  the  multiplexed  signal  will  become  available  at  a  rate  exceeding 
that  of  the  incoming  data  so  that  the  tributary  data  will  tend  to  lag  (Fig.  6.24).  At  some  stage, 
the  system  will  decide  that  this  lag  has  become  great  enough  to  require  pulse  stuffing.  The 
information  about  the  stuffed-pulse  positions  is  transmitted  through  overhead  bits.  From  the 
overhead  bits,  the  receiver  knows  the  stuffed-pulse  position  and  eliminates  that  pulse. 

Negative  pulse  stuffing  is  a  complement  of  positive  pulse  stuffing.  The  time  slots  in  the 
multiplexed  signal  now  appear  at  a  slightly  slower  rate  than  those  of  the  tributaries,  and  thus 
the  multiplexed  signal  cannot  accommadate  all  the  tributary  pulses.  Information  about  any 
left-out  pulse  and  its  position  is  transmitted  through  overhead  bits.  The  positive/negative  pulse 
stuffing  is  a  combination  of  the  first  two  schemes.  The  nominal  rate  of  the  multiplexer  is  equal 
to  the  nominal  rate  required  to  accommodate  all  incoming  channels.  Hence,  we  may  need 
positive  pulse  stuffing  at  some  times  and  negative  stuffing  at  others.  All  this  information  is 
sent  through  overhead  bits. 

The  C  digits  in  Fig.  6.23  are  used  to  transmit  stuffing  information.  Only  one  stuffed  bit 
per  input  channel  is  allowed  per  frame.  This  is  sufficient  to  accommodate  expected  variations 
in  the  input  signal  rate.  The  bits  Ca  convey  information  about  stuffing  in  channel  A,  bits  Cb 
convey  information  about  stuffing  in  channel  B,  and  so  on.  The  insertion  of  any  stuffed  pulse  in 
any  one  subframe  is  denoted  by  setting  all  the  three  Cs  in  that  line  to  1.  No  stuffing  is  indicated 
by  using  Os  for  all  the  three  Cs.  If  a  bit  has  been  stuffed,  the  stuffed  bit  is  the  first  information 
bit  associated  with  the  immediate  channel  following  the  Fi  bit,  that  is,  the  first  such  bit  in  the 
last  48-bit  sequence  in  that  subframe.  For  the  first  subframe,  the  stuffed  bit  will  immediately 
follow  the  Fi  bit.  For  the  second  subframe,  the  stuffed  bit  will  be  the  second  bit  following  the 
Fj  bit,  and  so  on. 

6.4.3  Plesiochronous  (almost  Synchronous) 

Digital  Hierarchy 

We  now  present  the  digital  hierarchy  developed  by  the  Bell  System  and  currently  included 
in  the  ANSI  standards  for  telecommunications  (Fig.  6.25).  The  North  American  hierarchy  is 
implemented  in  North  America  and  Japan. 

Two  major  classes  of  multiplexers  are  used  in  practice.  The  first  category  is  used  for 
combining  low-data-rate  channels.  It  multiplexes  channels  of  rates  of  up  to  9600  bit/s  into  a 
signal  of  data  rate  of  up  to  64  kbit/s.  The  multiplexed  signal,  called  “digital  signal  level  0” 
(DSO)  in  the  North  American  hierarchy,  is  eventually  transmitted  over  a  voice-grade  channel. 
The  second  class  of  multiplexers  is  at  a  much  higher  bit  rate. 
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Figure  6.25 

North  American 
digital  hierarchy 
(AT&T  system). 


Channels 


139.264  Mbit/s 


There  are  four  orders,  or  levels,  of  multiplexing.  The  first  level  is  the  T1  multiplexer 
or  channel  hank,  consisting  of  24  channels  of  64  kbit/s  each.  The  output  of  this  multiplexer 
is  a  DSI  (digital  level  1)  signal  at  a  rate  of  1.544  Mbit/s.  Four  DS1  signals  are  multiplexed 
by  a  DM  1/2  multiplexer  to  yield  a  DS2  signal  at  a  rate  6.312  Mbit/s.  Seven  DS2  signals  are 
multiplexed  by  a  DM2/3  multiplexer  to  yield  a  DS3  signal  at  a  rate  of  44.736  Mbit/s.  Finally, 
three  DS3  signals  are  multiplexed  by  a  DM3/4NA  multiplexer  to  yield  a  DS4NA  signal  at  a 
rate  139.264  Mbit/s.  There  is  also  a  lower  rate  multiplexing  hierarchy,  known  as  the  digital 
data  system  (DDS),  which  provides  standards  for  multiplexing  digital  signals  with  rates  as 
low  as  —4  kbit/s  into  a  DSO  signal  for  transmission  through  the  network. 

The  inputs  to  a  T1  multiplexer  need  not  be  restricted  only  to  digitized  voice  channels 
alone.  Any  digital  signal  of  64  kbit/s  of  appropriate  format  can  be  transmitted.  The  case  of 
t  C  Ig  er  evels  is  similar.  For  example,  all  the  incoming  channels  of  the  DM  1/2  multiplexer 
need  not  be  DSI  signals  obtained  by  multiplexing  24  channels  of  64  kbit/s  each.  Some  of  them 
maybe  1.544  Mbit/s  digital  signals  of  appropriate  format,  and  so  on. 

ITI I  ?  UT  ?n(i  HJany  ot*ler  Parts  °f  the  world,  another  hierarchy,  recommended  by  the 
chLel^nt^TS'  ,rn  ?  Th'S  hierarch^  based  on  multiplexing  30  telephone 

in  Fill  6  06  St  .  'T'  cha™els)  into  an  E-l  carrier  at  2.048  Mbit/s  (30  channels)  is  shown 
Fig.  6.26  Starting  from  the  base  level  of  E- 1 ,  four  lower  level  lines  form  one  higher  level  line 

data  throuohnut  oH^St  Mtv  /2  Une™'lh  data  throughput  of  8.448  Mbit/s,  an  E-3  line  with 
•m  E  5  line  with  1  t  rh  E"4  hne  lme  with  data  throu?hput  of  1 39.264  Mbit/s,  and 

to  interlace  'with  tme  T  ^  °‘  ^  ^  Because  ditYerei1t  networks  "ust  * 
and SSin X world  p VZ7  systems  (North  American,  Japanese. 

common  interface.  '  emonstrates  the  relative  relationship  and  the  points  of  their 
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6.5  DIFFERENTIAL  PULSE  CODE 
MODULATION  (DPCM) 

PCM  is  not  a  very  efficient  system  because  it  generates  so  many  bits  and  requires  so  much 
bandwidth  to  transmit.  Many  different  ideas  have  been  proposed  to  improve  the  encoding 
efficiency  of  A/D  conversion.  In  general,  these  ideas  exploit  the  characteristics  of  the  source 
signals.  DPCM  is  one  such  scheme. 

In  analog  messages  we  can  make  a  good  guess  about  a  sample  value  from  knowledge  of 
past  sample  values.  In  other  words,  the  sample  values  are  not  independent,  and  generally  there 
is  a  great  deal  of  redundancy  in  the  Nyquist  samples.  Proper  exploitation  of  this  redundancy 
leads  to  encoding  a  signal  with  fewer  bits.  Consider  a  simple  scheme;  instead  ol  transmitting  the 
sample  values,  we  transmit  the  difference  between  the  successive  sample  values.  Thus,  if  m[k  ]  is 
the  Ath  sample,  instead  of  transmitting  m[k],  we  transmit  the  difference  d[k]  =  m[k]-m[k  - 1  ]. 
At  the  receiver,  knowing  d\k]  and  several  previous  sample  value  m[k  -  1  ],  we  can  reconstruct 
m[k].  Thus,  from  knowledge  of  the  difference  d{k],  we  can  reconstruct  m[k]  iteratively  at  the 
receiver.  Now,  the  difference  between  successive  samples  is  generally  much  smaller  than  the 
sample  values.  Thus,  the  peak  amplitude  mp  of  the  transmitted  values  is  reduced  considerably. 
Because  the  quantization  interval  Av  =  tnp/ L,  for  a  given  L  (or  n),  this  reduces  the  quantization 
interval  Av,  thus  reducing  the  quantization  noise,  which  is  given  by  Av/12.  This  means  that 
for  a  given  n  (or  transmission  bandwidth),  we  can  increase  the  SNR.  or  for  a  given  SNR,  we 
can  reduce  n  (or  transmission  bandwidth). 

We  can  improve  upon  this  scheme  by  estimating  (predicting)  the  value  of  the  Ath  sample 
/n[A'j  from  a  knowledge  of  several  previous  sample  values.  It  this  estimate  is  ],  then  we 
transmit  the  difference  (prediction  error)  d[k]  =  m[k]-m[k].M  the  receiver  also,  we  determine 
the  estimate  ih[k]  from  the  previous  sample  values,  and  then  generate  m\k\  by  adding  the 
received  d[k]  to  the  estimate  m[k].  Thus,  we  reconstruct  the  samples  at  the  receiver  iteratively. 
If  our  prediction  is  worth  its  salt,  the  predicted  (estimated)  value  m[k]  will  be  close  to  m[*], 
and  their  difference  (prediction  error)  d[k]  will  be  even  smaller  than  the  difference  between 
the  successive  samples.  Consequently,  this  scheme,  known  as  the  differential  PCM  (DPCM), 
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is  superior  to  the  naive  prediction  described  in  the  preceding  paragraph,  which  is  a  special  case 
of  DPCM,  where  the  estimate  of  a  sample  value  is  taken  as  the  previous  sample  value,  that  is, 
m\k\  =  m\k  -  1], 

Spirits  of  Taylor,  Maclaurin,  and  Wiener 

Before  describing  DPCM,  we  shall  briefly  discuss  the  approach  to  signal  prediction  (estima¬ 
tion).  To  the  uninitiated,  future  prediction  seems  like  mysterious  stuff,  fit  only  for  psychics, 
wizards,  mediums,  and  the  like,  who  can  summon  help  from  the  spirit  world.  Electrical 
engineers  appear  to  be  hopelessly  outclassed  in  this  pursuit.  Not  quite  so!  We  can  also  sum¬ 
mon  the  spirits  of  Taylor,  Maclaurin,  Wiener,  and  the  like  to  help  us.  What  is  more,  unlike 
Shakespeare  s  spirits,  our  spirits  come  when  called.*  Consider,  for  example,  a  signal  m(t ), 
which  has  derivatives  of  all  orders  at  t.  Using  the  Taylor  series  for  this  signal,  we  can  express 
m(t  4-  Ts)  as 


7\2  T 3 

m(*  +  Ts)  =  m{t)  +  Tsm{t)  +  -m(t)  +  H -  (6.42a) 

%  ™(0  +  Tsm(t)  for  small  Ts  (6.42b) 

Equation  (6.42a)  shows  that  from  a  knowledge  of  the  signal  and  its  derivatives  at  instant  /,  we 
can  predict  a  future  signal  value  at  /  +  Ts.  In  fact,  even  if  we  know  just  the  first  derivative, 
we  can  still  predict  this  value  approximately,  as  shown  in  Eq.  (6.42b).  Let  us  denote  the  Ath 
sample  ot  m(t)  by  m[k]y  that  is,  m(kTs)  =  m[k],  and  m(kTs  ±  Ts)  =  m[k  ±  1],  and  so  on. 

Setting  t  =  kTs  in  Eq.  (6.42b),  and  recognizing  that  m(kTs)  %  [m{kTs)  -  m(kTs  -  Ts)]/Ts,  we 
obtain 
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m[k  +  1]  %  m[*]  + 

=  2m[k)  —  m[k  -  1] 

This  shows  that  we  can  find  a  crude  prediction  of  the  (k  +  1  )th  sample  from  the  two  previou: 
samp  ex  e  approximation  in  Eq.  (6.42b)  improves  as  we  add  more  terms  in  the  series  oi 
the  right-hand  side.  To  determine  the  higher  order  derivatives  in  the  series,  we  require  mor. 

ZKS  mThe  RSt'  ThC  'arger  thC  nUmber  °f  past  samP'es  we  use,  the  better  will  be  th< 
prediction.  Thus,  in  general,  we  can  express  the  prediction  formula  as 

m[k]  *aim[k-  1]  +  a2m[k  -  2]  +  . . .  +  aNm[k  -  N]  (6-43 

The  right-hand  side  is  m[k],  the  predicted  value  of  Thus, 

m[*]  =„,«[*-!]  +  a2m[k  _  2]  +  . . .  +  aNm[k  _  N]  (6.44 

m?ou,Du1  PrediCI°r'  Urger  N  would  <«■»  i"  better  prediction  it 

to  sav  that  the  innnt  ic  ri  1  ’  m  ^  •  •  • »  m[k  —  /V],  although  it  is  customao 

'  y  P  ls  m\k\  and  the  output  is  m[k\.  Observe  that  this  equation  reduces  tc 


_  j  1  ^vv.1  in,  5cene  I : 

Glendower:  /  can  call  the  spirits  from  vasty  deep 
Hotspur:  Why.  so  can  I.  or  so  can  any  man; 

But  will  they  come  when  you  do  call  for  them ? 
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Figure  6.27 
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m[k ]  =  m[k  —  1  ]  in  the  case  of  the  first-order  prediction.  It  follows  from  Eq.  (6.42b),  where 
we  retain  only  the  first  term  on  the  right-hand  side.  This  means  that  «|  =  1 ,  and  the  first-order 
predictor  is  a  simple  time  delay. 

We  have  outlined  here  a  very  simple  procedure  for  predictor  design.  In  a  more  sophisticated 
approach,  discussed  in  Sec.  8.5,  where  we  use  the  minimum  mean  squared  error  criterion  for 
best  prediction,  the  prediction  coefficients  aj  in  Eq.  (6.44)  are  determined  from  the  statistical 
correlation  between  various  samples.  The  predictor  described  in  Eq.  (6.44)  is  called  a  linear 
predictor.  It  is  basically  a  transversal  filter  (a  tapped  delay  line),  where  the  tap  gains  are  set 
equal  to  the  prediction  coefficients,  as  shown  in  Fig.  6.27. 

Analysis  of  DPCM 

As  mentioned  earlier,  in  DPCM  we  transmit  not  the  present  sample  m[*],  but  d\k]  (the 
difference  between  m[k]  and  its  predicted  value  m[fc]).  At  the  receiver,  we  generate  m\k ] 
from  the  past  sample  values  to  which  the  received  d[k]  is  added  to  generate  nt[k]. 
There  is,  however,  one  difficulty  associated  with  this  scheme.  At  the  receiver,  instead  of 
the  past  samples  m[k  -  1],  m[k  -  2] . as  well  as  d[k],  we  have  their  quantized  ver¬ 
sions  mq[k  -  I],  mq[k  -  2] . Hence,  we  cannot  determine  m[k\.  We  can  determine  only 

mq[k],  the  estimate  of  the  quantized  sample  mq[k],  in  terms  of  the  quantized  samples 

mq[k  —  1],  mq[k  -2], _ This  will  increase  the  error  in  reconstruction.  In  such  a  case,  a  better 

strategy  is  to  determine  mq[k],  the  estimate  of  mq\k]  (instead  of  m[k]),  at  the  transmitter  also 

from  the  quantized  samples  mq[k  -  1],  mq[k  -  2] . The  difference  d[k]  =  m\k]  -  mq[k ]  is 

now  transmitted  via  PCM.  At  the  receiver,  we  can  generate  mq[k],  and  from  the  received  d[k ], 
we  can  reconstruct  mq[k]. 

Figure  6.28a  shows  a  DPCM  transmitter.  We  shall  soon  show  that  the  predictor  input  is 
mq[k].  Naturally,  its  output  is  mq[k],  the  predicted  value  of  mq[k].  The  difference 

d[k]  =  m[k]  -  mq[k\  (6.45) 

is  quantized  to  yield 

dq[k\=d[k]  +  q[k\  (6.46) 

where  q[k]  is  the  quantization  error.  The  predictor  output  mq[k]  is  fed  back  to  its  input  so  that 
the  predictor  input  mq[k]  is 


mq[k )  =  mq[  k]  +  dq[k] 

=  m[k]  —  d[k]  +  dq[k] 
=  «([£]  +  q[k] 


(6.47) 
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Figure  6.28 

DPCM  system: 

(a)  transmitter; 

(b)  receiver. 
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This  shows  that  irtq[k\  is  a  quantized  version  of  m\k\.  The  predictor  input  is  indeed  mq[k],  as 
assumed.  The  quantized  signal  dq[k]  is  now  transmitted  over  the  channel.  The  receiver  shown 
in  Fig.  6.28b  is  identical  to  the  shaded  portion  of  the  transmitter.  The  inputs  in  both  cases  are 
also  the  same,  namely,  dq[k\.  Therefore,  the  predictor  output  must  be  fhq 1 k ]  (the  same  as  the 
predictor  output  at  the  transmitter).  Hence,  the  receiver  output  (which  is  the  predictor  input)  is 
also  the  same,  viz.,  wi<y|/.']  =  ],  as  found  in  Eq.  (6.47).  This  shows  that  we  are  able  to 

receive  the  desired  signal  m[fc]  plus  the  quantization  noise  q[k].  This  is  the  quantization  noise 
associated  with  the  difference  signal  d[k],  which  is  generally  much  smaller  than  m[k].  The 
received  samples  mq[k]  are  decoded  and  passed  through  a  low-pass  filter  for  D/A  conversion. 


SNR  Improvement 

To  determine  the  improvement  in  DPCM  over  PCM,  let  mr  and  d„  be  the  peak  amplitudes 
of  m(t)  and  d(t).  respectively.  If  we  use  the  same  value  of  L  in  both  cases,  the  quantization 
S'Lp*',  *n  is  reduced  by  the  factor  dp/mp.  Because  the  quantization  noise  power  is 

(Ac)-/ 1  -  the  quantization  noise  in  DPCM  is  reduced  by  the  factor  ( mp/d„ )2,  and  the  SNR 
is  increased  by  the  same  factor.  Moreover,  the  signal  power  is  proportional  to  its  peak  value 
squared  (assuming  other  statistical  properties  invariant).  Therefore,  G„  (SNR  improvement 
due  to  prediction)  is  at  least 


Pm 

Gp  =  Td 

means  ST  ^  P°T  °f  and  *  lively.  In  terms  of  decibel  units,  tl 

cans  that  the  SNR  increases  by  1 0  log  w{Pm/Pd)  dB.  Therefore,  Eq.  (6.4 1 )  applies  to  DPC 

predictor  f  “  ^  by  10  l°^(P>n/Pd)  dB.  In  Example  8.24  a  second-orc 

found  to  be  s  6  HRf  I  ^  “  analyzed-  For  this  case,  the  SNR  improvement 

as  short  term  voiced"  the  SNR  imProvement  may  be  as  high  as  25  dB  in  such  cas 

as  short-term  voiced  speech  spectra  and  in  the  spectra  of  low-activity  images  12  Alternate 

T  f°r  DPCM  C0Uld  **»  that  JS by  3  to  3  hits , 

sample.  Thus,  telephone  systems  using  DPCM  can  often  operate  at  32  or  even  24  kbit/s. 
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6.6  ADAPTIVE  DIFFERENTIAL  PCM  (ADPCM) 

Adaptive  DPCM  (ADPCM)  can  further  improve  the  efficiency  of  DPCM  encoding  by  incor¬ 
porating  an  adaptive  quantizer  at  the  encoder.  Figure  6.29  illustrates  the  basic  configuration 
of  ADPCM.  For  practical  reasons,  the  number  of  quantization  level  L  is  fixed.  When  a  fixed 
quantization  step  Av  is  applied,  either  the  quantization  error  is  too  large  because  Av  is  too 
big  or  the  quantizer  cannot  cover  the  necessary  signal  range  when  Av  is  too  small.  Therefore, 
it  would  be  better  for  the  quantization  step  Av  to  be  adaptive  so  that  Av  is  large  or  small 
depending  on  whether  the  prediction  error  for  quantizing  is  large  or  small. 

It  is  important  to  note  that  the  quantized  prediction  error  dq[k]  can  be  a  good  indicator  of 
the  prediction  error  size.  For  example,  when  the  quantized  prediction  error  samples  vary  close 
to  the  largest  positive  value  (or  the  largest  negative  value),  it  indicates  that  the  prediction  error 
is  large  and  Av  needs  to  grow.  Conversely,  if  the  quantized  samples  oscillate  near  zero,  then 
the  prediction  error  is  small  and  Av  needs  to  decrease.  It  is  important  that  both  the  modulator 
and  the  receiver  have  access  to  the  same  quantized  samples.  Hence,  the  adaptive  quantizer  and 
the  receiver  reconstruction  can  apply  the  same  algorithm  to  adjust  the  Av  identically. 

Compared  with  DPCM,  ADPCM  can  further  compress  the  number  of  bits  needed  for  a 
signal  waveform.  For  example,  it  is  very  common  in  practice  for  an  8-bit  PCM  sequence  to  be 
encoded  into  a  4-bit  ADPCM  sequence  at  the  same  sampling  rate.  This  easily  represents  a  2: 1 
bandwidth  or  storage  reduction  with  virtually  no  loss. 

ADPCM  encoder  has  many  practical  applications.  The  ITU-T  standard  G.726  specifies  an 
ADPCM  speech  coder  and  decoder  (called  codec)  for  speech  signal  samples  at  8  kHz.7  The 
G726  ADPCM  predictor  uses  an  eighth-order  predictor.  For  different  quality  levels,  G.726 
specifies  four  different  ADPCM  rates  at  16,  24,  32,  and  40  kbit/s.  They  correspond  to  four 
different  bit  sizes  for  each  speech  sample  at  2  bits,  3  bits,  4  bits,  and  5  bits,  respectively,  or 
equivalently,  quantization  levels  of  4,  8,  16,  and  32,  respectively. 

The  most  common  ADPCM  speech  encoders  use  32  kbit/s.  In  practice,  there  are  multiple 
variations  of  ADPCM  speech  codec.  In  addition  to  the  ITU-T  G726  specification.7  these 
include  the  OKI  ADPCM  codec,  the  Microsoft  ADPCM  codec  supported  by  WAVE  players, 
and  the  Interactive  Multimedia  Association  (IMA)  ADPCM,  also  known  as  the  DV1  ADPCM. 
The  32  kbit/s  ITU-T  G726  ADPCM  speech  codec  is  widely  used  in  the  DECT  (digital  enhanced 
cordless  telecommunications)  system,  which  itself  is  widely  used  for  residential  and  business 
cordless  phone  communications.  Designed  for  short-range  use  as  an  access  mechanism  to  the 
main  networks,  DECT  offers  cordless  voice,  fax,  data,  and  multimedia  communications.  DECT 
is  now  in  use  in  over  100  countries  worldwide.  Another  major  user  of  the  32  kbit/s  ADPCM 
codec  is  the  Personal  Handy-phone  System  (or  PHS),  also  marketed  as  the  Personal  Access 
System  (PAS)  and  known  as  Xiaolingtong  in  China. 

PHS  is  a  mobile  network  system  similar  to  a  cellular  network,  operating  in  the  1 880  to  1 930 
MHz  frequency  band,  used  mainly  in  Japan,  China,  Taiwan,  and  elsewhere  in  Asia.  Originally 
developed  by  the  NTT  Laboratory  in  Japan  in  1989,  PHS  is  much  simpler  to  implement  and 
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deploy.  Unlike  cellular  networks,  PHS  phones  and  base  stations  are  low-power,  short-range 
facilities.  The  service  is  often  pejoratively  called  the  “poor  man’s  cellular”  because  of  its 
limited  range  and  poor  roaming  ability.  PHS  first  saw  limited  deployment  (NTT-Personal, 
DDI-Pocket,  and  ASTEL)  in  Japan  in  1995  but  has  since  nearly  disappeared.  Surprisingly, 
PHS  has  seen  a  resurgence  in  markets  like  China,  Taiwan,  Vietnam,  Bangladesh,  Nigeria, 
Mali,  Tanzania,  and  Honduras,  where  its  low  cost  of  deployment  and  hardware  costs  offset 
the  system’s  disadvantages.  In  China  alone,  there  was  an  explosive  expansion  of  subscribers, 
reaching  nearly  80  million  in  2006. 


6.7  DELTA  MODULATION 

Sample  correlation  used  in  DPCM  is  further  exploited  in  delta  modulation  (DM)  by  oversam¬ 
pling  (typically  four  times  the  Nyquist  rate)  the  baseband  signal.  This  increases  the  correlation 
between  adjacent  samples,  which  results  in  a  small  prediction  error  that  can  be  encoded  using 
only  one  bit  (L  =  2).  Thus,  DM  is  basically  a  1-bit  DPCM.  that  is,  a  DPCM  that  uses  only  two 
levels  (L  =  2)  for  quantization  of  m[k]  -  mq[k].  In  comparison  to  PCM  (and  DPCM),  it  is  a 
very  simple  and  inexpensive  method  of  A/D  conversion.  A  1  -bit  codeword  in  DM  makes  word 
1 1 anting  unnecessary  at  the  transmitter  and  the  receiver.  This  strategy  allows  us  to  use  fewer 
bits  per  sample  for  encoding  a  baseband  signal. 

In  DM,  we  use  a  first-order  predictor,  which,  as  seen  earlier,  is  just  a  time  delay  of  Ts 
(the  sampling  interval).  Thus,  the  DM  transmitter  (modulator)  and  receiver  (demodulator)  are 
identical  to  those  of  the  DPCM  in  Fig.  6.28,  with  a  time  delay  for  the  predictor,  as  shown  in 
Fig.  6.30,  from  which  we  can  write 

m  </[*!  =mq[k-  1  ]  +  dq[k]  (6.48) 


Hence, 


mq[k  -  1 J  =  mq[k  -  2]  +  dq[k  -  1  ] 


Figure  6.30 
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Substituting  this  equation  into  Eq.  (6.48)  yields 

mq[k ]  =  mq[k  -  2]  +  dq[k\  +  dq[k  -  1  ] 

Proceeding  iteratively  in  this  manner,  and  assuming  zero  initial  condition,  that  is,  mq\ 0]  =  0, 
we  write 


k 

mq\k]  =  Y^dq[m] 

m= 0 


(6.49) 


This  shows  that  the  receiver  (demodulator)  is  just  an  accumulator  (adder).  If  the  output 
dq[k]  is  represented  by  impulses,  then  the  accumulator  (receiver)  may  be  realized  by  an  integra¬ 
tor  because  its  output  is  the  sum  of  the  strengths  of  the  input  impulses  (sum  of  the  areas  under 
the  impulses).  We  may  also  replace  with  an  integrator  the  feedback  portion  of  the  modulator 
(which  is  identical  to  the  demodulator).  The  demodulator  output  is  m9[&],  which  when  passed 
through  a  low-pass  filter  yields  the  desired  signal  reconstructed  from  the  quantized  samples. 

Figure  6.3 1  shows  a  practical  implementation  of  the  delta  modulator  and  demodulator. 
As  discussed  earlier,  the  first-order  predictor  is  replaced  by  a  low-cost  integrator  circuit  (such 
as  an  RC  integrator).  The  modulator  (Fig.  6.31a)  consists  of  a  comparator  and  a  sampler  in 
the  direct  path  and  an  integrator-amplifier  in  the  feedback  path.  Let  us  see  how  this  delta 
modulator  works. 

The  analog  signal  m(t)  is  compared  with  the  feedback  signal  (which  serves  as  a  predicted 
signal)  mq(t).  The  error  signal  d(t)  =m(t)  -  mq(t)  is  applied  to  a  comparator.  If  d(t)  is  positive, 
the  comparator  output  is  a  constant  signal  of  amplitude  E,  and  if  d(t)  is  negative,  the  comparator 
output  is  —E.  Thus,  the  difference  is  a  binary  signal  (L  =  2)  that  is  needed  to  generate  a  1-bit 
DPCM.  The  comparator  output  is  sampled  by  a  sampler  at  a  rate  of  fs  samples  per  second, 
where  fs  is  typically  much  higher  than  the  Nyquist  rate.  The  sampler  thus  produces  a  train 
of  narrow  pulses  dq[k ]  (to  simulate  impulses)  with  a  positive  pulse  when  m(t)  >  mq(t)  and  a 
negative  pulse  when  m(t)<mq(t).  Note  that  each  sample  is  coded  by  a  single  binary  pulse 
( 1  -bit  DPCM),  as  required.  The  pulse  train  dq[k]  is  the  delta-modulated  pulse  train  (Fig.  6.3 1  d). 
The  modulated  signal  dq[k]  is  amplified  and  integrated  in  the  feedback  path  to  generate  mq(t) 
(Fig.  6.31c),  which  tries  to  follow  m{t). 

To  understand  how  this  works,  we  note  that  each  pulse  in  dq[k]  at  the  input  of  the  integrator 
gives  rise  to  a  step  function  (positive  or  negative,  depending  on  the  pulse  polarity)  in  ihq(t).  If, 
for  example,  m(/)  >  mq(t),  a  positive  pulse  is  generated  in  dq[k],  which  gives  rise  to  a  positive 
step  in  mq(t ),  trying  to  equalize  mq(t)  to  m(t)  in  small  steps  at  every  sampling  instant,  as  shown 
in  Fig.  6.3  lc.  It  can  be  seen  that  mq{t)  is  a  kind  of  staircase  approximation  of  m(t).  When  mq(l) 
is  passed  through  a  low-pass  filter,  the  coarseness  of  the  staircase  in  mq(t)  is  eliminated,  and 
we  get  a  smoother  and  better  approximation  to  ffi(t).  The  demodulator  at  the  receiver  consists 
of  an  amplifier-integrator  (identical  to  that  in  the  feedback  path  of  the  modulator)  followed  by 
a  low-pass  filter  (Fig.  6.31b). 

DM  Transmits  the  Derivative  of  m{t) 

In  PCM,  the  analog  signal  samples  are  quantized  in  L  levels,  and  this  information  is  transmitted 
by  n  pulses  per  sample  (n  =  log2  L).  A  little  reflection  shows  that  in  DM,  the  modulated  signal 
carries  information  not  about  the  signal  samples  but  about  the  difference  between  successive 
samples.  If  the  difference  is  positive  or  negative,  a  positive  or  a  negative  pulse  (respectively) 
is  generated  in  the  modulated  signal  dq[k).  Basically,  therefore,  DM  carries  the  information 
about  the  derivative  of  m(t),  hence,  the  name  “delta  modulation.”  This  can  also  be  seen  from 
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that  is,  if  m(t)  is  too  high,  mq(t)  cannot  follow  m(t)y  and  overloading  occurs.  This  is  the 
so-called  slope  overload,  which  gives  rise  to  the  slope  overload  noise.  This  noise  is  one 
of  the  basic  limiting  factors  in  the  performance  of  DM.  We  should  expect  slope  overload 
rather  than  amplitude  overload  in  DM,  because  DM  basically  carries  the  information  about 


m(t).  The  granular  nature  of  the  output  signal  gives  rise  to  the  granular  noise  similar  to  the 


quantization  noise.  The  slope  overload  noise  can  be  reduced  by  increasing  E  (the  step  size). 
This  unfortunately  increases  the  granular  noise.  There  is  an  optimum  value  of  £,  which  yields 
the  best  compromise  giving  the  minimum  overall  noise.  This  optimum  value  of  E  depends  on 
the  sampling  frequency  fs  and  the  nature  of  the  signal.12 

The  slope  overload  occurs  when  mq(t)  cannot  follow  m(t).  During  the  sampling  interval 
Ts,  mq(t)  is  capable  of  changing  by  £,  where  E  is  the  height  of  the  step.  Hence,  the  maximum 
slope  that  mq(t)  can  follow  is  E/Ts,  or  Efs ,  where  fs  is  the  sampling  frequency.  Hence,  no 
overload  occurs  if 


\m(t)\  <  Efs 


Consider  the  case  of  tone  modulation  (meaning  a  sinusoidal  message): 


m(t)  =  A  cos  cot 


The  condition  for  no  overload  is 


l«*(Olmax  =(oA  <  Efs 


(6.50) 


Hence,  the  maximum  amplitude  Amax  of  this  signal  that  can  be  tolerated  without  overload  is 
given  by 


(6.51) 


The  overload  amplitude  of  the  modulating  signal  is  inversely  proportional  to  the  frequency 
co.  For  higher  modulating  frequencies,  the  overload  occurs  for  smaller  amplitudes.  For  voice 
signals,  which  contain  all  frequency  components  up  to  (say)  4  kHz,  calculating  Amax  by  using 
co  =  2n  x  4000  in  Eq.  (6.51)  will  give  an  overly  conservative  value.  It  has  been  shown  by  de 
Jager13  that  Amax  for  voice  signals  can  be  calculated  by  using  wr  ~  2n  x  800  in  Eq.  (6.5 1 ), 


(6.52) 


[Amax]  voice  — 


Thus,  the  maximum  voice  signal  amplitude  Amax  that  can  be  used  without  causing  slope 
overload  in  DM  is  the  same  as  the  maximum  amplitude  of  a  sinusoidal  signal  of  reference 
frequency  fr  ( fr  —  800  Hz)  that  can  be  used  without  causing  slope  overload  in  the  same  system. 

Fortunately,  the  voice  spectrum  (as  well  as  the  television  video  signal)  also  decays  with 
frequency  and  closely  follows  the  overload  characteristics  (curve  c.  Fig.  6.32).  For  this  reason, 
DM  is  well  suited  tor  voice  (and  television)  signals.  Actually,  the  voice  signal  spectrum  (curve 
b)  decreases  as  l/o>  up  to  2000  Hz,  and  beyond  this  frequency,  it  decreases  as  1  /or.  If  we 
had  used  a  double  integration  in  the  feedback  circuit  instead  ot  a  single  integration,  Amax  in 
Eq.  (6.51)  would  be  proportional  to  \/o)2.  Hence,  a  better  match  between  the  voice  spectrum 
and  the  overload  characteristics  is  achieved  by  using  a  single  integration  up  to  2000  Hz  and 
a  double  integration  beyond  2000  Hz.  Such  a  circuit  (the  double  integration)  responds  fast 


350  SAMPLING  AND  ANALOG-TO-DIGITAL  CONVERSION 


Figure  6.32 

Voice  signal 
spectrum. 


Figure  6.33 

(a)  Conventional 
delta  modulator. 

(b)  E-A 
modulator. 

(c)  Simpler  E-A 
modulator. 


but  has  a  tendency  to  instability,  which  can  be  reduced  by  using  some  low-order  prediction 
along  with  double  integration.  A  double  integrator  can  be  built  by  placing  in  cascade  two 
low-pass  RC  integrators  with  time  constants  /?iCj  =  1/2007T  and  R2C2  =  1  /4000;r,  respec¬ 
tively.  This  results  in  single  integration  from  100  to  2000  Hz  and  double  integration  beyond 
2000  Hz. 


Sigma-Delta  Modulation 

While  discussing  the  threshold  of  coding  and  overloading,  we  illustrated  that  the  essence  of 
the  conventional  DM  is  to  encode  and  transmit  the  derivative  of  the  analog  message  signal. 
Hence,  the  receiver  of  DM  requires  an  integrator  as  shown  in  Fig.  6.31  and  also,  equivalently, 
in  Fig.  6.33a.  Since  signal  transmission  inevitably  is  subjected  to  channel  noise,  such  noise 
will  be  integrated  and  will  accumulate  at  the  receiver  output,  which  is  a  highly  undesirable 
phenomenon  that  is  a  major  drawback  of  DM. 

To  overcome  this  critical  drawback  of  DM,  a  small  modification  can  be  made.  First,  we 
can  view  the  overall  DM  system  consisting  of  the  transmitter  and  the  receiver  as  approximately 
distortionless  and  linear.  Thus,  one  of  its  serial  components,  the  receiver  integrator  1  /$,  may 
be  moved  to  the  front  of  the  transmitter  (encoder)  without  affecting  the  overall  modulator  and 
demodulator  response,  as  shown  in  Fig.  6.33b.  Finally,  the  two  integrators  can  be  merged  into 
a  smgle  one  after  the  subtractor,  as  shown  in  Fig.  6.33c.  This  modified  system  is  known  as  the 
sigma-delta  modulation  (E-AM) . 

As  we  found  in  the  study  of  preemphasis  and  deemphasis  filters  in  FM,  because  channel 
noise  and  the  message  signal  do  not  follow  the  same  route,  the  order  of  serial  components 
in  the  overall  modulation-demodulation  system  can  have  different  effects  on  the  SNR.  The 
seemingly  minor  move  of  the  integrator  l/s  in  fact  has  several  major  advantages: 


•  The  channel  noise  no  longer  accumulates  at  the  demodulator. 

i  r  ’TT!  ^°W  *recluency  content  °f  the  message  m(t)  is  preemphasized  by  the  integrato 
/jcv.  This  helps  many  practical  signals  (such  as  speech)  whose  low-frequency  component 
are  more  important.  4  J 

'  s,nooths  uk  sisnai  f°r  encoding  <Rg'  6'33w  Hen“’  °verioa,ii"' 
■  e„i-„7e‘r,or  i,Kreases  ,he  com“o"  “mp"s 

•  The  demodulator  is  simplified. 


6.8  VOC 


6.8  Vocoders  and  Video  Compression  351 


Figure  6.33 
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Adaptive  Delta  Modulation  (ADM) 

The  DM  discussed  so  far  suffers  from  one  serious  disadvantage.  The  dynamic  range  of  ampli¬ 
tudes  is  too  small  because  of  the  threshold  and  overload  effects  discussed  earlier.  To  address 
this  problem,  some  type  of  signal  compression  is  necessary.  In  DM,  a  suitable  method  appears 
to  be  the  adaptation  of  the  step  value  E  according  to  the  level  of  the  input  signal  derivative. 
For  example,  in  Fig.  6.31,  when  the  signal  m(t)  is  falling  rapidly,  slope  overload  occurs.  If  we 
can  increase  the  step  size  during  this  period,  the  overload  could  be  avoided.  On  the  other  hand, 
if  the  slope  of  m(t)  is  small,  a  reduction  of  step  size  will  reduce  the  threshold  level  as  well  as 
the  granular  noise.  The  slope  overload  causes  dq[k]  to  have  several  pulses  of  the  same  polarity 
in  succession.  This  calls  for  increased  step  size.  Similarly,  pulses  in  dq[k]  alternating  contin¬ 
uously  in  polarity  indicates  small-amplitude  variations,  requiring  a  reduction  in  step  size.  In 
ADM  we  detect  such  pulse  patterns  and  automatically  adjust  the  step  size.14  This  results  in  a 
much  larger  dynamic  range  for  DM. 


6-8  VOCODERS  AND  VIDEO  COMPRESSION 

PCM.  DPCM.  ADPCM,  DM.  and  E-AM  are  all  examples  of  what  are  known  as  waveform 
source  encoders.  Basically,  waveform  encoders  do  not  take  into  consideration  how  the  signals 
for  digitization  are  generated.  Hence,  the  amount  of  compression  achievable  by  waveform 
encoders  is  highly  limited  by  the  degree  of  correlation  between  successive  signal  samples. 
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Figure  6.34 

(a)  The  human 
speech 
production 
mechanism. 

(b)  Typical 
pressure 
impulses. 


For  a  low-pass  source  signal  with  finite  bandwidth  B  Hz,  even  if  we  apply  the  minimum 
Nyquist  sampling  rate  2 B  Hz  and  1-bit  encoding,  the  bit  rate  cannot  be  lower  than  2 B  bit/s. 
There  have  been  many  successful  methods  introduced  to  drastically  reduce  the  source  coding 
rates  of  speech  and  video  signals,  very  important  to  our  daily  communication  needs.  Unlike 
waveform  encoders,  the  most  successful  speech  and  video  encoders  are  based  on  the  human 
physiological  models  involved  in  speech  generation  and  in  video  perception.  Here  we  describe 
the  basic  principles  of  the  linear  prediction  voice  coders  (known  as  vocoders)  and  the  video 
compression  method  proposed  by  the  Moving  Picture  Experts  Group  (MPEG). 


6.8.1  Linear  Prediction  Coding  Vocoders 

Voice  Models  and  Model-Based  Vocoders 

Linear  prediction  coding  (LPC)  vocoders  are  model-based  systems.  The  model,  in  turn,  is 
based  on  a  good  understanding  of  the  human  voice  mechanism.  Fig.  6.34a  provides  a  cross- 
sectional  illustration  of  the  human  speech  apparatus.  Briefly,  human  speech  is  produced  by 
the  joint  interaction  ot  lungs,  vocal  cords,  and  the  articulation  tract,  consisting  of  the  mouth 
and  the  nose  cavity.  Based  on  this  physiological  speech  model,  human  voices  can  be  divided 
into  voiced  and  the  unvoiced  sound  categories.  Voiced  sounds  are  those  made  while  the  vocal 
cords  are  vibrating.  Put  a  finger  on  your  Adam’s  apple*  while  speaking,  and  you  can  feel  the 
vibration  the  vocal  cords  when  you  pronounce  all  the  vowels  and  some  consonants,  such  as  g 
as  in  gut,  b  as  in  but ,  and  ;?  as  in  nut.  Unvoiced  sounds  are  made  while  the  vocal  cords  are  not 
vibrating.  Several  consonants  such  as  k.  p,  and  t  are  unvoiced.  Examples  of  unvoiced  sounds 
include  h  in  hut,  c  in  cut ,  and  p  in  put. 

For  the  production  of  voiced  sounds,  the  lungs  expel  air  through  the  epiglottis,  causing 
the  vocal  cords  to  vibrate.  The  vibrating  vocal  cords  interrupt  the  airstream  and  produce  a 
quasi-penodic  pressure  wave  consisting  of  impulses.  The  pressure  wave  impulses  are  com¬ 
monly  called  pitch  impulses,  and  the  frequency  of  the  pressure  signal  is  the  pitch  frequency  or 
fundamental  trequency  as  shown  in  Fig.  6.34b.  This  is  the  part  of  the  voice  signal  that  defines 
the  speech  tone.  Speech  that  is  uttered  in  a  constant  pitch  frequency  sounds  monotonous.  In 

ordinary  cases,  the  pitch  frequency  of  a  speaker  varies  almost  constantly,  often  from  syllable 
to  syllable.  J 
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For  voiced  sound,  the  pitch  impulses  stimulate  the  air  in  the  vocal  tract  (mouth  and  nasal 
cavities).  For  unvoiced  sounds,  the  excitation  comes  directly  from  the  air  flow.  Extensive 
studies 15  17  have  shown  that  for  unvoiced  sounds,  the  excitation  to  the  vocal  tract  is  more  like  a 
broadband  noise.  When  cavities  in  the  vocal  tract  resonate  under  excitation,  they  radiate  a  sound 
wave,  which  is  the  speech  signal.  Both  cavities  form  resonators  with  characteristic  resonance 
frequencies  (formant  frequencies).  Changing  the  shape  (hence  the  resonant  characteristics)  of 
the  mouth  cavity  allows  different  sounds  to  be  pronounced.  Amazingly,  this  (vocal)  articulation 
tract  can  be  approximately  modeled  by  a  simple  linear  digital  filter  with  an  all-pole  transfer 
function 

W(z,  =  ;i!)= 

\  l=\ 

where  g  is  a  gain  factor  and  A(z)  is  known  as  the  prediction  filter,  much  like  the  feedback  filter 
used  in  DPCM  and  ADPCM.  One  can  view  the  function  of  the  vocal  articulation  apparatus  as 
a  spectral  shaping  filter  H (z). 


LPC  Models 

Based  on  this  human  speech  model,  a  voice  encoding  approach  different  from  waveform  coding 
can  be  established.  Instead  of  sending  actual  signal  samples,  the  model-based  vocoders  analyze 
the  voice  signals  segment  by  segment  to  determine  the  best-fitting  speech  model  parameters. 
As  shown  in  Fig.  6.35,  after  speech  analysis,  the  transmitter  sends  the  necessary  speech  model 
parameters  (formants)  for  each  voice  segment  to  the  receiver.  The  receiver  then  uses  the 
parameters  for  the  speech  model  to  set  up  a  voice  synthesizer  to  regenerate  the  respective 
voice  segments.  In  other  words,  what  a  user  hears  at  the  receiver  actually  consists  of  signals 
reproduced  by  an  artificial  voice  synthesizing  machine ! 

In  the  analysis  of  a  sampled  voice  segment  (consisting  of  multiple  samples),  the  pitch 
analysis  will  first  determine  whether  the  speech  is  a  voiced  or  an  unvoiced  piece.  If  the  signal 
is  classified  as  “voiced,”  the  pitch  analyzer  will  estimate  pitch  frequency  (or  equivalently  the 
pitch  period).  In  addition,  the  LPC  analyzer  will  estimate  the  all-pole  filter  coefficients  in  A(z). 
Because  the  linear  prediction  error  indicates  how  well  the  linear  prediction  filter  fits  the  voice 
samples,  the  LPC  analyzer  can  determine  the  optimum  filter  coefficients  by  minimizing  the 

r  J  ,  18  IQ 

mean  square  error  (MSE)  of  the  linear  prediction  error. 

Directly  transmitting  the  linear  prediction  (LP)  filter  parameters  is  unsound  because  the 
filter  is  very  sensitive  to  parameter  errors  due  to  quantization  and  channel  noises.  Worse  yet, 
the  LP  filter  may  even  become  unstable  because  of  small  coefficient  errors.  In  practice,  the 
stability  of  this  all-pole  linear  prediction  (LP)  filter  can  be  ensured  by  utilizing  the  modular 
lattice  filter  structure  through  the  well-known  Levinson-Durbin  algorithm.-0  21  Lattice  filter 
parameters,  known  as  reflection  coefficients  {r*},  are  less  sensitive  to  quantization  errors  and 


figure  6.35 

Ar>alysis  and 
synthesis  of  voice 
Sl9"als  in  an  LPC 
®ncoder  and 
decoder. 
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TABLE  6.1 

Quantization  Bit  Allocation  in  LPC-10  Vocoder 


Pitch  Period 

Voiced/Unvoiced 

Gain  g 

1 0  LP  Filter  Parameters,  bits/coefficient 

n~n  r5~rs  r9  no 

5  bits  4  bits  3  bits  2  bits  Voiced 

6  bits 

1  bit 

5  bits 

5  bits  Not  used  Unvoiced 

noise.  Transmission  is  further  improved  by  sending  their  log-area  ratios  (LAR),  defined  as 


or  by  sending  intermediate  values  from  the  Levinson-Durbin  recursion  known  as  the  partial 
reflection  coefficients  (PARCOR).  Another  practical  approach  is  to  find  the  equivalent  line 
spectral  pairs  (LSP)  as  representation  of  the  LPC  filter  coefficients  for  transmission  over 
channels.  LSP  has  the  advantage  of  low  sensitivity  to  quantization  noise.22,  2^  As  long  as 
the  /7th-order  all-pole  LP  filter  is  stable,  it  can  be  represented  by  p  real-valued,  line  spectral 
frequencies.  In  every  representation,  however,  a  /;th-order  synthesizer  filter  can  be  obtained 
by  the  LPC  decoder  from  the  quantization  of  p  real- valued  coefficients.  In  general  8  to  14  LP 
parameters  are  sufficient  for  vocal  tract  representation. 

We  can  now  use  a  special  LPC  example  to  illustrate  the  code  efficiency  of  such  model- 
based  vocoders.  In  the  so-called  LPC- 1 0  vocoder,*  the  speech  is  sampled  at  8  kHz.  1 80  samples 
(22.5  ms)  form  an  LPC  frame  for  transmission.-4  The  bits  per  speech  frame  are  allocated  to 
quantize  the  pitch  period,  the  voiced/unvoiced  flag,  the  filter  gain,  and  the  10  filter  coefficients, 
according  to  Table  6.1.  Thus,  each  frame  requires  between  32  (unvoiced)  and  53  (voiced)  bits. 
Adding  frame  control  bits  results  an  average  coded  stream  of  54  bits  per  speech  frame,  or  an 
overall  rate  of  2400  bit/s.24  Based  on  subjective  tests,  this  rather  minimal  LPC-10  codec  has 
low  mean  opinion  score  (MOS)  but  does  provide  highly  intelligible  speech  connections.  LPC- 
1 0  is  part  ol  the  FS- 1015,  a  low-rate  secure  telephony  codec  standard  developed  by  the  U.S. 
Department  of  Defense  in  1 984.  A  later  enhancement  to  LPC- 1 0  is  known  as  the  LPC- 1 0(e). 

Compared  with  the  64  kbit/s  PCM  or  the  32  kbit/s  ADPCM  waveform  codec,  LPC  vocoders 
are  much  more  efficient  and  can  achieve  speech  code  rates  below  9.6  kbit/s.  The  2.4  kbit/s 
LPC-10  example  can  provide  speech  digitization  at  a  rate  much  lower  than  even  the  speech 
waveform  sampling  rate  of  8  kHz.  The  loss  of  speech  quality  is  a  natural  trade-off.  To  better 
understand  the  ditterence  between  waveform  vocoders  and  the  model-based  vocoders  such  as 
LPC,  we  can  use  the  analogy  of  a  food  delivery  service.  Imagine  a  family  living  Alaska  that 
wishes  to  order  a  nice  meal  from  a  famous  restaurant  in  New  York  City.  For  practical  reasons, 
the  restaurant  would  have  to  send  prepared  dishes  uncooked  and  frozen-  then  the  family  would 
follow  the  cooking  directions.  The  food  would  probably  taste  fine,  but  the  meal  would  be 
missing  the  finesse  ol  the  original  chef.  This  option  is  like  speech  transmission  via  PCM.  The 
receiver  has  the  basic  ingredients  but  must  tolerate  the  quantization  error  (manifested  by  the 
ack  ol  the  chel  s  cooking  finesse).  To  reduce  transportation  weight,  another  option  is  for  the 
family  to  order  the  critical  ingredients  only.  The  heavier  but  common  ingredients  (such  as  rice 
an  potatoes)  can  be  acquired  locally.  This  approach  is  like  DPCM  or  ADPCM,  in  which  only 
the  unpredictable  part  of  the  voice  is  transmitted.  Finally,  the  family  can  simply  go  online  to 


So-caHed  because  i,  uses  order  p  =  10.  The  idea  is  to  allocate  two  parameters  for  each  possible  formant  frequency 
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order  the  chef’s  recipe.  All  the  ingredients  are  purchased  locally  and  the  cooking  is  also  done 
locally.  The  Alaskan  family  can  satisfy  their  gourmet  craving  without  receiving  a  single  food 
item  form  New  York!  Clearly,  the  last  scenario  captures  the  idea  of  model-based  vocoders. 
LPC  vocoders  essentially  deliver  the  recipe  (i.e.,  the  LPC  parameters)  for  voice  synthesis  at 
the  receiver  end. 

Practical  High-Quality  LP  Vocoders 

The  simple  dual-state  LPC  synthesis  of  Fig.  6.35  describes  no  more  than  the  basic  idea  behind 
model-based  voice  codecs.  The  quality  of  LP  vocoders  has  been  greatly  improved  by  a  number 
of  more  elaborate  codecs  in  practice.  By  adding  a  few  bits,  these  LP-based  vocoders  attempt 
to  improve  the  speech  quality  in  two  ways:  by  encoding  the  residual  prediction  error  and  by 
enhancing  the  excitation  signal. 

The  most  successful  methods  belong  to  the  class  known  as  code-excited  linear  prediction 
(CELP)  vocoders.  CELP  vocoders  use  a  codebook,  a  table  of  typical  LP  error  (or  residue) 
signals,  which  is  set  up  a  priori  by  designers.  At  the  transmitter,  the  analyzer  compares  the 
actual  prediction  residue  to  all  the  entries  in  the  codebook,  chooses  the  entry  that  is  the  closest 
match,  and  just  adds  the  address  (code)  for  that  entry  to  the  bits  for  transmission.  The  synthesizer 
receives  this  code,  retrieves  the  corresponding  residue  from  the  codebook,  and  uses  it  to  modify 
the  synthesizing  output.  For  CELP  to  work  well,  the  codebook  must  be  big  enough,  requiring 
more  transmission  bits.  The  FS- 1016  vocoder  is  an  improvement  over  FS- 1015  and  provides 
good  quality,  natural-sounding  speech  at  4.8  kbit/s.25  More  modem  variants  include  the  RPE- 
LTP  (regular  pulse  excitation,  long-term  prediction)  LPC  codec  used  in  GSM  cellular  systems, 
the  algebraic  CELP  (ACELP),  the  relaxed  CELP  (RCELP),  the  Qualcomm  CELP  (QCELP) 
in  CDMA  cellular  phones,  and  vector-sum  excited  linear  prediction  (VSELP).  Their  data  rates 
range  from  as  low  as  1.2  kbit/s  to  13  kbit/s  (full-rate  GSM).  These  vocoders  form  the  basis  of 
many  modem  cellular  vocoders,  voice  over  Internet  Protocol  (VoIP),  and  other  ITU-T  G-series 
standards. 

Video  Compression 

For  video  and  television  to  go  digital  we  face  a  tremendous  the  challenge.  Because  of  the  high 
video  bandwidth  (approximately  4.2  MHz),  use  of  direct  sampling  and  quantization  leads  to 
an  uncompressed  digital  video  signal  of  roughly  150  Mbit/s.  Thus,  the  modest  compression 
afforded  by  techniques  such  as  ADPCM  and  subband  coding26’ 27  is  insufficient.  The  key  to 
video  compression,  as  it  turns  out,  has  to  do  with  human  visual  perception. 

A  great  deal  of  research  and  development  has  resulted  in  methods  to  drastically  reduce  the 
digital  bandwidth  required  for  video  transmission.  Early  compression  techniques  compressed 
video  signals  to  approximately  45  Mbit/s  (DS3).  For  the  emerging  video  delivery  technolo¬ 
gies  of  HFC,  ADSL,  HDTV,  and  so  on,  however,  much  greater  compression  was  required. 
MPEG  approached  this  problem  and  developed  new  compression  techniques,  which  provide 
network  or  VCR  quality  video  at  much  greater  levels  of  compression.  MPEG  is  a  joint  effort 
of  the  International  Standards  Organizations  (ISO),  the  International  Electrotechnical  Com¬ 
mittee  (IEC),  and  the  American  National  Standards  Institute  (ANSI)  X3L3  Committee.28’ 29 
MPEG  has  a  very  informative  website  that  provides  extensive  information  on  MPEG  and  JPEG 
technologies  and  standards  (http://www.mpeg.org/index.html/).  MPEG  also  has  an  industrial 
forum  promoting  the  organization’s  products  (http://www.m4if.org/). 

The  concept  of  digital  video  compression  is  based  on  the  fact  that,  on  the  average,  a 
relatively  small  number  of  pixels  change  from  frame  to  frame.  Hence,  if  only  the  changes 
are  transmitted,  the  transmission  bandwidth  can  be  reduced  significantly.  Digitizing  allows 
the  noise-free  recovery  of  analog  signals  and  improves  the  picture  quality  at  the  receiver. 
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Compression  reduces  the  bandwidth  required  for  transmission  and  the  amount  of  storage  for  a 
video  program  and,  hence,  expands  channel  capacity.  Without  compression,  a  2-hour  digitized 
NTSC  video  program  would  require  roughly  100  gigabytes  of  storage,  far  exceeding  the 
capacity  of  any  DVD  disc. 

There  are  three  primary  MPEG  standards  in  use: 


MPEG- 1 :  Used  for  VCR-quality  video  and  storage  on  video  CD  (or  VCD)  at  a  data  rate  of  1 .5 
Mbit/s.  These  VCDs  were  quite  popular  throughout  Asia  (except  Japan).  MPEG-1 
decoders  are  available  on  most  computers.  VCD  is  also  a  very  popular  format  for 
karaoke. 

MPEG-2:  Supports  diverse  video  coding  applications  for  transmissions  ranging  in  quality 
from  VCR  to  high-definition  TV  (HDTV),  depending  on  data  rate.  It  offers  50:1 
compression  of  raw  video.  MPEG-2  is  a  highly  popular  format  used  in  DVD,  HDTV, 
terrestrial  digital  video  broadcasting  (DVB-T),  and  digital  video  broadcasting  by 
satellite  (DVB-S). 

MPEG-4  Provides  multimedia  (audio,  visual,  or  audiovisual)  content  streaming  over  differ¬ 
ent  bandwidths  including  internet.  MPEG-4  is  supported  by  Microsoft  Windows 
Media  Player,  Real  Networks,  and  Apple’s  Quicktime  and  iPod.  MPEG-4  recently 
converged  with  an  ITU-T  standard  known  as  H.264,  to  be  discussed  later. 


Figure  6.36 

(a)  Frame  1 . 

(bj  Frame  2. 

(c)  Information 

transferred 

between 

frames  1  and  2. 


The  power  of  video  compression  is  staggering.  By  comparison,  NTSC  broadcast  television  in 
digital  form  requires  45  to  1 20  Mbit/s,  whereas  MPEG-2  requires  1 .5  to  1 5  Mbit/s.  On  the  other 
hand  HDTV  would  require  800  Mbit/s  uncompressed  which,  under  MPEG-2  compression,  will 
transmit  at  19.39  Mbit/s. 

There  are  two  types  of  MPEG  compression,  which  eliminate  redundancies  in  the 
audiovisual  signals  that  are  not  perceptible  by  the  listener  or  the  viewer: 


1.  Video 


Temporal  or  interframe  compression  by  predicting  interframe  motion  and  removing 
interframe  redundancy 

Spatial  or  intraframe  compression,  which  forms  a  block  identifier  for  a  group  of  pixels 
having  the  same  characteristics  (color,  intensity,  etc.)  for  each  frame.  Only  the  block 
identifier  is  transmitted. 


2.  Audio,  which  uses  a  psychoacoustic  model  of  masking  effects. 

The  basis  tor  video  compression  is  to  remove  redundancy  in  the  video  signal  stream.  As  ai 
example  of  intertrame  redundancy,  consider  Fig.  6.36a  and  b.  In  Fig.  6.36a  the  runner  is  ii 
position  A  and  in  Fig.  6.36b  he  is  in  position  B.  Note  that  the  background  (cathedral,  buildings 
and  bridge)  remains  essentially  unchanged  from  frame  to  frame.  Figure  6.36c  represents  th 
nonredundant  information  for  transmission;  that  is,  the  change  between  the  two  frames.  Th 
runner  image  on  the  left  represents  the  blocks  of  frame  1  that  are  replaced  by  background 

in  frame  .  The  runner  image  on  the  right  represents  the  blocks  of  frame  1  that  replace  th 
background  in  trame  2. 

Video  compression  starts  with  an  encoder,  which  converts  the  analog  video  signal  froti 

o«dTammt0ra  dlg'ta‘  t0rmat  °n  a  Pixel'by-pixel  basis.  Each  video  frame  is  divide. 

Z:  III are  analyzed  by  the  encoder  to  determine  which  blocks  must  b 

takes  n  cem  ,  haVC  Slgnificant  chan§es  from  frame  to  frame.  This  proces 

takes  place  in  two  stages: 
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Figure  6.36 

(a)  Frame  1 . 

(b)  Frame  2. 

(c)  Information 
transferred 
between 

frames  1  and  2. 


1.  Motion  estimation  and  compensation.  Here  a  motion  estimator  identifies  the  areas  or  groups 
of  blocks  from  a  preceding  frame  that  match  corresponding  areas  in  the  current  frame  and 
sends  the  magnitude  and  direction  of  the  displacement  to  a  predictor  in  the  decoder.  The 
frame  difference  information  is  called  the  residual. 

2.  Transforming  the  residual  on  a  block-by-block  basis  into  more  compact  form. 

The  encoded  residual  signal  is  transformed  into  a  more  compact  form  by  means  of  a  discrete 
cosine  transform  (DCT)  (see  Sec.  6.5.2  in  Haskel  et  al.,28),  which  uses  a  numerical  value  to 
represent  each  pixel  and  normalizes  that  value  for  more  efficient  transmission.  The  DCT  is  of 
the  form 


N-\N-\ 

F(i,k)  =  COS 

n=0  m= 0 


[" (2m  -f  \)knl 

L  2JV  J  | 

L  2N  J 

where / ( n ,  m)  is  the  value  assigned  to  the  block  in  the  (/*,  m)  position.  The  inverse  transform  is 


N- \N-\ 


f(n,m)  =  - ^  £  F(/,*) 


n= 0  m= 0 


f  (2  n+  1  )jn  ] 

[  (2m  +  l)A'7r"| 

l  2N  H 

l  W  J 

The  DCT  is  typically  multiplied,  for  an  8  x  8  block,  by  the  expression  C(j)C(k)/ 4,  where 


C(x)  = 


for  x  =  0 
otherwise 


Tables  6.2  and  6.3  depict  the  pixel  block  values  before  and  after  the  DCT.  One  can 
notice  from  Table  6.3  that  there  are  relatively  few  meaningful  elements,  that  is,  elements 
with  significant  values  relative  to  the  values  centered  about  the  0, 0  position.  Because  of  this, 
most  of  the  matrix  values  may  be  assumed  to  be  zero,  and,  upon  inverse  transformation,  the 
original  values  are  quite  accurately  reproduced.  This  process  reduces  the  amount  of  data  that 
must  be  transmitted  greatly,  perhaps  by  a  factor  of  8  to  10  on  the  average.  Note  that  the  size 
of  the  transmitted  residual  may  be  that  of  an  individual  block  or,  at  the  other  extreme,  that  of 
the  entire  picture. 

The  transformed  matrix  values  of  a  block  (Table  6.4)  are  normalized  so  that  most  of  the 
values  in  the  block  matrix  are  less  than  1 .  Then  the  resulting  normalized  matrix  is  quantized  to 
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TABLE  6.2 

8x8  Pixel  Block  Residual 


n 


158 

158 

158 

163 

161 

161 

162 

162 

157 

157 

157 

162 

163 

161 

162 

162 

157 

157 

157 

160 

161 

161 

161 

161 

155 

155 

155 

162 

162 

161 

160 

159 

159 

159 

159 

160 

160 

162 

161 

159 

156 

156 

156 

158 

163 

160 

155 

150 

156 

156 

156 

159 

156 

153 

151 

144 

155 

155 

155 

155 

153 

149 

144 

139 

TABLE  6.3 

Transformed  8x8  Pixel  Block  Residual  DCT  Coefficients 


i 

1259.6 

1.0 

-12.1 

5.2 

2.1 

1.7 

-2.7 

-1.3 

22.6 

-17.5 

6.2 

-3.2 

2.9 

-0.1 

-0.4 

-1.2 

-10.9 

9.3 

-1.6 

-1.5 

0.2 

0.9 

-0.6 

0.1 

7.1 

-1.9 

-0.2 

1.5 

-0.9 

-0.1 

0.0 

0.3 

sO 

d 

1 

0.8 

1.5 

-1.6 

-0.1 

0.7 

0.6 

-1.3 

-1.8 

-0.2 

-1.6 

-0.3 

0.8 

1.5 

-1.0 

-1.0 

-1.3 

0.4 

-0.3 

1.5 

-0.5 

-1.7 

1.1 

0.8 

2.6 

1.6 

3.8 

-1.8 

-1.9 

1.2 

0.6 

-0.4 

TABLE  6.4 

Normalized  and  Quantized 
Residual  DCT  Coefficients 


in 


21 

0 

-1 

0 

0 

0 

0 

0 

2 

-1 

0 

0 

0 

0 
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0 

-1 

1 
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0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

F»gure  6.37 

Zi9zag  DCT 
coefficient 
scanning  pattern. 


obtain  Table  6.4.  Normalization  is  accomplished  by  a  dynamic  matrix  of  multiplicative  values, 
which  are  applied  element  by  element  to  the  transformed  matrix.  The  normalized  matrix  ot 
Table  6.4  is  the  block  information  transmitted  to  the  decoder.  The  denormalized  matrix  pictured 
in  Table  6.5  and  the  reconstructed  (inverse-transformed)  residual  in  Table  6.6  are  determined 
by  the  decoder.  The  transformation  proceeds  in  a  zigzag  pattern,  as  illustrated  in  Fig.  6.37. 

MPEG  approaches  the  motion  estimation  and  compensation  to  remove  temporal  (frame-to- 
frame)  redundancy  in  a  unique  way.  MPEG  uses  three  types  of  frame,  the  intraframe  or  I-fran* 
(sometimes  called  the  independently  coded  or  intracoded  frame),  the  predicted  (predictive)  or 


TABLE  6.5 

Denormalized  DCT  Coefficients 
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l'n 

1260 

0 
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0 

0 

0 

0 

0 

23 
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0 
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0 

0 

0 
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10 

0 

0 

0 

0 

0 

0 

k  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

TABLE  6.6 

Inverse  DCT  Coefficients  Reconstructed  Residual 


n 


158 

158 

158 

163 

161 

161 

162 

162 

157 

157 

157 

162 

163 

161 

162 

162 

157 

157 

157 

160 

161 

161 

161 

161 

155 

155 

155 

162 

162 

161 

160 

159 

159 

159 

159 

160 

160 

162 

161 

159 

156 

156 

156 

158 

163 

160 

155 

150 

156 

156 

156 

159 

156 

153 

151 

144 

155 

155 

155 

155 

153 

149 

144 

139 

F|gure  6.37 

Zi9*>g  DCT 
coefficient 
scanning  pattern. 


P-frame,  and  the  bidirectionally  predictive  frame  or  B-frame.  The  P-frames  are  predicted  from 
the  I-frames.  The  B-frames  are  bidirectionally  predicted  from  either  past  or  future  frames.  An 
I-frame  and  one  or  more  P-frames  and  B-frames  make  up  the  basic  MPEG  processing  pattern, 
called  a  group  of  pictures  (GOP).  Most  of  the  frames  in  an  MPEG  compressed  image  are 
B-frames.  The  I-frame  provides  the  initial  reference  for  the  frame  differences  to  start  the 
MPEG  encoding  process.  Note  that  the  bidirectional  aspect  of  the  procedure  introduces  a 
delay  in  the  transmission  of  the  frames.  This  is  because  the  GOP  is  transmitted  as  a  unit  and, 
hence,  transmission  cannot  start  until  the  GOP  is  complete  (Fig.  6.38).  The  details  of  the 
procedure  are  beyond  the  scope  of  this  text.  There  are  many  easily  accessible  books  that  cover 
this  subject  in  detail.  In  addition,  one  may  find  numerous  references  to  MPEG  compression 
and  HDTV  on  the  internet. 
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Figure  6.38 

MPEG  temporal 
frame  structure. 


Bidirectional  interpolation 


Other  Video  Compression  Standards 

We  should  mention  that  in  addition  to  MPEG,  there  is  a  parallel  attempt  by  ITU-T  to  standardize 
video  coding.  These  standards  apply  similar  concepts  for  video  compression.  Today,  the  well- 
known  ITU-T  video  compression  standards  are  the  H.26x  series,  including  H.261,  H.263,  and 
H.264.  H.261  was  developed  for  transmission  of  video  at  a  rate  of  multiples  of  64  kbit/s  in 
applications  such  as  videophone  and  videoconferencing.  Similar  to  MPEG  compression,  H.261 
uses  motion-compensated  temporal  prediction. 

H.263  was  designed  for  very  low  bit  rate  coding  applications,  such  as  videoconferencing. 
It  uses  block  motion-compensated  DCT  structure  for  encoding.30  Based  on  H.261,  H.263  is 
better  optimized  for  coding  at  low  bit  rates  and  achieves  much  higher  efficiency  than  H.261 
encoding.  Flash  Video,  a  highly  popular  format  for  video  sharing  on  many  web  engines  such 
as  YouTube  and  MySpace,  uses  a  close  variant  of  the  H.263  codec  called  the  Sorenson  Spark 
codec. 

In  fact,  H.264  represents  a  recent  convergence  between  ITU-T  and  MPEG  and  is  a  joint 
effort  of  the  two  groups.  Also  known  as  MPEG-4  Part  10,  H.264  typically  outperforms  MPEG- 
2  by  cutting  the  data  rate  nearly  in  half.  This  versatile  standard  supports  video  applications 
over  multiple  levels  ot  bandwidth  and  quality,  including,  mobile  phone  service  at  50  to  60 
kbit/s,  Internet/standard  definition  video  at  1  to  2  Mbit/s,  and  high-definition  video  at  5  to  8 
Mbit/s.  H.264  is  also  supported  in  many  other  products  and  applications  including  iPod,  direct 
broadcasting  satellite  TV,  some  regional  terrestrial  digital  TV,  Mac  OS  X  (Tiger),  and  Sony’s 
Playstation  Portable. 


A  Note  on  High-Definition  Television  (HDTV) 

Utilizing  MPEG-2  for  video  compression,  high-definition  television  (HDTV)  is  one  of  the 
advanced  television  (ATV)  functions  along  with  525-line  compressed  video  for  direct  broadcast 
satellite  (DBS)  or  cable.  The  concept  of  HDTV  appeared  in  the  late  1970s.  Early  development 
work  was  performed  primarily  in  Japan  based  on  an  analog  system.  In  the  mid- 1 980s  it  became 
apparent  that  the  bandwidth  requirements  of  an  analog  system  would  be  excessive,  and  work 
began  on  a  digital  system  that  could  utilize  the  6  MHz  bandwidth  of  NTSC  television.  In 
the  early  1990s  seven  digital  systems  were  proposed,  but  testing  indicated  that  none  would 
be  highly  satisfactory.  Therefore,  in  1993  the  FCC  suggested  the  formation  of  an  industrial 
Grand  Alliance”  (GA)  to  develop  a  common  HDTV  standard.  In  December  1997,  Standard 
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A/53  tor  broadcast  transmission,  proposed  by  the  Advanced  Television  Systems  Committee 
(ATSC),  was  finalized  by  the  FCC  in  the  United  States. 

The  GA  HDTV  standard  is  based  on  a  16:9  aspect  ratio  (motion  picture  aspect  ratio) 
rather  than  the  4:3  aspect  ratio  of  NTSC  television.  HDTV  uses  MPEG-2  compression  at 
19.39  Mbit/s  and  a  digital  modulation  format  called  8-VSB  (vestigial  sideband),  which  uses 
an  eight-amplitude-level  symbol  to  represent  3  bits  of  information.  Transmission  is  in  207- 
byte  blocks,  which  include  20  parity  bytes  for  Reed-Solomon  forward  error  correction.  The 
remaining  187-byte  packet  format  is  a  subset  of  the  MPEG-2  protocol  and  includes  headers 
for  timing,  switching,  and  other  transmission  control. 

The  Advanced  Television  Systems  Group,  the  successor  to  the  Grand  Alliance,  has  been 
developing  standards  and  recommended  practices  for  HDTV.  These  are  found,  along  with  a 
great  deal  of  other  information,  on  their  website:  http://www.atsc.org/. 


6.9  MATLAB  EXERCISES 

In  the  MATLAB  exercises  of  this  section,  we  provide  examples  of  signal  sampling,  signal 
reconstruction  from  samples,  uniform  quantization,  pulse-coded  modulation  (PCM),  and  delta 
modulation  (DM). 

Sampling  and  Reconstruction  of  Lowpass  Signals 

In  the  sampling  example,  we  first  construct  a  signal  g(t)  with  two  sinusoidal  components  of 
1 -second  duration;  their  frequencies  are  1  and  3  Hz.  Note,  however,  that  when  the  signal 
duration  is  infinite,  the  bandwidth  of  g(t)  would  be  3  Hz.  However,  the  finite  duration  of  the 
signal  implies  that  the  actual  signal  is  not  band-limited,  although  most  of  the  signal  content 
stays  within  a  bandwidth  of  5  Hz.  For  this  reason,  we  select  a  sampling  frequency  of  50 
Hz,  much  higher  than  the  minimum  Nyquist  frequency  of  6  Hz.  The  MATLAB  program. 
Exsample. m,  implements  sampling  and  signal  reconstruction.  Figure  6.39  illustrates  the 
original  signal,  its  uniform  samples  at  the  50  Hz  sampling  rate,  and  the  frequency  response  of 
the  sampled  signal.  In  accordance  with  our  analysis  of  Section  6.1,  the  spectrum  of  the  sampled 
signal  gT(t)  consists  of  the  original  signal  spectrum  periodically  repeated  every  50  Hz. 


%  ( Exsample. m) 

%  Example  of  sampling,  quantization,  and  zero-order  hold 
clear; elf ; 

td=0 . 002  ,*  %original  sampling  rate  500  Hz 

t= [0 : td: 1 . ] ;  %time  interval  of  1  second 
xsig=sin (2*pi*t ) -sin(6*pi*t) ;  %  lHz+3Hz  sinusoids 

Lsig=length (xsig) ; 

t s  =  0 . 0 2 ;  %new  sampling  rate  =  50Hz. 

Nfactor=ts/td; 

%  send  the  signal  through  a  16-level  uniform  quantizer 
[s_out ,  sq_out , sqh_out , Delta, SQNR] =sampandquant (xsig, 16, td, ts) ; 

%  receive  3  signals: 

%  1.  sampled  signal  s_out 

%  2.  sampled  and  quantized  signal  sq_out 

%  3.  sampled,  quantized,  and  zero-order  hold  signal  sqh_out 

% 
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Figure  6.39 

The  relation¬ 
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uniformly 
sampled  signal 
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%  calculate  the  Fourier  transforms 
Lf  ft=2/sceil  (log2  (Lsig)  +1)  ; 

Fmax=l/ (2*td) ; 

Faxis=l inspace ( -Fmax, Fmax,  Lf ft ) ; 

Xsig=f f tshif t (fft (xsig,Lfft) ) ; 

S_out=f ft shift (fft (s_out,Lfft) ) ; 

%  Examples  of  sampling  and  reconstruction  using 
%  a)  ideal  impulse  train  through  LPF 

%  b)  flat  top  pulse  reconstruction  through  LPF 

%  plot  the  original  signal  and  the  sample  signals  in  time 
%  and  frequency  domain 
figure (1) ; 

subplot (311) ;  sfigla=plot (t,xsig, 'k' ) ; 
hold  on;  sf iglb=plot ( t , s_out ( 1 : Lsig ) , ' b ' ) ;  hold  off¬ 
set  (sfigla,  'Linewidth' ,2) ;  set (sfiglb, 'Linewidth' ,2.)  ; 
xlabel(/time  (sec)'); 

title ( ' Signal  {\it  g}({\it  t})  and  its  uniform  samples'); 
subplot (312) ;  sf iglc=plot (Faxis, abs (Xsig) ) • 
xlabel (' frequency  (Hz) ' ) ; 
axis (  [-150  150  0  300] ) 

set (sf iglc, ' Linewidth' , 1) ;  title (' Spectrum  of  {\it  g}({\it  t}) ' > 
subplot (313) ;  sfigld=plot (Faxis, abs (S_out) ) ; 
xlabel (' frequency  (Hz) ' ) ; 
axis ([-150  150  0  300/Nfactor] ) 
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set (sfiglc, ' Linewidth' , 1) ;  title (' Spectrum  of  {\it  g}  T({\it  t})'); 
%  calculate  the  reconstructed  signal  from  ideal  sampling  and 
%  ideal  LPF 

%  Maximum  LPF  bandwidth  equals  to  BW=f loor ( (Lf f t/Nfactor) /2) ; 
BW=1°;  %Bandwidth  is  no  larger  than  10Hz. 

H_lpf  =  zeros (1#  Lf ft) ;H_lpf (Lf f t/2-BW:Lf ft/2+BW-l) =1;  %ideal  LPF 
S_recv=Nfactor*S_out . *H_lpf ;  %  ideal  filtering 

s_recv=real (if ft (f f tshif t (S_recv) ) ) ;  %  reconstructed  f-domain 

s_recv=s_recv (1 :Lsig) ;  %  reconstructed  t-domain 

%  plot  the  ideally  reconstructed  signal  in  time 
%  and  frequency  domain 
figure (2) 

subplot  (211) ;  sf ig2a=plot (Faxis, abs (S_recv)  ) ; 
xlabel (' frequency  (Hz)'); 
axis (  [-150  150  0  300] ) ; 

title (' Spectrum  of  ideal  filtering  (reconstruction)'); 
subplot  (212) ;  sf ig2b=plot (t ,xsig, ' k- . ' , t , s_recv (1 : Lsig) , ' b' )  ; 
legend ( ' original  signal ' , ' reconstructed  signal ' ) ; 
xlabel(#time  (sec)'); 

title (' original  signal  versus  ideally  reconstructed  signal'); 
set (sfig2b# 'Linewidth' ,2) ; 

%  non-ideal  reconstruction 
ZOH=ones (l,Nf actor) ; 

s_ni=kron (downsample (s_out,Nf act or) ,  ZOH) ; 

S_ni=f ft shift (f ft (s_ni, Lfft) ) ; 

S_recv2=S_ni . *H_lpf ;  %  ideal  filtering 

s_recv2=real (if f t (f ftshift (S_recv2) ) ) ;  %  reconstructed  f-domain 

s_recv2=s_recv2 (1 :Lsig)  ;  %  reconstructed  t-domain 

%  plot  the  ideally  reconstructed  signal  in  time 
%  and  frequency  domain 
figure (3) 

subplot  (211) ;  sf ig3a=plot (t,xsig, 'b' , t , s_ni (1 : Lsig) , 'b' ) ; 
xlabel('time  (sec)'); 

title (' original  signal  versus  flat-top  reconstruction'); 
subplot (212 ) ;  sfig3b=plot (t^xsig, 'b' , t, s_recv2 (1 :Lsig)  ,  'b--') ; 
legend ('original  signal', 'LPF  reconstruction'); 
xlabel ( ' time  (sec) ' ) ; 

set (sfig3a, 'Linewidth' ,2) ;  set (sfig3b, 'Linewidth' ,2) ; 
title ('original  and  flat- top  reconstruction  after  LPF'); 


To  construct  the  original  signal  g(t)  from  the  impulse  sampling  train  gT(t ),  we  applied  an 
ideal  low-pass  filter  with  bandwidth  10  Hz  in  the  frequency  domain.  This  corresponds  to  the 
interpolation  using  the  ideal  sine  function  as  shown  in  Sec.  6.1.1.  The  resulting  spectrum,  as 
shown  in  Fig.  6.40,  is  nearly  identical  to  the  original  message  spectrum  of  g(t).  Moreover,  the 
time  domain  signal  waveforms  are  also  compared  in  Fig.  6.40  and  show  near  perfect  match. 

In  our  last  exercise  in  sampling  and  reconstruction,  given  in  the  same  program,  we  use 
a  simple  rectangular  pulse  of  width  Ts  (sampling  period)  to  reconstruct  the  original  signal 
from  the  samples  (Fig.  6.41).  A  low-pass  filter  is  applied  on  the  rectangular  reconstruction 
and  also  shown  in  Fig.  6.41 .  It  is  clear  from  comparison  to  the  original  source  signal  that  the 
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Figure  6.40 

Reconstructed 
signal  spectrum 
and  waveform 
from  applying 
the  ideal  impulse 
sampling  and 
ideal  low-pass 
filter  reconstruc¬ 
tion. 


Spectrum  of  ideal  filtering  (reconstruction) 


Figure  6.41 

Reconstructed 
signal  spectrum 
and  waveform 
from  applying 
the  simple 
rectangular 
reconstruction 
pulse  (Fig.  6.6) 
followed  by  LPF 
without 
equalization. 


recovered  signal  is  still  very  close  to  the  original  signal  g(t).  This  is  because  we  have  chosen 
a  high  sampling  rate  such  that  7),  =  Ts  is  so  small  that  the  approximation  of  Eq.  (6.17)  holds. 
Certainly,  based  on  our  analysis,  by  applying  the  low-pass  equalization  filter  of  Eq.  (6.16),  the 
reconstruction  error  can  be  greatly  reduced. 

PCM  Illustration 

The  uniform  quantization  of  an  analog  signal  using  L  quantization  levels  can  be  implemented 
by  the  MATLAB  function  uniquan  .m. 


%  (uniquan. m) 

function  [q_out ,  Delta ,  SQNR]  =uniquan  ( sig__in,  L) 

%  Usage 

%  t<3__out ,  Delta,  SQNR]  =uniquan  (sig  in,L) 

%  L  -  number  of  uniform  quantization  levels 
%  sig_in  -  input  signal  vector 

%  Function  outputs: 

%  q_out  -  quantized  output 

%  Delta  -  quantization  interval 

SQNR  -  actual  signal  to  quantization  noise  ratio 
sig_pmax-max(sig_in) ;  %  finding  the  positive  peak 

sig_nmax=min(sig_in) ;  %  finding  the  negative  peak 

Delta= (sig_pmax-sig_nmax) /L;  %  quantization  interval 
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Figure  6.41 

Reconstructed 
signal  spectrum 
and  waveform 
from  applying 
the  simple 
rectangular 
reconstruction 


without 

equalization. 


Time,  second 


q_level=sig_nmax+Delta/2 : Delta : sig_pmax-Delta/2 ;  %  define  Q-levels 

L_sig= length (sig_in) ;  %  find  signal  length 

sigp= (sig_in-sig_nmax) /Delta+1/2 ;  %  convert  into  1/2  to  L+l/2  range 

qindex=round (sigp) ;  %  round  to  1,  2,  ...  L  levels 

qindex=min(qindex,L) ;  %  eleminate  L+l  as  a  rare  possibility 

q_out=q_level (qindex) ;  %  use  index  vector  to  generate  output 

SQNR=20*logl0  (norm  (sig_in)  /norm  (sig_in-q__out )  )  ;  %actual  SQNR  value 
end 


The  function  sampandquant  .m  executes  both  sampling  and  uniform  quantization 
simultaneously.  The  sampling  period  ts  is  needed,  along  with  the  number  L  of  quantization 
levels,  to  generate  the  sampled  output  s_out,  the  sampled  and  quantized  output  sq_out, 
and  the  signal  after  sampling,  quantizing,  and  zero-order-hold  sqh_out. 


%  (sampandquant .m) 

function  [s_out,  sq_out,  sqh_out,  Delta,  SQNR]  =sampandquant  (sig_in,  L,  td,  ts) 
%  Usage 

%  [ s_out  #  sq_out ,  sqh_out , Delta ,  SQNR]  =  sampandquant  (sig_in#  L,  td,  fs) 

%  l  number  of  uniform  quantization  levels 

%  sig  in  -  input  signal  vector 

%  td  -  original  signal  sampling  period  of  sig_in 
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%  ts  new  sampling  period 

%  NOTE:  td*fs  must  be  a  positive  integer; 

%  Function  outputs: 

%  s_out  -  sampled  output 

%  sq_out  -  sample-and-quantized  output 

%  sqh_out-  sample,  quantize,  and  hold  output 

%  Delta  -  quantization  interval 

%  SQNR  -  actual  signal  to  quantization  noise  ratio 

if  (rem(ts/td, 1) ==0) 
nf ac=round (ts/td) ; 
p_zoh=ones ( 1 , nf ac) ; 
s_out=downsample (sig_in, nfac)  ; 

[sq_out, Delta, SQNR] =uniquan ( s_out , L ) ; 
s_out=upsample (s_out , nf ac) ; 
sqh_out=kron (sq_out , p_zoh) ; 
sq_out=upsample (sq_out , nf ac) ; 
else 

warning ( ' Error !  ts/td  is  not  an  integer!'); 
s _ out  =  []  ; sq_out=  []  ;sqh__out=  []  ;Delta=  []  ;SQNR=  []  ; 

end 

end 


The  MATLAB  program  ExPCM.m  provides  a  numerical  example  that  uses  these  two 
MATLAB  functions  to  generate  PCM  signals. 


%  (ExPCM.m) 

%  Example  of  sampling,  quantization,  and  zero-order  hold 
clear; cl f ; 

td=0 . 002  ,*  %original  sampling  rate  500  Hz 

t= [0 : td: 1 . ] ;  %time  interval  of  1  second 
xsig=sin (2*pi*t) -sin(6*pi*t) ;  %  lHz+3Hz  sinusoids 

Lsig=length (xsig) ; 

Lf ft=2~ceil  (log2 (Lsig) +1) ; 

Xsig=f f tshif t (fft (xsig,Lfft) ) ; 

Fmax=l/ ( 2  *  td) ; 

Faxis=linspace ( -Fmax, Fmax, Lf f t) ; 
ts=0.02;  %new  sampling  rate  =  50Hz. 

Nfact=ts/td; 

%  send  the  signal  through  a  16-level  uniform  quantizer 

[s_out , sq_out , sqh_outl( Delta, SQNR] =sampandquant (xsig, 16 , td,  ts)  ; 

%  obtained  the  PCM  signal  which  is 

l  "  samPled'  quantized,  and  zero-order  hold  signal  sqh_out 

!.  Pl<*  thS  ori9inal  signal  and  the  PCM  signal  in  time  domain 
figure (1) ; 

SU^1°t(211);Sfi9l=pl0t(t'xsi9''k'-t-sqh  out  1  ( 1 :  Ls  ig )  ,  '  b '  )  ; 

set(sfigl, 'Linewidth' ,2) ;  ~ 

title  1 'Signal  (lit  g(({\it  t}>  and  it.  16  level  PCM  signal') 
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xlabel ( 7  time  (sec . ) ' ) ; 

%  send  the  signal  through  a  16 -level  uniform  quantizer 
[s_out  #  sq_out ,  sqh_out2/  Delta,  SQNR]  =sampandquant  (xsig,  4  ,  td,  ts)  ; 

%  obtained  the  PCM  signal  which  is 

%  -  sampled,  quantized,  and  zero-order  hold  signal  sqh_out 

%  plot  the  original  signal  and  the  PCM  signal  in  time  domain 
subplot (212) ; sf ig2=plot (t,xsig, 'k' , t, sqh_out2 (1 sLsig) , 'b' ) ; 
set (sf ig2 , ' Linewidth' , 2) ; 

title ( ' Signal  {\it  g}({\it  t})  and  its  4  level  PCM  signal') 
xlabel ( # time  (sec . ) ' ) ; 

Lf ft=2~ceil (log2 (Lsig) +1) ; 

Fmax=l/ (2*td) ; 

Faxis=l inspace ( -Fmax, Fmax, Lf f t) ; 

SQHl  =  f ft shift (f ft (sqh_outl, Lf f t)  )  ; 

SQH2=f ft shift (f f t (sqh_out2 , Lf f t ) ) ; 

%  Now  use  LPF  to  filter  the  two  PCM  signals 
BW=10 ;  %Bandwidth  is  no  larger  than  10Hz. 

H_lpf= zeros (1, Lf f t) ;H_lpf (Lf f t/2-BW : Lf f t/2+BW-l ) =1 ;  %ideal  LPF 
Sl_recv=SQHl . *H_lpf ;  %  ideal  filtering 

s_recvl  =  real (if f t (f ftshif t (Sl_recv) )  )  ;  %  reconstructed  f -domain 

s_recvl=s_recvl (1 :Lsig) ;  %  reconstructed  t-domain 

S2_recv=SQH2 . *H_lpf ;  %  ideal  filtering 

s_recv2=real (if ft (f ftshif t (S2_recv) )) ;  %  reconstructed  f-domain 

s_recv2=s_recv2 (1 :Lsig) ;  %  reconstructed  t-domain 

%  Plot  the  filtered  signals  against  the  original  signal 
figure  (2) 

subplot (211) ; sf ig3=plot (t,xsig, 'b- ' , t, s_recvl, 'b- . ' ) ; 
legend ( ' original ' , ' recovered' ) 
set (sf ig3 , ' Linewidth'  ,  2) ; 

title (' Signal  {\it  g}({\it  t})  and  filtered  16-level  PCM  signal') 
xlabel ( ' time  (sec . ) ' ) ; 

subplot (212 ) ; sf ig4=plot (t ,xsig, 'b-' , t , s_recv2 ( 1 : Lsig) ,'b-.') ; 

legend ( ' original' , ' recovered' ) 
set (sf ig4 , ' Linewidth'  ,  2)  ; 

title (' Signal  {\it  g } ( { \ it  t})  and  filtered  4-level  PCM  signal') 
xlabel ('time  (sec.)'); 


In  the  first  example,  we  maintain  the  50  Hz  sampling  frequency  and  utilize  L  =  16 
uniform  quantization  levels.  The  resulting  PCM  signal  is  shown  in  Fig.  6.42.  This  PCM 
signal  can  be  low-pass-filtered  at  the  receiver  and  compared  against  the  original  message 
signal,  as  shown  in  Fig.  6.43.  The  recovered  signal  is  seen  to  be  very  close  to  the  original 
signal  g(t). 

To  illustrate  the  effect  of  quantization,  we  next  apply  L  =  4  PCM  quantization  levels.  The 
resulting  PCM  signal  is  again  shown  in  Fig.  6.42.  The  corresponding  signal  recovery  is  given 
in  Fig.  6.43.  It  is  very  clear  that  smaller  number  of  quantization  levels  (L  =  4)  leads  to  much 
larger  approximation  error. 
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Figure  6.42 

Original  signal 
and  the  PCM 
signals  with 
different  numbers 
of  quantization 
levels. 


Signal  g(t)  and  its  16-level  PCM  signal 


Figure  6.43 

Comparison 
between  the 
original  signal 
and  the  PCM 
signals  after 
low-pass  filtering 
to  recover  the 
original 
message. 


Delta  Modulation 

Instead  of  applying  PCM.  we  illustrate  the  practical  effect  of  stepsize  selection  A  in  the  design 
ot  DM  encoder.  The  basic  function  to  implement  DM  is  given  in  deltamod .  m. 


%  (deltamod. m) 

function  s_DMout=  deltamod (sig_in, Delta, td, ts) 

%  Usage 

%  s_DMout  =  deltamod (xsig, Delta, td, ts) ) 

%  Delta  -  DM  stepsize 

%  sig_in  -  input  signal  vector 

td  -  original  signal  sampling  period  of 
%  ts  new  sampling  period 

%  NOTE:  td*fs  must  be  a  positive  integer; 

%  Function  outputs: 

%  s_DMout  -  DM  sampled  output 

if  (rem ( ts/td, 1) ==0) 
nfac=round (ts/td) ; 
p_zoh=ones (l,nfac) ; 


s_down= down sample (sig_in/ nf ac) ; 
Num_it=length (s  down); 


sig_in 
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Figure  6.43 

Comparison 
between  the 
original  signal 
and  the  PCM 
signals  after 
low-pass  filtering 
to  recover  the 
original 
message. 


Signal  g(t)  and  filtered  4-PCM  signal 


s_DMout (1) = -Delta/2 ; 
for  k=2:Num_it 

xvar=s_DMout (k-1)  ; 

s_DMout (k) =xvar+Delta*sign (s_down (k-1) -xvar) ; 

end 

s_DMout=kron (s_DMout , p_zoh) ; 
else 

warning (' Error !  ts/td  is  not  an  integer!'); 
S_DMout  =  []  ; 

end 

end 


To  generate  DM  signals  with  different  step  sizes,  we  apply  the  same  signal  g{t)  as  used 
in  the  PCM  example.  The  MATLAB  program  ExDM .  m  applies  three  step  sizes:  A|  =  0.2, 
A2  =  2 A  i ,  and  A3  =  4 A 1 . 


%  (ExDM.m) 

%  Example  of  sampling,  quantization,  and  zero-order  hold 
clear ;clf ; 
td=0 . 002 ; 


%original  sampling  rate  500  Hz 
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t= [0 : td : 1 . ] ;  %time  interval  of  1  second 
xsig=sin (2*pi*t ) -sin(6*pi*t) ;  %  lHz+3Hz  sinusoids 

Lsig=length (xsig) ; 

ts=0 . 02 ;  %new  sampling  rate  =  50Hz. 

Nf act=ts/td ; 

%  send  the  signal  through  a  16 -level  uniform  quantizer 
Deltal=0.2;  %  First  select  a  small  Delta=0.2  in  DM 

s_DMoutl=deltamod (xsig, Deltal , td, ts) ; 

%  obtained  the  DM  signal 

%  plot  the  original  signal  and  the  DM  signal  in  time  domain 
figure (1) ; 

subplot (311) ; sf igl=plot (t,xsig, 'k'  ,  t, s_DMoutl (1 :Lsig) ,  ' b' )  ; 
set (sf igl , ' Linewidth' , 2) ; 

title (' Signal  {\it  g}({\it  t})  and  DM  signal') 
xlabel('time  (sec.)');  axis([0  1  -2.2  2.2]); 

% 

%  Apply  DM  again  by  doubling  the  Delta 
Delta2=2*Deltal;  % 

s_DMout2=deltamod (xsig, Del ta2  ,  td,  ts)  ; 

%  obtained  the  DM  signal 

%  plot  the  original  signal  and  the  DM  signal  in  time  domain 
subplot (312 ) ; sf ig2=plot (t , xsig, 'k' , t , s_DMout2 ( 1 : Lsig)  , 'b' )  ; 
set (sf ig2 , ' Linewidth' , 2) ; 

title (' Signal  { \ i t  g}({\it  t})  and  DM  signal  with  doubled  stepsize') 
xlabel ( ' time  (sec.)');  axis([0  1  -2.2  2.2]); 

% 

Delta3=2*Delta2 ;  %  Double  the  DM  Delta  again. 

s_DMout  3  =de 1 1 amod (xsig, Delta3 , td, ts ) ; 

-s  plot  the  original  signal  and  the  DM  signal  in  time  domain 
subplot (313) ; sf ig3=plot (t,xsig, 'k' , t , s_DMout3 ( 1 : Lsig) , 'b'  )  ; 
set (sf ig3 , ' Linewidth' , 2) ; 

title (  Signal  {\it  g } ( { \ it  t})  and  DM  signal  with  quadrupled 
stepsize' ) 

xlabel ('time  (sec.)');  axis([0  1  -2.2  2.2]); 


To  illustrate  the  eltect  of  DM,  the  resulting  signals  from  the  DM  encoder  are  shown  in 
Fig.  6.44.  This  example  clearly  shows  that  when  the  step  size  is  too  small  (A|),  there  is  a 
severe  overloading  effect  as  the  original  signal  varies  so  fast  that  the  small  step  size  is  unable 
to  catch  up.  Doubling  the  DM  step  size  clearly  solves  the  overloading  problem  in  this  example- 
However,  quadrupling  the  step  size  (A3)  would  lead  to  unnecessarily  large  quantization  error. 
This  example  thus  confirms  our  earlier  analysis  that  a  careful  selection  of  the  DM  step  size  is 
critical. 


Figure  6.44 

Examples  of 
delta  modulation 
output  with  three 
different  step 
sizes:  (a)  small 
step  size  leads  to 
overloading; 

(b)  reasonable 
step  size; 

(c)  large  step 
size  causes  large 
quantization 
errors. 
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PROBLEMS 


6.1-1  Figure  P6.I-1  shows  Fourier  spectra  of  signals  g|(r)  and  g2(t).  Determine  the  Nyquist  interval 
and  the  sampling  rate  for  signals  g,  (/),  g2(t),  gj(t),  g">(,),* and  g,  (t)g2(t). 

Hint:  Use  the  frequency  convolution  and  the  width  property  of  the  convolution. 


Figure  P.6.1-1 


Fi9ure  P.6.1-5 
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Show  that  the  signal  g{t)  can  be  recovered  by  passing  g(t)  through  an  ideal  low-pass  filter  of 
bandwidth  B  Hz  and  a  gain  of  4. 


6.1-4  A  low-pass  signal  g(t)  sampled  at  rate  of fs  >  2 B  needs  reconstruction.  The  sampling  interval  is 

Ts  =  1  //,. 

(a)  If  the  reconstruction  pulse  used  is 


specify  the  equalizer  filter  £(/)  to  recover  g(t). 

(b)  If  the  reconstruction  pulse  used  is 


specify  the  equalizer  filter  E(f)  to  recover  g(t). 

(c)  If  the  reconstruction  pulse  used  is 


27Tt_ 

17 


p(t)  =  sin 


specify  the  equalizer  filter  E(f)  to  recover  g(/). 
(d  )  If  the  reconstruction  pulse  used  is 


specify  the  equalizer  filter  £(/)  to  recover  g(t). 

(e)  Sketch  and  compare  the  recovered  signals  of  parts  (a)  to  (d)  in  the  time  domain. 

6.1-5  Signals  g \ (/)  =  104  n(  104/)  and  g2(t)  =  5(0  are  applied  at  the  inputs  of  ideal  low-pass  filters 
H |  (/)  =  n(//20,000)  and  H2(f)  =  ri(//IO.OOO)  (Fig.  P6.1-5).  The  outputs  vj  (/)  and  .v2(r) 
of  these  filters  are  multiplied  to  obtain  the  signal  >’(0  =  yi(/).V2(/).  Find  the  Nyquist  rate  of 
y  1  (t),y2(t),  and  y(t).  Use  the  convolution  property  and  the  width  property  of  convolution  to 
determine  the  bandwidth  of  yj  (t)v2(t ).  See  also  Prob.  6.1-1. 


figure  P.6.1-5 


6.1-6  A  zero-order-hold  circuit  (Fig.  P6. 1-6)  is  often  used  to  reconstruct  a  signal  g(t)  from  its  samples. 

(a)  Find  the  unit  impulse  response  of  this  circuit. 

(b)  Find  the  transfer  function  H(f)  and  sketch  \H (/)|. 

(c)  Show  that  when  a  sampled  signal  g(t)  is  applied  at  the  input  of  this  circuit,  the  output  is  a 
staircase  approximation  of  g(t).  The  sampling  interval  is  Ts. 
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Figure  P.6. 1*6 


6.1- 7  (a)  A  first-order-hold  circuit  can  also  be  used  to  reconstruct  a  signal  g(t )  from  its  samples.  The 

impulse  response  of  this  circuit  is 

where  Ts  is  the  sampling  interval.  Consider  a  typical  sampled  signal  g(/)  and  show  that  this 
circuit  performs  the  linear  interpolation.  In  other  words,  the  filter  output  consists  of  sample  tops 
connected  by  straight-line  segments.  Follow  the  procedure  discussed  in  Sec.  6.1.1  (Fig.  6.2b). 

(b)  Determine  the  transfer  function  of  this  filter  and  its  amplitude  response,  and  compare  it  with 
the  ideal  filter  required  for  signal  reconstruction. 

(c)  This  filter,  being  noncausal,  is  unrealizable.  Suggest  a  modification  that  will  make  this  filter 
realizable.  How  would  such  a  modification  affect  the  reconstruction  of  g(t)  from  its  samples? 
How  would  it  affect  the  frequency  response  of  the  filter? 

6.1- 8  Prove  that  a  signal  cannot  be  simultaneously  time-limited  and  band-limited. 

Hint.  Show  that  the  contrary  assumption  leads  to  contradiction.  Assume  a  signal  simultaneously 
time-limited  and  band-limited  so  that  G(f)  =  Ofor|/|  >  B.  In  this  case,  G(f)  =  G(f)  U(f  /2B') 

for  B  >  B.  This  means  that  g(t)  is  equal  to  g(t)  *  2£'sinc  (InB't).  Show  that  the  latter  cannot 
be  time-limited. 

6.1- 9  The  American  Standard  Code  for  Information  Interchange  (ASCII)  has  128  binary-coded 

characters.  If  a  certain  computer  generates  100,000  characters  per  second,  determine  the 
following. 

(a)  The  number  of  bits  (binary  digits)  required  per  character. 

(b)  The  number  of  bits  per  second  required  to  transmit  the  computer  output,  and  the  minimum 
bandwidth  required  to  transmit  this  signal. 

U )  I  or  single  error  detection  capability,  an  additional  bit  (parity  bit)  is  added  to  the  code  of  each 
character.  Modify  your  answers  in  parts  (a)  and  (b)  in  view  of  this  information. 

6.1-10  In  a  nonideal  sampler,  the  averaging  filter  impulse  response  is  shown  to  be 


<7(0 


■G£0- 


r?/v  iS  t0  be  higher  than  ,he  Ny^'«  frequency.  Design  a  reconstructed 

gam  o  recover  t  e  original  analog  signal.  Determine  all  the  necessary  filter  responses. 


bandwidth  equals  15  kHz.^  S'8nalS  d'gltally  by  usinS  PCM  Assume  that  the  audio  signal 


<a)  deterrnineTh 1  Samp/^S  ^^"liformly  quantized  intoL  =  65, 536  levels  and  then  binary-coded. 

determine  the  number  ol  binary  digits  required  to  encode  a  sample. 

<b>  IlilToT?0  T31  haSaVerage  P°wer  of  01  W  and  peak  voltage  of  I  V.  Find  the  resulting 
ratio  of  signal  to  quantization  noise  (SQNR)  of  the  uniform  quantizer  output  in  part  (a). 
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(c)  Determine  the  number  of  binary  digits  per  second  (bit/s)  required  to  encode  the  audio  signal. 

(d)  For  practical  reasons  discussed  in  the  text,  signals  are  sampled  at  a  rate  well  above  the  Nyquist 
rate.  Practical  CDs  use  44,100  samples  per  second.  If  L  —  65,536,  determine  the  number 
of  bits  per  second  required  to  encode  the  signal  and  the  minimum  bandwidth  required  to 
transmit  the  encoded  signal. 

6.2- 2  A  television  signal  (video  and  audio)  has  a  bandwidth  of  4.5  MHz.  This  signal  is  sampled, 

quantized,  and  binary-coded  to  obtain  a  PCM  signal. 

(a)  Determine  the  sampling  rate  if  the  signal  is  to  be  sampled  at  a  rate  20%  above  the  Nyquist 
rate. 

(b)  If  the  samples  are  quantized  into  1024  levels,  determine  the  number  of  binary  pulses  required 
to  encode  each  sample. 

(c)  Determine  the  binary  pulse  rate  (bits  per  second)  of  the  binary-coded  signal  and  the  minimum 
bandwidth  required  to  transmit  this  signal. 

6.2- 3  In  a  satellite  radio  system,  128  stations  of  stereo  quality  are  to  be  packaged  in  one  data  stream. 

For  each  station,  two  (left  and  right)  signals  of  bandwidth  15, 000  Hz  are  sampled,  quantized,  and 
binary-coded  into  PCM  signals.  The  transmitter  must  multiplex  the  data  from  the  128  stations 
into  a  single  stream  via  time  multiplexing. 

(a)  If  the  maximum  acceptable  quantization  error  in  sample  amplitudes  is  0.25%  of  the  peak 
signal  voltage,  find  the  minimum  number  of  bits  needed  for  a  uniform  quantizer. 

(b)  If  the  sampling  rate  must  be  20%  higher  than  the  Nyquist  rate,  find  the  minimum  bit  rate  of 
the  multiplexed  data  stream  based  on  the  quantizer  of  part  (a). 

(c)  If  5%  more  bits  are  added  to  the  multiplexed  data  for  error  protection  and  synchronization, 
determine  the  minimum  bandwidth  needed  to  transmit  the  final  data  stream  to  receivers. 

6.2- 4  A  message  signal  m(t)  is  normalized  to  peak  voltages  of  ±1  V.  The  average  message  power 

equals  20  mW.  To  transmit  this  signal  by  binary  PCM  without  compression,  uniform  quantization 
is  adopted.  If  the  SQNR  is  required  to  be  at  least  43  dB,  determine  the  minimum  number  of  bits 
required  to  code  the  uniform  quantizer.  Determine  the  SNR  obtained  with  this  quantizer. 

6.2- 5  Repeat  Prob.  6.2-4  if  the  message  signal  is  given  by  Fig.  P2.8-2(d). 

6.2- 6  Repeat  Prob.  6.2-4  if  the  message  signal  is  of  the  form  in  Fig.  P2.8-4(b). 

6.2- 7  Repeat  Prob.  6.2-4  if  a  /i-law  compandor  is  applied  with  \x  =  100  to  achieve  a  nonuniform 

quantizer. 

6.2- 8  For  a  PCM  signal,  determine  L  if  the  compression  parameter  n  =  100  and  the  minimum  SNR 

required  is  45  dB.  Determine  the  output  SNR  with  this  value  of  L.  Remember  that  L  must  be  a 

power  of  2,  that  is,  L  =  2"  for  a  binary  PCM. 

6.2- 9  Five  sensor  output  signals,  each  of  bandwidth  240  Hz.  are  to  be  transmitted  simultaneously  by 

binary  PCM.  The  signals  must  be  sampled  at  least  20%  above  the  Nyquist  rate.  Framing  and 
synchronizing  requires  an  additional  0.5%  extra  bits.  The  PCM  encoder  of  Prob.  6.2-7  is  used 
to  convert  these  signals  before  they  are  time-multiplexed  into  a  single  data  stream.  Determine 
the  minimum  possible  data  rate  (bits  per  second)  that  must  be  transmitted,  and  the  minimum 
bandwidth  required  to  transmit  the  multiplex  signal. 


6.2-10  A  signal  band-limited  to  1  MHz  is  sampled  at  a  rate  25%  higher  than  the  Nyquist  rate  and  quantized 
into  256  levels  using  a  /i-law  quantizer  with  /r  =  255. 
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(a)  Determine  the  signal-to-quantization-noise  ratio. 

(b)  The  SNR  (the  received  signal  quality)  found  in  part  (a)  was  unsatisfactory.  It  must  be  increased 
at  least  by  10  dB.  Would  you  be  able  to  obtain  the  desired  SNR  without  increasing  the 
transmission  bandwidth  if  a  sampling  rate  20%  above  the  Nyquist  rate  were  found  to  be 
adequate?  If  so,  explain  how.  What  is  the  maximum  SNR  that  can  be  realized  in  this  way? 

6.2-11  The  output  SNR  of  a  13-bit  PCM  was  found  to  be  insufficient  at  30  dB.  To  achieve  the  desired 

SNR  of  42  dB,  it  was  decided  to  increase  the  number  of  quantization  levels  L.  Find  the  fractional 

increase  in  the  transmission  bandwidth  required  for  this  increase  in  L. 

6.7- 1  In  a  single-integration  DM  system,  the  voice  signal  is  sampled  at  a  rate  of  64  kHz,  similar  to 

PCM.  The  maximum  signal  amplitude  is  normalized  as  Amax  =  1. 

(a)  Determine  the  minimum  value  of  the  step  size  E  to  avoid  slope  overload. 

(b)  Determine  the  granular  noise  power  N0  if  the  voice  signal  bandwidth  is  3.4  kHz. 

(c)  Assuming  that  the  voice  signal  is  sinusoidal,  determine  the  output  signal/power  S0  and  the 
SNR. 

(d)  Assuming  that  the  voice  signal  amplitude  is  uniformly  distributed  in  the  range  (—1,  1). 
determine  S0  and  the  SNR. 

(e)  Determine  the  minimum  transmission  bandwidth. 

6.7- 2  Consider  a  message  signal 


m(t)  =e  21  cos  2000jrf  •  u(t) 

(a)  Determine  the  minimum  step  size  E  necessary  to  avoid  slope  overload. 

(b)  Calculate  the  minimum  average  quantization  noise  power  based  on  part  (a). 


7 


PRINCIPLES  OF  DIGITAL  DATA 
TRANSMISSION 


hroughout  most  of  the  twentieth  century,  a  significant  percentage  of  communication 


systems  was  in  analog  form.  However,  by  the  end  of  the  1990s,  the  digital  format  began 


A  to  dominate  most  applications.  One  does  not  need  to  look  far  to  witness  the  continuous 
migration  from  analog  to  digital  communications:  from  audiocassette  tape  to  MP3  and  CD. 
from  NTSC  analog  TV  to  digital  HDTV,  from  traditional  telephone  to  VoIP,  and  from  VHS 
videotape  to  DVD.  In  fact,  even  the  last  analog  refuge  of  broadcast  radio  is  facing  a  strong 
digital  competitor  in  the  form  of  satellite  radio.  Given  the  dominating  importance  of  digital 
communication  systems  in  our  lives  today,  it  is  never  too  early  to  study  the  basic  principles 
and  various  aspects  of  digital  data  transmission,  as  we  will  do  in  this  chapter. 

This  chapter  deals  with  the  problems  of  transmitting  digital  data  over  a  channel.  Hence, 
the  starting  messages  are  assumed  to  be  digital.  We  shall  begin  by  considering  the  binary  case, 
where  the  data  consist  of  only  two  symbols:  1  and  0.  We  assign  a  distinct  waveform  (pulse) 
to  each  of  these  two  symbols.  The  resulting  sequence  of  these  pulses  is  transmitted  over  a 
channel.  At  the  receiver,  these  pulses  are  detected  and  are  converted  back  to  binary  data  (Is 
and  Os). 


7.1  DIGITAL  COMMUNICATION  SYSTEMS 


A  digital  communication  system  consists  of  several  components,  as  shown  in  Fig.  7.1.  In 
this  section,  we  conceptually  outline  their  functionalities  in  the  communication  systems.  The 
details  of  their  analysis  and  design  will  be  given  in  dedicated  sections  later  in  this  chapter. 


7.1.1  Source 

The  input  to  a  digital  system  is  in  the  form  of  a  sequence  of  digits.  The  input  could  be  the 
output  from  a  data  set,  a  computer,  or  a  digitized  audio  signal  (PCM,  DM,  or  LPC),  digital 
facsimile  or  HDTV,  or  telemetry  data,  and  so  on.  Although  most  of  the  discussion  in  this  chapter 
is  confined  to  the  binary  case  (communication  schemes  using  only  two  symbols),  the  more 
general  case  of  M-ary  communication,  which  uses  M symbols,  will  also  be  discussed  in  Secs.  7.7 

and  7.9. 
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Figure  7.1 

Fundamental 
building  blocks 
of  digital 
communication 
systems. 
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7.1.2  Line  Coder 

The  digital  output  ot  a  source  encoder  is  converted  (or  coded)  into  electrical  pulses  (wave- 
orms)  lor  the  purpose  ot  transmission  over  the  channel.  This  process  is  called  line  coding 
or  transrn.ssion.codmg.  There  are  many  possible  ways  of  assigning  waveforms  (pulses)  to 

code  U  a  tr*  h  '  6  1.nar^  CUSe  symbols)’  f°r  example,  conceptually  the  simplest  line 
t/em  l„  n  ?erC  a  'S  by  3  PUlSC  P(t)  and  a  0  is  transmitted  by  no  pulse 

mined  hv  '  i  S  TT  m  ■  ^notber  commonly  used  code  is  polar,  where  1  is  trans- 

he  rn  ,  ,  eff  0  ,S  transmitted  by  a  pulse  -pit)  (Fig.  7.2b).  The  polar  scheme  is 

(em'  r^ d obabinl "  requires  the  least  Power  for  a  given  noise  immunity 
TZnate  mlrt  T  1*  COde  in  PCM  is  bipolar,  also  known  as  pseudoternary 

TpuUe  m  ,  o7  in  V,erS,°  r  (AMI>’  WhCre  °  is  encoded  by  "o  pulse  and  1  is  encoded  by 
Zr,  Dulses  ren~re  t  PC  8  °"  Whether  **  previous  1  »  encoded  by  -p(t)  or  pit).  I" 

has  disadvantage  t  hTt  f §  COnSeC.Ut,ve  ls  altemate  in  sign,  as  shown  in  Fig.  7.2c.  This  code 
has  advantage  that  tf  one  smgle  error  is  made  in  the  detecting  of  pulses,  the  received  pulse 
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sequence  will  violate  the  bipolar  rule  and  the  error  can  be  detected  (although  not  corrected) 
immediately.* 

Another  line  code  that  appeared  promising  earlier  is  the  duobinary  (and  modified  duobi¬ 
nary)  proposed  by  Lender.1,2  This  code  is  better  than  the  bipolar  in  terms  of  bandwidth 
efficiency.  Its  more  prominent  variant,  the  modified  duobinary  line  code,  has  seen  applica¬ 
tions  in  hard  disk  drive  read  channels,  in  optical  10  Gbit/s  transmission  for  metronetworks, 
and  in  the  first-generation  modems  for  integrated  services  digital  networks  (ISDN).  Details  of 
duobinary  line  codes  will  be  discussed  later  in  this  chapter. 

In  our  discussion  so  far,  we  have  used  half-width  pulses  just  for  the  sake  of  illustration.  We 
can  select  other  widths  also.  Full-width  pulses  are  often  used  in  some  applications.  Whenever 
full-width  pulses  are  used,  the  pulse  amplitude  is  held  to  a  constant  value  throughout  the 
pulse  interval  (i.e.,  it  does  not  have  a  chance  to  go  to  zero  before  the  next  pulse  begins). 
For  this  reason,  these  schemes  are  called  non-return-to-zero  or  NRZ  schemes,  in  contrast 
to  return-to-zero  or  RZ  schemes  (Fig.  7.2a-c).  Figure  7.2d  shows  an  on-off  NRZ  signal, 
whereas  Fig.  7.2e  shows  a  polar  NRZ  signal. 


7. 1 .3  Multiplexer 

Generally  speaking,  the  capacity  of  a  physical  channel  (e.g.,  coaxial  cable,  optic  fiber)  for 
transmitting  data  is  much  larger  than  the  data  rate  of  individual  sources.  To  utilize  this  capac¬ 
ity  effectively,  we  combine  several  sources  by  means  of  a  digital  multiplexer.  The  digital 
multiplexing  can  be  achieved  through  frequency  division  or  time  division,  as  we  have  already 
discussed.  Alternatively,  code  division  is  also  a  practical  and  effective  approach  (to  be  discussed 
in  Chapter  1 2).  Thus  a  physical  channel  is  normally  shared  by  several  messages  simultaneously. 


7. 1 .4  Regenerative  Repeater 

Regenerative  repeaters  are  used  at  regularly  spaced  intervals  along  a  digital  transmission  line 
to  detect  the  incoming  digital  signal  and  regenerate  new  “clean”  pulses  for  further  transmission 
along  the  line.  This  process  periodically  eliminates,  and  thereby  combats,  accumulation  of  noise 
and  signal  distortion  along  the  transmission  path.  The  ability  of  such  regenerative  repeaters 
to  effectively  eliminate  noise  and  signal  distortion  effects  is  one  of  the  biggest  advantages  of 
digital  communication  systems  over  their  analog  counterparts. 

If  the  pulses  are  transmitted  at  a  rate  of  Rb  pulses  per  second,  we  require  the  periodic 
timing  information— the  clock  signal  at  Rb  Hz— to  sample  the  incoming  pulses  at  a  repeater. 
This  timing  information  can  be  extracted  from  the  received  signal  itself  if  the  line  code  is 
chosen  properly.  When  the  RZ  polar  signal  in  Fig.  7.2b  is  rectified,  for  example,  it  results  in  a 
periodic  signal  of  clock  frequency  Rh  Hz,  which  contains  the  desired  periodic  timing  signal  of 
frequency  Rb  Hz.  When  this  signal  is  applied  to  a  resonant  circuit  tuned  to  frequency  Rh ,  the 
output,  which  is  a  sinusoid  of  frequency  Rb  Hz,  can  be  used  for  timing.  The  on-off  signal  can 
be  expressed  as  a  sum  of  a  periodic  signal  (of  clock  frequency)  and  a  polar,  or  random,  signal 
as  shown  in  Fig.  7.3.  Because  of  the  presence  of  the  periodic  component,  we  can  extract  the 
timing  information  from  this  signal  by  using  a  resonant  circuit  tuned  to  the  clock  frequency.  A 
bipolar  signal,  when  rectified,  becomes  an  on-off  signal.  Hence,  its  timing  information  can  be 
extracted  using  the  same  way  as  that  tor  an  on-olt  signal. 


*  This  assumes  no  more  than  one  error  in  sequence.  Multiple  errors  in  sequence  could  cancel  their  respective  effects 
and  remain  undetected.  However,  the  probability  of  multiple  errors  is  much  smaller  than  that  of  single  errors.  Even 
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Figure  7.3 

An  on-off  signal 
(a)  is  a  sum  of  a 
random  polar 
signal  (b)  and  a 
clock  frequency 
periodic 
signal  (c). 
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The  timing  signal  (the  resonant  circuit  output)  is  sensitive  to  the  incoming  bit  pattern.  In 
the  on-off  or  bipolar  case,  a  0  is  transmitted  by  ‘no  pulse.-  Hence,  if  there  are  too  many  Os  in  a 
sequence  (no  pulses),  there  is  no  signal  at  the  input  of  the  resonant  circuit  and  the  sinusoidal 
output  of  the  resonant  circuit  starts  decaying,  thus  causing  error  in  the  timing  information.  We 
shall  discuss  later  ways  of  overcoming  this  problem.  A  line  code  in  which  the  bit  pattern  does 
not  altect  the  accuracy  ot  the  timing  information  is  said  to  be  a  transparent  line  code.  The  RZ 
po  ar  scheme  (where  each  bit  is  transmitted  by  some  pulse)  is  transparent,  whereas  the  on-off 
and  bipolar  are  nontransparent. 


7.2  LINE  CODING 


.*}  a  <|dn  ^transmitted  by  various  transmission  or  line  codes.  We  have  given  example 

(I sdr  ihl  P°  'P°lai' Each  lme  code  has  its  advantages  and  disadvantages.  Among  otht 

desirable  propert.es,  a  lme  code  should  have  the  following  properties. 

•  Transmission  bandwidth  should  be  as  small  as  possible. 

’  ooZ SES'  F7  3  giVe"  bandWidth  and  3  SPecified  ^echon  error  rate,  the  transmitte 
power  should  be  as  low  as  possible. 

"  is  desirablc  »  detect,  and  preferably  come 

and  can  easily  to-  Ini'  in  IOf  examP*e-  a  single  error  will  cause  bipolar  violatio 
and  can  easrly  be  de.ee, ed.  Error  correcring  codes  will  be  discussed  in  deprh  in  Chaprer  15 

'  - '' lsdKiraW'=,ota«e2er„powerspec,raldensi>y(PSD). 

iwer  ta  lo^~  “P  8  ™  »fK"  used  a,  the  repealers.-  Signified 

the  pulse  slream  when  L  coIph"ngTs  used'1'  ^  aVO‘ded  bWaUSe  “  caUSeS  *  Wande'  “ 
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modulated 
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generation  from 
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Adequate  timing  content.  It  should  be  possible  to  extract  timing  or  clock  information  from 
the  signal. 

Transparency .  It  should  be  possible  to  correctly  transmit  a  digital  signal  regardless  of  the 
pattern  ot  Is  and  Os.  We  saw  earlier  that  a  long  string  of  Os  could  cause  problems  in  timing 
extraction  for  the  on-off  and  bipolar  cases.  A  code  is  transparent  if  the  data  are  so  coded  that 
for  every  possible  sequence  of  data,  the  coded  signal  is  received  faithfully. 


7.2. 1  PSD  of  Various  Line  Codes 

In  Example  3.23  we  discussed  a  procedure  for  finding  the  PSD  of  a  polar  pulse  train.  We  shall 
use  a  similar  procedure  to  find  a  general  expression  for  PSD  of  the  baseband  modulation  (line 
coding)  output  signals  as  shown  in  Fig.  7.1.  In  particular,  we  directly  apply  the  relationship 
between  the  PSD  and  the  autocorrelation  function  of  the  baseband  modulation  signal  given  in 
Section  3.8  [Eq.  (3.86)]. 

In  the  following  discussion,  we  consider  a  generic  pulse  p(t)  whose  corresponding  Fourier 
transform  is  P(f).  We  can  denote  the  line  code  symbol  at  time  k  as  a^ .  When  the  transmission 
rate  is  Rf,  =  1  /Tt>  pulses  per  second,  the  line  code  generates  a  pulse  train  constructed  from  the 
basic  pulse  p(t)  with  amplitude  a*  starting  at  time  t  =  kTb\  in  other  words,  the  kth  symbol  is 
transmitted  as  a^pit  -  kTf>).  Figure  7.4a  provides  an  illustration  of  a  special  pulse  /?(/),  whereas 
Fig.  7.4b  shows  the  corresponding  pulse  train  generated  by  the  line  coder  at  baseband.  As  shown 
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in  Fig.  7.4(b),  counting  a  succession  of  symbol  transmissions  Tf,  second  apart,  the  baseband 
signal  is  a  pulse  train  of  the  form 

y(t)  =  YjakP{t-kTh)  (7.1) 

Note  that  the  line  coder  determines  the  symbol  { a k)  as  the  amplitude  of  the  pulse  p(t  -  kTb). 

The  values  a k  are  random  and  depend  on  the  line  coder  input  and  the  line  code  itself; 
y(r)  is  a  pulse-amplitude-modulated  (PAM)  signal.  The  on-off,  polar,  and  bipolar  line  codes 
are  all  special  cases  of  this  pulse  train  y(r),  where  ak  takes  on  values  0,  1,  or  —1  randomly, 
subject  to  some  constraints.  We  can,  therefore,  analyze  many  line  codes  according  to  the  PSD 
of  y(t).  Unfortunately,  the  PSD  of  y(t)  depends  on  both  ak  and  /?(/).  If  the  pulse  shape  p(t) 
changes,  we  may  have  to  derive  the  PSD  all  over  again.  This  difficulty  can  be  overcome  by  the 
simple  artifice  of  selecting  a  PAM  signal  x(t)  that  uses  a  unit  impulse  for  the  basic  pulse  p(t) 
(Fig.  7.4c).  The  impulses  are  at  the  intervals  of  7),  and  the  strength  (area)  of  the  Jtth  impulse 
is  a k.  If  x(t)  is  applied  to  the  input  of  a  filter  that  has  a  unit  impulse  response  h(t)  =  p{t) 
(Fig.  7.4d),  the  output  will  be  the  pulse  train  y(t)  in  Fig.  7.4b.  Also,  applying  Eq.  (3.92),  the 
PSD  of  y(t)  is 

Sy(f)  =  \P(f)\2Sx(f) 

This  relationship  allows  us  to  determine  Sy(f ),  the  PSD  of  a  line  code  corresponding  to  any 
pulse  shape  p(t),  once  we  know  Sx(f).  This  approach  is  attractive  because  of  its  generality. 

We  now  need  to  derive  TZx(r)y  the  time  autocorrelation  function  of  the  impulse  train  jc(/). 
This  can  be  conveniently  done  by  considering  the  impulses  as  a  limiting  form  of  the  rectangular 
pulses,  as  shown  in  Fig.  7.5a.  Each  pulse  has  a  width  e  0,  and  the  kth  pulse  height 

u  °k 
"k  = - ►  00 

€ 

This  way,  we  guarantee  that  the  strength  of  the  *th  impulse  is  ak,  or 

thk  =  ak 

If  we  designate  the  corresponding  rectangular  pulse  train  by  x (t).  then  by  definition  [Eq.  (3.83) 
in  Section  3.8] 


fti(r)  =  lim 

T-±oo 


l  rW 

T  /  J 
I  J—T/2 


x(t)x(t  -  z)  dt 


(7.2) 


Because  %(r)  is  an  even  function  of  r  [Eq.  (3.84)],  we  need  to  consider  only  positive  r.  T 
begin  with,  consider  the  case  of  r  <  e.  In  this  case  the  integral  in  Eq.  (7.2)  is  the  area  under  th 

wfthlhe  JLt r*  ?P  iS' m  de'ayed  by  T(T  <  f)- As  seen  from  Fig-  7-5b,  the  area  associate 

with  the  kth  pulse  is  hfc-r),  and 
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Figure  7.5 
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where 


During  the  averaging  interval  T 


(T  -*■  oo),  there  are  N  pulses  (N 


oo),  where 


(7.3b) 


(7.4) 
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and  from  Eq.  (7.3b) 


(7.5) 


Observe  that  the  summation  is  over  N  pulses.  Hence,  Ro  is  the  time  average  of  the  square  of 
the  pulse  amplitudes  a *.  Using  our  time  average  notation,  we  can  express  Rq  as 


(7.6) 


We  also  know  that  TZ;x(t)  is  an  even  function  of  r  [see  Eq.  (3.84)].  Hence,  Eq.  (7.3)  can  be 
expressed  as 


(7.7) 


This  is  a  triangular  pulse  of  height  Ro/eTh  and  width  2e  centered  at  r  =  0  (Fig.  7.5d).  This 
is  expected  because  as  r  increases  beyond  e,  there  is  no  overlap  between  the  delayed  signal 
~  ^  ant^  *0)*  hence,  7Z-x(t)  =  0,  as  seen  from  Fig.  7.5d.  But  as  we  increase  r  further, 
we  find  that  the  £th  pulse  of  x(t  -  r)  will  start  overlapping  the  ( k  +  l)th  pulse  of  Jc(r)  as  r 
approaches  7/,  (Fig.  7.5c).  Repeating  the  earlier  argument,  we  see  that '/?.;  ( r)  will  have  another 
triangular  pulse  of  width  2e  centered  at  r  =  Th  and  of  height  R\/eTh  where 


k 

=  Amoo 

k 


—  akak+ 1 


of  triangular  pulses  of  width  2e  centered  at 
centered  at  ±nTh  is  Rn/(Tb,  where 


"rSo  7  £“*«*+» 


k 


k 


“  °kak+n 


R„  is  essentially  the  discrete  aulocotrelahon  function  of  the  line  code  symbols  {« |. 
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To  find  TZx(t),  we  let  €  — >  0  in  Tix(r).  As  €  — *  0,  the  width  of  each  triangular  pulse— ►  0 
and  the  height-*  oo  in  such  a  way  that  the  area  is  still  finite.  Thus,  in  the  limit  as  €  -*  0,  the 
triangular  pulses  become  impulses.  For  the  /ith  pulse  centered  at  nTby  the  height  is  Rn/eTh  and 
the  area  is  Rn/Tb.  Hence,  (Fig.  7.5e) 


j  w 

nx{r)=—  Y  R«&(r-nTb) 

ib  n——oo 

The  PSD  Sx(f )  is  the  Fourier  transform  of  7£t(r).  Therefore, 


1  00 

sx(f)  =  -  Y  R»e-jn2nFh 

b  n—- oo 

Recognizing  that  /?_„  =  Rn  [because  TZ(r)  is  an  even  function  of  r],  we  have 

oo 

R()  +  2  Y,  R»  cos  n2nJT, >, 
n=l 


(7.8) 


(7.9) 


(7.10) 


The  input  x(t)  to  the  filter  with  impulse  response  h(t)  =  p(t)  results  in  the  output  v(r),  as 
shown  in  Fig.  7.4d.  If  p(t)  <=>  P(f ),  the  transfer  function  of  the  filter  is  H(f)  =  P(f),  and 
according  to  Eq.  (3.92) 


Sy(f)  = 


\P(f)\2Sx(f) 


\p(f)\2 

Tb 


Y  R»e 


_n=—oo 


-jnlnjTb 


\P{f)\2 

Tb 


OO 

R„  +  2  Y,  Rn  cos  ”271 -fT b 

n=  1 


(7.11a) 

(7.11b) 

(7.11c) 


Thus,  the  PSD  of  a  line  code  is  fully  characterized  by  its  Rn  and  the  pulse-shaping  selection 
P(f).  We  shall  now  use  this  general  result  to  find  the  PSDs  of  various  specific  line  codes  by 
first  determining  the  symbol  autocorrelation  /?„. 


7.2.2  Polar  Signaling 

In  polar  signaling.  1  is  transmitted  by  a  pulse  p(t)  and  0  is  represented  by  -p(t).  In  this  case, 
a/c  is  equally  likely  to  be  1  or  —  1,  and  is  always  1 .  Hence, 


There  are  N  pulses  and  aj  =  1  for  each  one.  and  the  summation  on  the  right-hand  side  above 
is  N.  Hence, 
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Moreover,  both  a*  and  1  are  either  1  or  —1.  Hence,  is  either  1  or  —  1.  Because  the 

pulse  amplitude  a*  is  equally  likely  to  be  1  and  - 1  on  the  average,  out  of  N  terms  the  product 
1  is  equal  to  1  for  N /2  terms  and  is  equal  to  —  1  for  the  remaining  N /2  terms.  Therefore, 


Possible  Values  of 


a* 

Ok  + 1 

-1 

+1 

1  [N 

*i=  tan  -  T(l)  + 

N-*-  oo  N  |_  2 

N  1 

-(-1)  =0  (7.12b) 

-1 

1 

-1 

2  J 

+1 

-1  1 

Arguing  this  way,  we  see  that  the  product  akak+n  is  also  equally  likely  to  be  1  or  -1 .  Hence, 


Therefore  from  Eq.  (7.1  lc) 


Rn  =  0  n  >  1 


‘b 


(7.12c) 


Th 


(7.13) 


For  the  sake  ot  comparison  of  various  schemes,  we  shall  consider  a  specific  pulse  shape. 
Let  p(t)  be  a  rectangular  pulse  of  width  T/,/ 2  (half-width  rectangular  pulse),  that  is. 


and 


Therefore 


p{f)  =  T±  sinc 


Syif)  =  sine2 


(7.14) 


(7.15) 


Figure  7  6  shows  the  spectrum  Sy(f).  It  is  clear  that  the  polar  signal  has  most  of  its  power  con¬ 
centrated  in  lower  frequencies.  Theoretically,  the  spectrum  becomes  very  small  as  frequency 
increases  but  never  becomes  totally  zero  above  a  certain  frequency.  To  define  a  meaningful  mea¬ 
sure  ot  bandwidth,  we  consider  its  first  non-dc  null  frequency  to  be  its  essential  bandwidth.’ 

From  polar  signal  spectrum,  the  essential  bandwidth  of  the  signal  is  seen  to  be  2Rb  & 
(where  Rh  is  the  clock  frequency).  This  is  4  times  the  theoretical  bandwidth  (Nyquist  band- 
width)  required  to  transmit  Rh  pulses  per  second.  Increasing  the  pulse  width  reduces  the 
bandwidth  (expansion  in  the  time  domain  results  in  compression  in  the  frequency  domain). 

frequency  above  dc  is  no,  always  a  good  measure  of  signal 

contained  in  the  main  (or  first)  lobe  of  the'  PSdTw!  w  ll^TaS’i^KsD  ^  ““  1  ^"“Kfcodes 
(Fig.  7.9).  in  mos,  practical  cases,  this  approximation  is  accept 


Figure  7.6 

Power  spectral 
density  of  a 
polar  signal. 
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Figure  7.6 


w 


density  of  a 
polar  signal. 


Power  spectral 


Tb/4 


-4 Rh  -3 Rh  -2Rh  -Rh  0  Rb  2 Rh  3 Rh  4 Rh 


For  a  full-width  pulse*  (maximum  possible  pulse  width),  the  essential  bandwidth  is  half,  that 
is  Rb  Hz.  This  is  still  twice  the  theoretical  bandwidth.  Thus,  polar  signaling  is  not  the  most 
bandwidth  efficient. 

Second,  polar  signaling  has  no  capability  for  error  detection  or  error  correction.  A  third 
disadvantage  of  polar  signaling  is  that  it  has  nonzero  PSD  at  dc  (f  =  0).  This  will  rule  out  the  use 
of  ac  coupling  during  transmission.  The  ac  mode  of  coupling,  which  permits  transformers  and 
blocking  capacitors  to  aid  in  impedance  matching  and  bias  removal,  and  allows  dc  powering 
of  the  line  repeaters  over  the  cable  pairs,  is  very  important  in  practice.  Later,  we  shall  show 
how  a  PSD  of  a  line  code  may  be  forced  to  zero  at  dc  by  properly  shaping  p(t). 

On  the  positive  side,  polar  signaling  is  the  most  efficient  scheme  from  the  power  require¬ 
ment  viewpoint.  For  a  given  power,  it  can  be  shown  that  the  error-detection  probability  for  a 
polar  scheme  is  the  lowest  among  all  signaling  techniques  (see  Chapter  11).  Polar  signaling  is 
also  transparent  because  there  is  always  some  pulse  (positive  or  negative)  regardless  of  the  bit 
sequence.  There  is  no  discrete  clock  frequency  component  in  the  spectrum  of  the  polar  signal. 
Rectification  of  the  RZ  polar  signal,  however,  yields  a  periodic  signal  of  clock  frequency  and 
can  readily  be  used  to  extract  timing. 


7.2.3  Constructing  a  DC  Null  in  PSD  by  Pulse  Shaping 

Because  Sy(f),  the  PSD  of  a  line  code  contains  a  factor  \P(f)\2,  we  can  force  the  PSD  to  have 
a  dc  null  by  selecting  a  pulse  p(t)  such  that  P(f)  is  zero  at  dc  (f  =  0).  Because 


*  Scheme  using  the  full-width  pulse  p(t)  =  n (t/Tb)  is  an  example  of  a  non-retum-to-zero  (NRZ)  scheme.  The 
half-width  pulse  scheme,  on  the  other  hand,  is  an  example  of  a  retum-to-zero  (RZ)  scheme. 
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Figure  7.7 

Split-phase 
(Manchester  or 
twinned-binary) 
signal,  (a)  Basic 
pulse  p(t)  for 
Manchester 
signaling. 

(b)  Transmitted 
waveform  for 
binary  data 
sequence  using 
Manchester 
signaling. 


pit) 

n 

0 

U 

(a) 


I  0  0 


0  1 


0  0  0 


uiniLFinjuiniLT . 


(b) 


I  Tb 


we  have 


P( 0)  = 


p(t)  dt 


Hence,  if  the  area  under  p(t)  is  made  zero,  P( 0)  is  zero,  and  we  have  a  dc  null  in  the  PSD.  For 
a  rectangular  pulse,  one  possible  shape  oi  p(t)  to  accomplish  this  is  shown  in  Fig.  7.7a.  When 
we  use  this  pulse  with  polar  line  coding,  the  resulting  signal  is  known  as  Manchester  code,  or 
split-phase  (also  called  twinned-binary),  signal.  The  reader  can  use  Eq.  (7.13),  to  show  that 
tor  this  pulse,  the  PSD  of  the  Manchester  line  code  has  a  dc  null  (see  Prob.  7.2-4). 


7.2.4  On-Off  Signaling 

In  on-off  signaling,  a  1  is  transmitted  by  a  pulse  p{t)  and  a  0  is  transmitted  by  no  pulse.  Hence, 
a  pulse  strength  ak  is  equally  likely  to  be  1  or  0.  Out  of  N  pulses  in  the  interval  of  T  seconds. 
ak  is  1  lor  <V/_  pulses  and  is  0  tor  the  remaining  N /2  pulses  on  the  average.  Hence, 


Ro  =  lim 

N->  oo 


^  [la»J + f  <°)2] = I 


(7.16) 


To  compute  R„  we  need  to  consider  the  product  akak+n.  Since  ak  and  ak+n  are  equally  likelj 

*°  nn  ^r^'the  Product  ak<*k+n  is  equally  likely  to  be  1  x  1,  1  x  0,  0  x  1  or  0  x  0,  that  is 

’  ;  '  °*  Theretore  on  the  average,  the  product  akak+n  is  equal  to  1  for  N /4  terms  and  0  foi 
3jV/4  terms  and  1 


n  >  1 


Therefore,  [Eq.  (7.9)] 


(7.17) 


s‘v: 1  =  sk  +  ik  t 


n=~  00 


-  sk  +  jk  E  e~1"2nrrt 


n—- 00 


(7.18a) 

(7.18b) 
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Equation  (7.18b)  is  obtained  from  Eq.  (7.18a)  by  splitting  the  term  1/27),  corresponding  to 
/?o  into  two.  1/47),  outside  the  summation  and  1/47),  inside  the  summation  (corresponding  to 
n  =  0).  We  now  use  the  formula  (see  the  footnote  for  a  proof*) 


Substitution  of  this  result  in  Eq.  (7.18b)  yields 


(7.19a) 


and  the  desired  PSD  of  the  on-off  waveform  y(t)  is  [from  Eq.  (7. 1  la)] 


(7.19b) 


Note  that  unlike  the  continuous  PSD  spectrum  of  polar  signaling,  the  on-off  PSD  of  Eq.  (7. 1 9b) 
also  has  an  additional  discrete  part.  This  discrete  part  may  be  nullified  if  the  pulse  shape  is 
chosen  such  that 


n  =  0,  ±1,... 

For  the  example  case  of  a  half-width  rectangular  pulse  [see  Eq.  (7.1 4)J, 


(7.20) 


The  resulting  PSD  is  shown  in  Fig.  7.8.  The  continuous  component  of  the  spectrum  is 
(Tb/ 16)  sine2  (7t/T/7/2).  This  is  identical  (except  for  a  scaling  factor)  to  the  spectrum  of  the 
polar  signal  [Eq.  (7.15)].  The  discrete  component  is  represented  by  the  product  of  an  impulse 
train  with  the  continuous  component  (7),/ 16)  sine2  (7ijTb/2).  Hence  this  component  appears 
as  periodic  impulses  with  the  continuous  component  as  the  envelope.  Moreover,  the  impulses 
repeat  at  the  clock  frequency  Rb  =  1/7*  because  its  fundamental  frequency  is  2n/Th  rad/s,  or 
1  /7*  Hz.  This  is  a  logical  result  because  as  Fig.  7.3  shows,  an  on-off  signal  can  be  expressed 
as  a  sum  of  a  polar  and  a  periodic  component.  The  polar  component  vi(f)  is  exactly  half 


*  The  impulse  train  in  Fig.  3.24a  of  Example  3.13  is  Sj^(t)  =  Xn^-oc  &(t  ~  n^b )•  Moreover,  the  Fourier  series  for 
this  impulse  train  as  found  in  Eq.  (2.100)  is 


We  take  the  Fourier  transform  of  both  sides  of  this  equation,  and  use  the  fact  that  8(t  -  nTb)  <=>  e  J^JTb  and 
gjnlnRfjt  ^  _  nRb)  xhis  yields 
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Figure  7.8 

Power  spectral 
density  (PSD)  of 
an  on-off  signal. 


the  polar  signal  discussed  earlier.  Hence,  the  PSD  of  this  component  is  one-fourth  the  PSD  in 
Eq.  (7. 15).  The  periodic  component  is  of  clock  frequency  R b\  it  consists  of  discrete  components 
of  frequency  R/,  and  its  harmonics. 

On-off  signaling  has  very  little  to  brag  about.  For  a  given  transmitted  power,  it  is  less 
immune  to  noise  interference  than  the  polar  scheme,  which  uses  a  positive  pulse  for  1  and  a 
negative  pulse  for  0.  This  is  because  the  noise  immunity  depends  on  the  difference  of  ampli' 
tudes  representing  1  and  0.  Hence,  for  the  same  immunity,  if  on-off  signaling  uses  pulses  of 
amplitudes  2  and  0,  polar  signaling  need  use  only  pulses  of  amplitudes  1  and  -1.  It  is  simple 
to  show  that  on-olt  signaling  requires  twice  as  much  power  as  polar  signaling.  If  a  pulse  of 
amplitude  1  or  - 1  has  energy  E,  then  the  pulse  of  amplitude  2  has  energy  (2 )2E  =  4 E.  Because 
1  /Tb  digits  are  transmitted  per  second,  polar  signal  power  is  (£)(  1  /Tb)  =  E/Tb-  For  the  on-off 
case,  on  the  other  hand,  each  pulse  energy  is  4 E,  though  on  average  such  a  pulse  is  transmitted 
over  halt  of  the  time  while  nothing  is  transmitted  over  the  other  half.  Hence,  the  average  signal 
power  of  on-off  is 


wh.ch  is  twice  that  required  for  the  polar  signal.  Moreover,  unlike  the  polar  case,  on-off 
signaling  is  not  transparent.  A  long  string  of  Os  (or  offs)  causes  the  absence  of  a  signal  and 
can  lead  to  errors  in  timing  extraction.  In  addition,  all  the  disadvantages  of  polar  signaling, 
(e.g.,  excessive  transmission  bandwidth,  nonzero  power  spectrum  at  dc  no  error  detection  (or 
correction)  capability  are  also  present  in  on-off  signaling. 


7.2.5  Bipolar  Signaling 

The  signaling  scheme  used  in  PCM  for  telephone  networks  is  called  bipolar  (pseudoternan 
or  alternate  mark  inverted).  A  0  is  transmitted  by  no  pulse,  and  a  1  is  transmitted  by  a  pul* 
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p(t)  or  - p(t ),  depending  on  whether  the  previous  1  was  transmitted  by  -p(t)  or  p(t).  With 
consecutive  pulses  alternating,  we  can  avoid  dc  wander  and  thus  cause  a  dc  null  in  the  PSD. 
Bipolar  signaling  actually  uses  three  symbols  [p(t),  0,  and  —  p(t)],  and,  hence,  it  is  in  reality 
ternary  rather  than  binary  signaling. 

To  calculate  the  PSD,  we  have 


k 


On  the  average,  half  of  the  aks  are  0,  and  the  remaining  half  are  either  1  or  - 1 ,  with  a?  =  1 . 
Therefore, 


To  compute  R i,  we  consider  the  pulse  strength  product  There  are  four  equally 

likely  sequences  of  two  bits:  11,  10,  01,  00.  Since  bit  0  is  encoded  by  no  pulse  (ak  =  0), 
the  product  akak+ \  is  zero  for  the  last  three  of  these  sequences.  This  means,  on  the  average, 
that  3N/4  combinations  have  akak+\  =  0  and  only  N  /4  combinations  have  nonzero  akak+ j. 
Because  of  the  bipolar  rule,  the  bit  sequence  11  can  be  encoded  only  by  two  consecutive 
pulses  of  opposite  polarities.  This  means  the  product  akak+\  =  —  1  for  the  N /4  combinations. 
Therefore 


To  compute  R2  in  a  similar  way,  we  need  to  observe  the  product  akak+ 2.  For  this,  we  need 
to  consider  all  possible  combinations  of  three  bits  in  sequence.  There  are  eight  equally  likely 
combinations:  111,  101,  110,  100,  011,  010,  001,  000.  The  last  six  combinations  have  either 
the  first  and/or  the  last  bit  0.  Hence  akak+2  =  0  for  all  these  six  combinations.  The  first  two 
combinations  are  the  only  ones  that  yield  nonzero  akak+ 2.  From  the  bipolar  rule,  the  first 
and  the  third  pulses  in  the  combination  111  are  of  the  same  polarity,  yielding  akak+ 2  =  1- 
But  for  101,  the  first  and  the  third  pulse  are  of  opposite  polarity,  yielding  akak+2  =  -1. 
Thus,  on  the  average,  akak+ 2  =  1  for  N/S  terms,  -1  for  N/S  terms  and  0  for  3N/4  terms. 
Hence, 


In  general 


For  n  >  2,  the  product  akak+n  can  be  1 ,  - 1 ,  or  0.  Moreover,  an  equal  number  of  combinations 
have  values  1  and  —  1.  This  causes  Rn  =  0.  Thus 


Rn=  0  n  >  1 
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Figure  7.9 

PSD  of  bipolar, 
polar,  and 
split-phase 
signals 

normalized  for 
equal  powers. 
Half-width 
rectangular 
pulses  are  used. 


Figure  7.10 

(a)  HDB3  signal 
and  (b)  its  PSD. 


and  [see  Eq.  (7.1  lc)] 


\P(f)\2 

Sy(f)  =  [1  -  cos  2nJTh]  (7.21a) 

2  Tb 

(7.21b) 

Note  that  Sy(f  )  =  0  for /  =  0  (dc),  regardless  of  P(f).  Hence,  the  PSD  has  a  dc  null,  which  is 
desirable  for  ac  coupling.  Moreover,  sin2  (7 rJTb)  =  0  at/  =  1  /Tb,  that  is,  at/  =  \/Th  =  Rb 
Hz.  Thus,  regardless  of  P(f ),  we  are  assured  of  the  first  non-dc  null  bandwidth  Rh  Hz.  For  the 
half-width  pulse 


Sy{f)  =  Tt  sine2  (~^  sin2  (nJTb) 


(7.22) 


This  is  shown  in  Fig.  7.9.  The  essential  bandwidth  of  the  signal  is  Rh  (Rb  =  1  /Tb),  which  is 
halt  that  of  polar  using  the  same  half-width  pulse  or  on-off  signaling  and  twice  the  theoretical 
minimum  bandwidth.  Observe  that  we  were  able  to  obtain  the  bandwidth  Rb  for  polar  (or 
on-off)  case  tor  tull-width  pulse.  For  the  bipolar  case,  the  bandwidth  is  Rb  Hz  whether  the 
pulse  is  half-width  or  full-width. 


Bipolar  signaling  has  several  advantages:  ( 1 )  its  spectrum  has  a  dc  null;  (2)  its  bandwidth  is 
not  excessive;  (3)  it  has  single-error-detection  capability.  This  is  because  even  single  detection 
error  w, I  cause  a  violation  of  the  alternating  pulse  rule,  and  this  will  be  immediately  detected. 

I  a  bipolar  signal  is  rectified,  we  get  an  on-off  signal  that  has  a  discrete  component  at  the  clock 
frequency.  Among  the  disadvantages  of  a  bipolar  signal  is  the  requirement  for  twice  as  much 
power  (3  dB)  as  a  polar  signal  needs.  This  is  because  bipolar  detection  is  essentially  equivalent 
to  on-off  signaling  from  the  detection  point  of  view.  One  distinguishes  between  +p(t)  or  -P«) 
trom  0  rather  than  between  ±p(t). 


Anotner  disadvantage  of  bipolar  signaling  is  that  it  is  not  transparent.  In  practice,  vu. 
substitution  schemes  are  used  to  prevent  long  strings  of  logic  zeros  from  allowing  the  extra 
clock  signals  to  drift  away.  We  shall  now  discuss  two  such  schemes. 


icted 


7.2  Line  Coding  393 


Figure  7.10 

(a)  HDB3  signal 
and  (b)  its  PSD. 


Input  digits  010111000010110100000000001011010100001 


Coded  digits  oioin jo 0_0_v]i  ono  i[i_o_o_v]|_oo_v]o 01011010  1  jjfiFTvji 
Transmitted 

waveform  y(t)  ^  ^  un,  un,  ,  ,  fl,  un,  ,  fl,  ,  ,  ■  , 

V  V  v  v° 

(a) 


High-Density  Bipolar  (HDB)  Signaling 

The  HDB  scheme  is  an  ITU  (formerly  CCITT)  standard.  In  this  scheme  the  problem  of  nontrans¬ 
parency  in  bipolar  signaling  is  eliminated  by  adding  pulses  when  the  number  of  consecutive 
0s  exceeds  N.  Such  a  modified  coding  is  designated  as  high-density  bipolar  coding  (HDBN), 
where  N  can  take  on  any  value  1,  2,  3, ... .  The  most  important  of  the  HDB  codes  is  HDB3 
format,  which  has  been  adopted  as  an  international  standard. 

The  basic  idea  of  the  HDBN  code  is  that  when  a  run  of  N  +  1  zeros  occurs,  this  group  of 
zeros  is  replaced  by  one  of  the  special  N  +  I  binary  digit  sequences.  To  increase  the  timing 
content  of  the  signal,  the  sequences  are  chosen  to  include  some  binary  Is.  The  Is  included 
deliberately  violate  the  bipolar  rule  for  easy  identification  of  the  substituted  sequence.  In  HDB3 
coding,  for  example,  the  special  sequences  used  are  000V  and  B00V  where  B=1  that  conforms 
to  the  bipolar  rule  and  V=1  that  violates  the  bipolar  rule.  The  choice  of  sequence  000V  or 
B00V  is  made  in  such  a  way  that  consecutive  V  pulses  alternate  signs  to  avoid  dc  wander 
and  to  maintain  the  dc  null  in  the  PSD.  This  requires  that  the  sequence  BOOV  be  used  when 
there  are  an  even  number  of  Is  following  the  last  special  sequence  and  the  sequence  000V  be 
used  when  there  are  an  odd  number  ot  Is  following  the  last  sequence.  Figure  7. 10a  shows  an 
example  of  this  coding.  Note  that  in  the  sequence  BOOV.  both  B  and  V  are  encoded  by  the 
same  pulse.  The  decoder  has  to  check  two  things — the  bipolar  violations  and  the  number  of  0s 
preceding  each  violation  to  determine  if  the  previous  1  is  also  a  substitution. 

Despite  deliberate  bipolar  violations,  HDB  signaling  retains  error  detecting  capability. 
Any  single  error  will  insert  a  spurious  bipolar  violation  (or  will  delete  one  of  the  deliberate 
violations).  This  will  become  apparent  when,  at  the  next  violation,  the  alternation  of  viola¬ 
tions  does  not  appear.  This  also  shows  that  deliberate  violations  can  be  detected  despite  single 
errors.  Figure  7.10b  shows  the  PSD  of  HDB3  as  well  as  that  of  a  bipolar  signal  to  facilitate 
comparison.3 
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Binary  with  N  Zero  Substitution  (BNZS)  Signaling 

A  class  of  line  codes  similar  to  HDBN  is  the  binary  with  N  zero  substitution,  or  BNZS 
code,  where  if  N  zeros  occur  in  succession,  they  are  replaced,  by  one  of  the  two  special 
sequences  containing  some  Is  to  increase  timing  content.  There  are  deliberate  bipolar  violations 
just  as  in  HDBN.  Binary  with  eight-zero  substitution  (B8ZS)  is  used  in  DS1  signals  of  the 
digital  telephone  hierarchy  in  Chapter  6.  It  replaces  any  string  of  eight  zeros  in  length  with  a 
sequence  of  ones  and  zeros  containing  two  bipolar  violations.  Such  a  sequence  is  unlikely  to  be 
counterfeited  by  errors,  and  any  such  sequence  received  by  a  digital  channel  bank  is  replaced 
by  a  string  of  eight  logic  zeros  prior  to  decoding.  The  sequence  used  as  a  replacement  consists 
of  the  pattern  000VB0VB.  Similarly,  in  B6ZS  code  used  in  DS2  signals,  a  string  of  six  zeros 
is  replaced  with  OVBOVB,  and  DS3  signal  features  a  three-zero  B3ZS  code.  The  B3ZS  code 
is  slightly  more  complex  than  the  others  in  that  either  BOV  or  00V  is  used,  the  choice  being 
made  so  that  the  number  of  B  pulses  between  consecutive  V  pulses  is  odd.  These  BNZS  codes 
with  N  =  3, 6,  or  8  involve  bipolar  violations  and  must  therefore  be  carefully  replaced  by  their 
equivalent  zero  strings  at  the  receiver. 

There  are  many  other  transmission  (line)  codes,  too  numerous  to  list  here.  A  list  of  codes 
and  appropriate  references  can  be  found  in  Bylanski  and  Ingram.3 


7.3  PULSE  SHAPING 

The  PSD  Sy(f)  ot  a  digital  signal  y(t )  can  be  controlled  by  a  choice  of  line  code  or  by  P(f)< 
the  pulse  shape.  In  the  last  section  we  discussed  how  the  PSD  is  controlled  by  a  line  code.  In 
this  section  we  examine  how  Sy(f)  is  influenced  by  the  pulse  shape  p(t),  and  we  learn  how  to 
shape  a  pulse p(t)  to  achieve  a  desired  Sy(f ).  The  PSD  Sy(f)  is  strongly  and  directly  influenced 
by  the  pulse  shape  p(t)  because  Sy(f)  contains  the  term  \P(f)\2.  Thus,  in  comparison  to  the 
nature  ot  the  line  code,  the  pulse  shape  is  a  more  direct  and  potent  factor  in  terms  of  shaping 
the  PSD  Sy(f). 


7.3.1  Intersymbol  Interferences  (ISI)  and  Effect 

In  the  last  section,  we  used  a  simple  half-width  rectangular  pulse  p(t )  for  the  sake  of  illustra 
non.  Strictly  speaking,  in  this  case  the  bandwidth  of  Sy(f)  is  infinite,  since  P(f)  has  infinit 
bandwidth.  But  we  found  that  the  essential  bandwidth  of  Sy(f)  was  finite.  For  example,  most  o 
the  power  of  a  bipolar  signal  is  contained  within  the  essential  band  0  to  Rh  Hz.  Note,  howevei 
that  the  PSD  is  small  but  is  still  nonzero  in  the  range/  >  Rb  Hz.  Therefore,  when  such 
signal  is  transmitted  over  a  channel  of  bandwidth  Rh  Hz,  a  significant  portion  of  its  spectrur 
is  transmitted,  but  a  small  portion  of  the  spectrum  is  suppressed.  In  Sec.  3.5  and  Sec.  3.6,  w 
saw  how  such  a  spectral  distortion  tends  to  spread  the  pulse  (dispersion).  Spreading  of  a  pub 
beyond  its  allotted  time  interval  Th  will  cause  it  to  interfere  with  neighboring  pulses.  This  i 
known  as  intersymhol  interference  or  ISI. 

ISI  is  not  noise.  ISI  is  caused  by  nonideal  channels  that  are  not  distortionless  over  th 
entire  signal  bandwidth.  In  the  case  of  half-width  rectangular  pulse,  the  signal  bandwidth  is 

strictly  speaking,  infinity.  ISI.  as  a  manifestation  of  channel  distortion,  can  cause  errors  « 
pulse  detection  if  it  is  large  enough. 

To  resolve  the  difficulty  of  ISI,  let  us  review  briefly  our  problem.  We  need  to  transmit 
pulse  every  Th  interval,  the  *th  pulse  being akp(t-kTb).  The  channel  hasafinitebandwidth.au 


figure  7.11 
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we  are  required  to  detect  the  pulse  amplitude  ak  correctly  (i.e.,  without  ISI).  In  our  discussion 
so  tar,  we  have  considered  time-limited  pulses.  Since  such  pulses  cannot  be  band-limited,  part 
of  their  spectra  is  suppressed  by  a  band-limited  channel.  This  causes  pulse  distortion  (spreading 
out)  and,  consequently,  ISI.  We  can  try  to  resolve  this  difficulty  by  using  pulses  that  are  band- 
limited  to  begin  with  so  that  they  can  be  transmitted  intact  over  a  band-limited  channel.  But 
band-limited  pulses  cannot  be  time-limited.  Obviously,  various  pulses  will  overlap  and  cause 
ISI.  Thus,  whether  we  begin  with  time-limited  pulses  or  band-limited  pulses,  it  appears  that  ISI 
cannot  be  avoided.  It  is  inherent  in  the  finite  transmission  bandwidth.  Fortunately,  there  is  an 
escape  from  this  blind  alley.  Pulse  amplitudes  can  be  detected  correctly  despite  pulse  spreading 
(or  overlapping),  if  there  is  no  ISI  at  the  decision-making  instants.  This  can  be  accomplished 
by  a  properly  shaped  band-limited  pulse.  To  eliminate  ISI,  Nyquist  proposed  three  different 
criteria  for  pulse  shaping,4  where  the  pulses  are  allowed  to  overlap.  Yet,  they  are  shaped  to 
cause  zero  (or  controlled)  interference  with  all  the  other  pulses  at  the  decision-making  instants. 
Thus,  by  limiting  the  noninterference  requirement  only  at  the  decision-making  instants,  we 
eliminate  the  need  for  the  pulse  to  be  totally  nonoverlapping.  We  shall  consider  only  the  first 
two  criteria.  The  third  is  much  less  useful  than  the  first  two  criteria,5  and  hence,  will  not  be 
considered  here. 


7.3.2  Nyquist's  First  Criterion  for  Zero  ISI 

In  the  first  method,  Nyquist  achieves  zero  ISI  by  choosing  a  pulse  shape  that  has  a  nonzero 
amplitude  at  its  center  (say  t  =  0)  and  zero  amplitudes  at  t  =  ±nTb  (n  =  1,  2,  3, . . .),  where 
Tb  is  the  separation  between  successive  transmitted  pulses  (Fig.  7.1  la).  Thus, 


P(t)  = 


1 

0 


/  =  0 


t  =  ±nTb 


(7.23) 


A  pulse  satisfying  this  criterion  causes  zero  ISI  at  all  the  remaining  pulse  centers,  or  signaling 
instants  as  shown  in  Fig.  7. 1  la,  where  we  show  several  successive  pulses  (dashed)  centered  at 
t  =  0,  Tb ,  2Tb,  3 Tb,  . . .  (Tb  =  I  /Rb)-  For  the  sake  of  convenience,  we  have  shown  all  pulses 
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I 


to  be  positive.*  It  is  clear  from  this  figure  that  the  samples  at  t  =  0,  7*,  27),,  37),,  . . .  consist 


of  the  amplitude  of  only  one  pulse  (centered  at  the  sampling  instant)  with  no  interference  from 


the  remaining  pulses. 


Now  transmission  of  /?/,  bit/s  requires  a  theoretical  minimum  bandwidth  Rb/2  Hz.  It  would 
be  nice  if  a  pulse  satisfying  Nyquist’s  criterion  had  this  minimum  bandwidth  /?/,/ 2  Hz.  Can  we 
find  such  a  pulse  /?(/)?  We  have  already  solved  this  problem  (Example  6.1  with  B  =  /?/,/ 2), 
where  we  showed  that  there  exists  one  (and  only  one)  pulse  which  meets  Nyquist’s  criterion 
(7.23)  and  has  a  bandwidth  Rb/ 2  Hz.  This  pulse,  p(t)  =  sine  (n /?/,f),  (Fig.  7.11b)  has  the 
property 


=  0 


(7.24a) 


Moreover,  the  Fourier  transform  of  this  pulse  is 


(7.24b) 


which  has  a  bandwidth  R/,/2  Hz  as  seen  from  Fig.  7.1  lc.  We  can  use  this  pulse  to  transmit  at 
a  rate  of  Rf)  pulses  per  second  without  ISI,  over  a  bandwidth  only  Rb/ 2. 

This  scheme  shows  that  we  can  attain  the  theoretical  limit  of  performance  by  using  a 
sine  pulse.  Unfortunately,  this  pulse  is  impractical  because  it  starts  at  — oo.  We  will  have  to 
wait  an  infinite  time  to  generate  it.  Any  attempt  to  truncate  it  would  increase  its  bandwidth 
beyond  Rb/ 2  Hz.  But  even  if  this  pulse  were  realizable,  it  would  have  an  undesirable  feature: 
namely,  it  decays  too  slowly  at  a  rate  1  /t.  This  causes  some  serious  practical  problems.  For 
instance,  if  the  nominal  data  rate  of  Rb  bit/s  required  for  this  scheme  deviates  a  little,  the  pulse 
amplitudes  will  not  vanish  at  the  other  pulse  centers.  Because  the  pulses  decay  only  as  \jU 
the  cumulative  interference  at  any  pulse  center  from  all  the  remaining  pulses  is  of  the  form 
1  /**)•  ^  we^  known  that  the  infinite  series  of  this  form  does  not  converge  and  can  add  up 
to  a  very  large  value.  A  similar  result  occurs  if  everything  is  perfect  at  the  transmitter  but  the 
sampling  rate  at  the  receiver  deviates  from  the  rate  of/?/,  Hz.  Again,  the  same  thing  happens 
if  the  sampling  instants  deviate  a  little  because  of  pulse  time  jitter,  which  is  inevitable  even  in 
the  most  sophisticated  systems.  This  scheme  therefore  fails  unless  everything  is  perfect,  which 
is  a  practical  impossibility.  And  all  this  is  because  sine  (jrRht)  decays  too  slowly  (as  1 
solution  is  to  find  a  pulse  p(t)  that  satisfies  Eq.  (7.23)  but  decays  faster  than  \/t.  Nyquist  has 
shown  that  such  a  pulse  requires  a  bandwidth  kRb/ 2,  with  1  <  k  <  2. 

This  can  be  proved  as  follows.  Let  p{t)  <=»  P{f)%  where  the~bandwidth  of  P(f)  is  in  *e 
range  (Rh/ 2,  Rh)  (Fig.  7.12a).  The  desired  pulse  p(t)  satisfies  Eq  (7.23).  If  we  sample />(') 
7),  seconds  by  multiplying p(t)  by  STb(t),  (an  impulse  train),  then  because  of  the  property 
(7._  ),  all  the  samples,  except  the  one  at  the  origin,  are  zero.  Thus,  the  sampled  signal p(t) >s 


(7.25) 


P(t)  =  p(t)8Tb(t)  =  8{t) 


ollowin  the  analysis  of  Eq.  (6.4)  in  Chapter  6,  we  know  that  the  spectrum  of  a  sampled  signal 
Pit  is  /Th  times)  the  spectrum  of  p(t)  repeating  periodically  at  intervals  of  the  sampling 


reasoning.  Showing  negative  pulses  would  make  life  fig^ 


iomW  considering  all  positive  pulses  does  not  affect  our 
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Figure  7. 1 2 
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criterion  pulse. 
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frequency  /?/,.  Therefore,  the  Fourier  transform  of  both  sides  of  Eq.  (7.25)  yields 

1  °°  1 

—  Y  P(f~  nRh)  =  1  where  /?/,  =  —  (7.26) 

Tb  Tb 

or 

00 

£  P(f-nRh)  =  Tb  (7.27) 

n=— oo 

Thus,  the  sum  of  the  spectra  formed  by  repeating  P(f)  spaced  Rb  apart  is  a  constant  Tb,  as 
shown  in  Fig.  7.12b.* 

Consider  the  spectrum  in  Fig.  7. 1 2b  over  the  range  0  <f  <  Rh.  Over  this  range  only  two 
terms  P{f)  and  P(f  -  Rb)  in  the  summation  in  Eq.  (7.27)  are  involved.  Hence 

P(f)  +  P(f  ~Rb)  =  Tb  0  <f<Rh 

Letting  x  =  f  -  Rb/ 2,  we  have 

P  (x  +  0.5 Rb)  +  P( x-  0.5 Rb)  =  Tb  \x\  <  0.5 Rb  (7.28a) 

or,  alternatively, 

PL+^j+p(x-^j  =  Tb  kl  <  0.5Rb  (7.28b) 

Use  of  the  conjugate  symmetry  property  [Eq.  (3.1 1)]  on  Eq.  (7.28)  yields 

=  Tb  W  <  0.5Rb  (7.29) 


*  Observe  that  if  Kh  >  2  B.  where  B  is  the  bandwidth  (in  hertz)  of  P{f),  the  repetitions  of  P(f)  are  nonoverlapping, 
and  condition  (7.27)  cannot  be  satisfied.  For  Rh  =  2 B.  the  condition  is  satisfied  only  for  the  ideal  low-pass 
P(f)lp(t)  =  sine  (;r Rbl)],  which  is  not  realizable.  Hence,  we  must  have  B  >  Rb/2. 
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It  we  choose -P(f)  to  be  real-valued  and  positive  then  only  | /></)!  needs  to  satisfy  Eq.  (7.29). 
Because  \P(f)\  is  real,  Eq.  (7.29)  implies 

\P  (y  +  X)  I  +  |p  (t  “  *)  |  =  Tb  w  <  0.5 Rb  (7.30) 

Hence  \P(f)\  should  be  of  the  form  shown  in  Fig.  7. 13.  This  curve  has  an  odd  symmetry  about 
e  se  o  axes  intersecting  at  point  a  [the  point  on  \P(f)\  curve  at/  =  Rb/2],  Note  that  this 


|/,(0.5/fy,)|  =  0.5|/>(0)| 


the  minimum  h  7 ' 'a  u ^  ’  'S  'i*Kk  +f*'  whereA  is  the  bandwidth  in  excess  of 
minimum  bandJridlhtf, /2:‘  ^  '  **  ^  rati°  °f  ^  CXCeSS  bandwidth/*  to  the  theoretical 


_ excess  bandwidth 

theoretical  minimum  bandwidth 

fx 

0.5  Rb 


=  2 \fxTb 

(7.31) 

Observe  that  because/,  cannot  be  larger  than  Rb/2, 

0  <  r  <  1 

(7.32) 

l«,hlsrTr/HfT'hl,T'icai  i* 

v  b/  Hz.  Therefore,  the  bandwidth  of  P(f )  is 

Rb/2  Hz,  and  the  excess 

BT  =  ~  A-  —  U+r)Rb 

2  2  2 

(7.33) 
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The  constant  r  is  called  the  roll-off  factor  and  is  also  expressed  in  terms  of  percent.  For 
example,  if  P(f)  is  a  Nyquist  first  criterion  spectrum  with  a  bandwidth  that  is  50%  higher  than 
the  theoretical  minimum,  its  roll-off  factor  r  =  0.5  or  50%. 

A  filter  having  an  amplitude  response  with  the  same  characteristics  is  required  in  the 
vestigial  sideband  modulation  discussed  in  Sec.  4.5  [Eq.  (4.26)].  For  this  reason,  we  shall 
refer  to  the  spectrum  P(f)  in  Eqs.  (7.29)  and  (7.30)  as  a  vestigial  spectrum.  The  pulse 
pit )  in  Eq.  (7.23)  has  zero  ISI  at  the  centers  of  all  other  pulses  transmitted  at  a  rate  of 
Rb  pulses  per  second.  A  pulse  p(t)  that  causes  zero  1S1  at  the  centers  of  all  the  remaining 
pulses  (or  signaling  instants)  is  the  Nyquist  first  criterion  pulse.  We  have  shown  that  a  pulse 
with  a  vestigial  spectrum  [Eq.  (7.29)  or  Eq.  (7.30)]  satisfies  the  Nyquist’s  first  criterion  for 
zero  ISI. 

Because  0  <  r  <  1,  the  bandwidth  of  P(f)  is  restricted  to  the  range  Rh/ 2  to  Rh  Hz.  The 
pulse  p(t)  can  be  generated  as  a  unit  impulse  response  of  a  filter  with  transfer  function  P(f). 
But  because  Pif)  =  0  over  a  frequency  band,  it  violates  the  Paley-Wiener  criterion  and  is 
therefore  unrealizable.  However,  the  vestigial  roll-off  characteristic  is  gradual,  and  it  can  be 
more  closely  approximated  by  a  practical  filter.  One  family  of  spectra  that  satisfies  Nyquist’s 
first  criterion  is 


(7.34) 


2 


Figure  7.14a  shows  three  curves  from  this  family,  corresponding  to/v  — 0  (r  —  0), 
fx  =  fth/4  (r_o.5)  and  fx  =  Rb/2  (r=l).  The  respective  impulse  responses  are  shown  in 
Fig.  7.14b.  It  can  be  seen  that  increasing  fx  (or  r)  improves  pit);  that  is,  more  gradual  cutoff 
reduces  the  oscillatory  nature  of  pit)  and  causes  it  to  decay  more  rapidly  in  time  domain.  For 
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the  case  of  the  maximum  value  of  fx  =  Rb/2  (r  =  1),  Eq.  (7.34)  reduces  to 

P(f)  =  \  (1  +  cos  7rJTb )  n  (7.35a) 

=c“;0ir)n(f)  <7J5M 

This  characteristic  of  Eq.  (7.34)  is  known  in  the  literature  as  the  raised-cosine  characteristic, 
because  it  represents  a  cosine  raised  by  its  peak  amplitude.  Eq.  (7.35)  is  also  known  as  the 
full-cosine  roll-off  characteristic.  The  inverse  Fourier  transform  of  this  spectrum  is  readily 
found  as  (see  Prob  7.3-8) 


...  .,  cos  nRbt  . 

P(t)  =  Rh~\ — 7Z ~  sinc  (*Rbt) 
1  -  4Rrt2 


(7.36) 


This  pulse  is  shown  in  Fig.  7.14b  (r  —  1).  We  can  make  several  important  observations  about 
the  raised-cosine  pulse.  First,  the  bandwidth  of  this  pulse  is  Rb  Hz  and  has  a  value  Rbatt  =  0 
and  is  zero  not  only  at  all  the  remaining  signaling  instants  but  also  at  points  midway  between 
all  the  signaling  instants.  Second,  it  decays  rapidly,  as  \/t\  As  a  result,  the  raised-cosine  pulse 
is  re  ative  y  insensitive  to  deviations  ot/fy,,  sampling  rate,  timing  jitter,  and  so  on.  Furthermore, 
the  pulse-generating  filter  with  transfer  function  P(f)  [Eq.  (7.35b)]  is  closely  realizable.  The 

I  luse  c  aracteristic  that  goes  along  with  this  filter  is  very  nearly  linear,  so  that  no  additional 
phase  equalization  is  needed. 

.  hould  be  remembered  that  it  is  the  pulses  received  at  the  detector  input  that  should 
a\e  tic  orm  or  zero  IS1.  In  practice,  because  the  channel  is  not  ideal  (distortionless),  the 

“  „  fn  I'  U'  ^  Shaped  S°that  after  Passin8  trough  the  channel  with  transfer 
th  .  ‘  ey  Wl  received  with  the  proper  shape  (such  as  raised  -cosine  pulses)  at 
the  receiver.  Hence,  the  transmitted  pulse  Pi(t)  should  satisfy 

Pi(f)Hc(f)  =  P(f) 

H  (f)  asa  chan  'o'*  vest'g‘a*  sP®ctrum  ‘n  Eq.  (7.30).  For  convenience,  the  transfer  function 
out-of-band  noises  *  S°  UdC  d  receiver  rtlter  designed  to  reject  interference  and  other 


From  Eq.  (7.33) 


Rb  -  - - By 

1  +  r  1 


dwke'otV^A  smaller6  PU'Se  tra"smission  rate  varies  from  2 BT  to  B,  .  depending  on  the 

creating  the  same  omhl  glves  a  signaling  rate.  But  the  pulse  p(t)  decays  slowly, 

pulse  r  =  1  and  R  ~  r™'  ?  1  <Jiscussed  for  the  sinc  pulse.  For  the  raised-cosine 

decays  faster  as  1  h2  and  ''  ^  athJeVe  half  the  theoretical  maximum  rate.  But  the  pulse 
y  Iaster  as  1 A  and  is  less  vulnerable  to  ISI. 
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7.3.3  Controlled  ISI  or  Partial  Response  Signaling 

The  Nyquist  criterion  pulse  requires  in  a  bandwidth  somewhat  larger  than  the  theoretical 
minimum.  If  we  wish  to  further  reduce  the  pulse  bandwidth,  we  must  find  a  way  to  widen  the 
pulse  pit)  (the  wider  the  pulse,  the  narrower  the  bandwidth).  Widening  the  pulse  may  result  in 
interference  (ISI)  with  the  neighboring  pulses.  However,  in  the  binary  transmission  with  just 
two  possible  symbols,  a  known  and  controlled  amount  of  ISI  may  be  possible  to  remove  or 
compensate  because  there  are  only  a  few  possible  interference  patterns. 

Consider  a  pulse  specified  by  (see  Fig.  7.15): 


p(nTb)  = 


n  =  0,  1 
for  all  other  n 


(7.37) 


This  leads  to  a  known  and  controlled  ISI  from  the  Arth  pulse  to  the  very  next  transmitted 
pulse.  We  use  polar  signaling  by  means  of  this  pulse.  Thus,  1  is  transmitted  by  pit)  and  0  is 
transmitted  by  using  the  pulse  -pit).  The  received  signal  is  sampled  at  t  =  nTb.  and  the  pulse 
pit)  has  zero  value  at  all  n  except  for  n  =  0  and  I,  where  its  value  is  1  (Fig.  7.15).  Clearly, 
such  a  pulse  causes  zero  ISI  with  all  the  pulses  except  the  succeeding  pulse.  Therefore,  we 
need  to  worry  about  the  ISI  with  the  succeeding  pulse  only.  Consider  two  such  successive 
pulses  located  at  0  and  Tb,  respectively.  If  both  pulses  were  positive,  the  sample  value  of  the 
resulting  signal  at  t  =  Tb  would  be  2.  If  the  both  pulses  were  negative,  the  sample  value  would 
be  -2.  But  if  the  two  pulses  were  of  opposite  polarity,  the  sample  value  would  be  0.  With 
only  these  three  possible  values,  the  signal  sample  clearly  allows  us  to  make  correct  decision 
at  the  sampling  instants.  The  decision  rule  is  as  follows.  If  the  sample  value  is  positive,  the 
present  bit  is  1  and  the  previous  bit  is  also  1.  If  the  sample  value  is  negative,  the  present 
bit  is  0  and  the  previous  bit  is  also  0.  If  the  sample  value  is  zero,  the  present  bit  is  the 
opposite  of  the  previous  bit.  Knowledge  of  the  previous  bit  then  allows  the  determination  of  the 
present  bit. 

Table  7.1  shows  a  transmitted  bit  sequence,  the  sample  values  of  the  received  signal  xit) 
(assuming  no  errors  causes  by  channel  noise),  and  the  detector  decision.  This  example  also 
indicates  the  error  detecting  property  of  this  scheme.  Examination  of  samples  of  the  waveform 
yit)  in  Table  7. 1  shows  that  there  are  always  an  even  number  of  zero-valued  samples  between 
two  full-valued  samples  of  the  same  polarity  and  an  odd  number  of  zero-valued  samples 
between  two  full-valued  samples  of  opposite  polarity.  Thus,  the  first  sample  value  of  .xit)  is  2, 
and  the  next  full-valued  sample  (the  fourth  sample)  is  2.  Between  these  full-valued  samples 
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TABLE  7.1 

Transmitted  Bits  and  the  Received  Samples  in  Controlled  ISI  Signaling 

Information  sequence  1101100010111 
Samples  y(kT/,)  1  20020  -2  -2  0  0  022 

Detected  sequence  1101100010111 


ot  the  same  polarity,  there  are  an  even  number  (i.e.,  2)  of  zero-valued  samples.  If  one  of  the 
sample  values  is  detected  wrong,  this  rule  is  violated,  and  the  error  is  detected. 

The  pulse  p(t)  goes  to  zero  at  /  =  -7*,  and  27/,,  resulting  in  the  pulse  width  (of  the 
primary  lobe)  50  k  higher  than  that  ot  the  first  criterion  pulse.  This  pulse  broadening  in  the 
time  domain  leads  to  reduction  of  its  bandwidth.  This  is  the  second  criterion  proposed  by 
N \ cjn i st .  This  scheme  ot  controlled  ISI  is  also  known  as  correlative  or  partial-response 

scheme.  A  pulse  satisfying  the  second  criterion  in  Eq.  (7.37)  is  also  known  as  the  duobinarv 
pulse. 


7.3.4  Example  of  a  Duobinary  Pulse 

I  I  we  restrict  the  pulse  bandwidth  to  Rh/2.  then  following  the  procedure  of  Example  7. 1 ,  we  can 
for'the  duobhiary  puL °n^  tbe  f°'*ow‘ng  pulse  p(t)  meets  the  requirement  in  Eq.  (7.37) 


(7.38) 


(7.39) 


p(t)  =  sin  (;r/?faf> 
nRbt(l  —  Rht) 

The  Fourier  transform  />(/)  of  the  pulse  p(r)  is  given  by  (see  Prob  7.3-10) 

bmrPvdl'’r,l.,a™  "V»"Ps"r'e  SPeCln,m  |P(01  are  shown  Tig.  7. 16.*  This  pulse  transmits 

. . 

pulse  In  fact  it  is  m^T"^  '*!!' We. are  able  to  ach,eve  the  theoretical  rate  using  the  duobinary 

Here  is  the  c  itch' A  nip  ^  ru  dth.  We  have  achieved  this  rate  for  binary  information, 

information  because  it  cannot  1^)^™^'°"  ^  qUal'fy  “  U"  indePendent  P'eCe  f 

The  duobinary  pulse  would  fail  if  the  n,  ^  ^  UC'  ‘  mUSt  be  selected  from  a  fin,te 
that  is  if  the  pulses  were  t  1  .  .  P  scs  were  tru*y  independent  pieces  of  information. 

The  —  * 

interference  patterns  between  pulses  which  n  therc  are  °nly  3  finite  (known)  numbe^0 
despite  interference  permits  correct  determination  of  pulse  amplitudes 


The  phase  spectrum  is  linear  with  0p(f)  =  ~7tJT^ 


Figure  7.16 

(a)  The  minimum 
bandwidth  pulse 
that  satisfies  the 
duobinary  pulse 
criterion  and 

(b)  its  spectrum. 


7.3  Pulse  Shaping 


403 
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7.3.5  Pulse  Relationship  between  Zero-ISI, 

Duobinary,  and  Modified  Duobinary 

Now  we  can  establish  the  simple  relationship  between  a  pulse  pa(t)  satisfying  the  first  Nyquist 
criterion  (zero  ISI)  and  a  duobinary  pulse  ph(t )  (with  controlled  ISI).  From  Eqs.  (7.23)  and 
(7.37),  it  is  clear  that  pa(kTb)  and  pb(kTb)  only  differ  for  k  =  1.  They  have  identical  sample 
values  for  all  other  integer  k.  Therefore,  one  can  easily  construct  a  pulse  pb(t)  from  pa(t)  by 

Pb(t)  =  Pa(t)  +  Pa(t  -  Tb) 

This  addition  is  the  “controlled”  ISI  or  partial-response  signaling  that  we  deliberately  intro¬ 
duced  to  reduce  the  bandwidth  requirement.  To  see  what  effect  “duobinary”  signaling  has  on 
the  spectral  bandwidth,  consider  the  relationship  of  the  two  pulses  in  the  frequency  domain: 

Pb(f)  =  Pa(f)[  1  +  e~i2nfTb]  (7.40a) 

\Pb(f)\  =  \Pa{f)\>/2(  1  +cos(27t/7),)2  |cos  (nJTb)\  (7.40b) 

We  can  see  that  partial-response  signaling  is  actually  forcing  a  frequency  null  at  2njTh  =  n  or, 
equivalently/  =  0.5/7/,.  Therefore,  conceptually  we  can  see  how  partial-response  signaling 
provides  an  additional  opportunity  to  reshape  the  PSD  or  the  transmission  bandwidth.  Indeed, 
duobinary  signaling,  by  forcing  a  frequency  null  at  0.5/7/,,  forces  its  essential  bandwidth  to 
be  at  the  minimum  transmission  bandwidth  needed  for  a  data  rate  of  \/Th  (as  discussed  in 
Sec.  6.1.3). 
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Figure  7.17 

Equivalent 

duobinory 

signaling. 


In  fact,  many  physical  channels  such  as  magnetic  recording  have  a  zero  gain  at  dc.  There¬ 
fore,  it  makes  no  sense  for  the  baseband  signal  to  have  any  dc  component  in  its  PSD.  Modified 
partial-response  signaling  is  often  adopted  to  force  a  null  at  dc.  One  notable  example  is  the 
so-called  modified  duobinary  signaling  that  requires 


Pc(nTb)  = 


1 

-1 

0 


n  =  - 1 
n  =  1 

for  all  other  integers  n 


(7.41) 


A  similar  argument  indicates  that /?<•(/)  can  be  generated  from  any  pulse  p„ (t)  satisfying  the 
first  Nyquist  criterion  via 


Pc(t)  =  Pa(t  +  Tb)  -  pa(t  -  Tb) 

Equivalently,  in  the  frequency  domain,  the  duobinary  pulse  is 

Pc(f)  =  2jPa(f)  sin  (2nJTb) 

which  uses  sin  (- nfTb )  to  force  a  null  at  dc  to  comply  with  the  physical  channel  constraint. 


7.3.6  Detection  of  Duobinory  Signaling 
and  Differential  Encoding 

For  the  controlled  ISI  method  of  duobinary  signaling.  Fig.  7.17  shows  the  basic  transmitter 
la^ram  e  now  ta  e  a  closer  look  at  the  relationship  of  all  the  data  symbols  at  the  baseband 
and  the  detection  procedure.  For  binary  message  bit  4  =  0,  or  I ,  the  polar  symbols  are  simply 

ak  =  2/t  -  1 

Under  the  controlled  ISI,  the  samples  of  the  transmission  signal  y(t)  are 

y(kTfy)  =  bk  =  cik  (7.42) 

answIredTvfi'rs?  0?  ??  jS  h°W  t0  detect  from  y^Th)  or  bk .  This  question  can  be 

»*=  0-  ±2.  FrJ ",  of  6,  o,^,.  Because  «=  ±1,  to 


bk  —  2  =>  ak  —  1 

bk  =  -2  =»  ak  =  - 1 

bk-0  =»  ak  =  -ak_l  or4  =  l-4_, 


or  4  =  | 
or  4  =  0 


(7.43) 


Kgure  7.18 

u,tterential 


encoded 

duobinary 

s'9naling. 
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Therefore,  a  simple  detector  of  duobinary  signaling  is  to  first  detect  all  the  bits  4  corresponding 
to  bk  =  ±2.  The  remaining  {/?*}  are  zero- valued  samples  that  imply  transition:  that  is,  the 
current  digit  is  1  and  the  previous  digit  is  0,  or  vice  versa.  This  means  the  digit  detection  must 
be  is  based  on  the  previous  digit.  An  example  of  this  digit-by-digit  detection  was  shown  in 
Table  7. 1 .  The  disadvantage  of  the  detection  method  in  Eq.  (7.43)  is  that  when  y{kTb)  =  0,  the 
current  bit  decision  depends  on  the  previous  bit  decision.  If  the  previous  digit  were  detected 
incorrectly,  then  the  error  would  tend  to  propagate,  until  a  sample  value  of  ±2  appears.  To 
mitigate  this  error  propagation  problem,  we  apply  a  effective  mechanism  known  as  differential 
coding. 

Figure  7.18  illustrates  a  duobinary  signal  generator  by  introducing  an  additional  differ¬ 
ential  encoder  prior  to  partial-response  pulse  generation.  As  shown  in  Fig.  7.18,  differential 
encoding  is  a  very  simple  step  that  changes  the  relationship  between  line  code  and  the  message 
bits.  Differential  encoding  generates  a  new  binary  sequence 

pk  =  lk  ©  pk_x  modulo  2 

with  the  assumption  that  the  precoder  initial  state  is  either  po  =  0  or  po  =  1 .  Now,  the  precoder 
output  enters  a  polar  line  coder  and  generates 


at  =  2 Pk  -  1 


Because  of  the  duobinary  signaling  bk  =  ak  +  ak- \  and  the  zero-ISI  pulse  generator,  the 
samples  of  the  received  signal  y(t)  without  noise  become 


y(kTb)  =  bk  =  ak  +  at- 1 

=  2(pk  +  Pk- 1)  —  2 

=  2  (pk- 1  ©  4  +  Pk- 1  —  0 

2(1  -  4)  Pk- 1  =  1 
2(4-U  Pk- 1=0 


(7.44) 


Based  on  Eq.  (7.44),  we  can  summarize  the  direct  relationship  between  the  message  bits  and 
the  sample  values  as 


y(kTh)  = 


0 

±2 


4  =  l 

4=0 


(7.45) 


This  relationship  serves  as  our  basis  for  a  symbol-by-symbol  detection  algorithm.  In  short,  the 
decision  algorithm  is  based  on  the  current  sample  y(kTb).  When  there  is  no  noise,  y(kTh)  =  bk 
and  the  receiver  decision  is 


2  -  \y(kTb)\ 


(7.46) 


figure  7.18 

U|tterential 

encoded 

duobinary 

s'9naling. 
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TABLE  7.2 

Binary  Duobinary  Signaling  with  Differential  Encoding 


Time  k 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

Ik 

1 

1 

0 

1 

1 

0 

0 

0 

1 

0 

1 

1 

1 

Pk 

0 

1 

0 

0 

1 

0 

0 

0 

0 

1 

1 

0 

1 

0 

<>k 

-1 

1 

-1 

-1 

1 

-1 

-1 

-1 

-1 

1 

1 

-1 

1 

-1 

bk 

0 

0 

-2 

0 

0 

-2 

-2 

-2 

0 

2 

o 

o 

o 

Detected  bits 

1 

1 

0 

1 

1 

0 

0 

0 

1 

0 

1 

1 

1 

Figure  7.19 

Pulse  generation 
by  transversal 
filter. 


Therefore,  the  incorporation  of  differential  encoding  with  duobinary  signaling  not  only  sim- 
pl.fies  the  decs, on  rule  but  also  makes  the  decision  independent  of  the  previous  digit  and 

Hiffi*  na  ,S  crror  ProP^at'on-  1°  Table  7.2,  the  example  of  Table  7.1  is  recalculated  with 
deferential  encodmg.  The  decoding  relationship  of  Eq.  (7.45)  is  clearly  shown  in  this  example. 

general!  '  eren  1  enco  mg  defined  for  binary  information  symbols  can  be  conveniently 
fo  I  d  tn°,nb,nar?  Sy?1b0,S' When  the  '"formation  symbols  4  are  M  -ary,  the  only  change 

generated  o  n  l  "8  **  l°  rep'ace  “modul°  2”  with  “modul°  M  •”  Similarly,  other 

Dronapation  n  w  reSp?nSj  s,8na*'ng  such  as  the  modified  duobinary  must  also  face  the  error 

adopted  to  nrev  r  detect,on- A  suitable  ^  of  differential  encoding  can  be  similarly 
adopted  to  prevent  error  propagation. 


7.3.7  Pulse  Generation 

filter  with ^  8'nera"id  as  lhe  unil  impulse  tesponse  c 
the  wavdorm  ,,  .,  ^  1  ^  M  alwa>,s  >*  «■**  A  better  method  is  to  genet 

m  ”  teTnetS  IT1  f  fT™'  m,er  (,aplKd  *‘<y  U“U>  discussed  here.  The  p» 

sszizzz:  r  sma" ^  <*■ 719 

in  Fie  7  1%  When  n™,  P  P  ion  to  these  sample  values  in  sequence,  as  sho' 
applied  ^tt  the  “pu,  of  “n“T  n  T»  *  * 

Pit).  This  output,  when  passed  through  a  low  m  Wl"  **  3  Sta'rcaSe  aPProximatl0n. 

can  be  improved  by  reducing  the  pulse  °“L  ^  aPPr°Ximat' 

need  to  meet  the  desired  N^ql^criterion^H ' ^  T™"8  31  ^  detector  input  of  the  recei' 
that  after  passing  through  the  <-h  i  u  CnCe’ the  transmitted  pulses  should  be  so  shap 
practice,  howevep ttald desired  (Nyquist,  form. 

be  carried  out  by  an  equalizer  at  rh  •  ^  ngld  y  at  tbe  transmitter.  The  final  shaping  c 
y  equalizer  at  the  rece.ver,  as  discussed  later  (Sec.  7.5). 


Figure  7.20 

M  Scrambler. 

(b)  Descrambler. 


7.4  SCRAMBLING 


In  general,  a  scrambler  tends  to  make  th<*  .i..,  c 

Is  or  Os.  Scrambling  can  be  heloful  in  t  3  3  more  random  by  removing  long  strings  of 
binary  data.  Scramblers,  however  are  n  .  ng  extractio"  by  removing  long  strings  of  Os  in 
the  data,  and  they  are  optimized  for  rh^”1311  y  USed  f°r  Preventing  unauthorized  access  to 
generation  of  a  long  string  of  zeros  in  ih*  h  UrP^’  Sucb  optimization  may  actually  result  in 
these  long  zero  strings  by  using  the  °  3,ta'  be  d'g'ta*  ne,work  must  be  able  to  cope  with 
n  S  by  US,n^  the  zero  replacement  techniques  discussed  in  Sec.  7.2. 


Figure  7.19 

Pulse  generation 
by  transversal 
filter. 


(a) 


figure  7.20 

(Q)  Scrambler. 

|b)  Descrambler. 


(a)  <b) 
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Figure  7.20  shows  a  typical  scrambler  and  descrambler.  The  scrambler  consists  of  a  feed¬ 
back  shift  register,  and  the  matching  descrambler  has  a  feedforward  shift  register,  as  shown 
in  Fig.  7.20.  Each  stage  in  the  shift  register  delays  a  bit  by  one  unit.  To  analyze  the  scrambler 
and  the  matched  descrambler,  consider  the  output  sequence  T  of  the  scrambler  (Fig.  7.20a).  If 
S  is  the  input  sequence  to  the  scrambler,  then 

S  ®  D*T  0  D5T  =  T  (7.47) 

where  D  represents  the  delay  operator;  that  is,  D"T  is  the  sequence  T  delayed  by  n  units.  Now, 
recall  that  the  modulo  2  sum  of  any  sequence  with  itself  gives  a  sequence  of  all  Os.  Adding 
ITT  ®  l)'  )T  to  both  sides  of  Eq.  (7.47),  we  get 

S  =  T  ®  (D3  ©  D5)T 
=  [1©(D3©£>5)]7’ 

=  (1  ®  F)T  (7.48) 

where  F  =  D3  ®  D5. 

design  the  descrambler  at  the  receiver,  we  start  with  T,  the  sequence  received  at  the 
descrambler.  From  Eq.  (7.48),  it  follows  that 

T  ®  FT  =  T  ®  (£)3  ©  D5)T  =  S 

readilv  im  V ”  Wa'u^  '1.°  re®enera,e  input  sequence  5  from  the  received  sequence  T,  is 
readily  implemented  by  the  descrambler  shown  in  Fig.  7.20b. 

in  R  Hem-p  "  c*e*ect'®n  error  ‘n  ^  received  sequence  T  will  affect  three  output  bits 

bit  error  g  haS  the  disadvantage  of  causing  multiple  errors  for  a  single  received 

Example  7.2 

aZi,!',,,l"OIOTl "  i$  fed  10  ""  ““■**»  »  RS  7.20a.  Find  the  scrambler 
output  T,  assuming  the  initial  content  of  the  registers  to  be  zero. 

a'nTm^t^asT^T- Fel'y  7  T  ?'  a"d  ,he  S 

again  enters  thp  r^o  .  FS  through  the  feedback  path.  This  new  sequence  FS 

agam  enters  the  regtster  and  ,s  returned  as  and  so  on.  Hence 

T  =  S  ®  FS  ©  F2S  ®  F3S  ©  . . . 

=  (1©F©F2©F3®...)5  (7.49) 

Recognizing  that 

F  =  D3  ©  D5 

we  have 

F' ’  =  103  ®  °5)(d3  ®  O5)  =  D‘  ©  D'O  ®  D»  ®  D» 

Because  modulo-2  addition  of  any  sequence  with  itself  is  zero,  D8  ©  D8  =  0,  and 

F2  =  D6  ©  D10 
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Similarly 


F3  =  (£>6  ©  £>l0)(£>3  ©  D5)  =  D9  ©  D11  ©  £)13  ©  D15 
and  so  on.  Hence  [see  Eq.  (7.49)], 


T  =  (1  ©D3©D5©D6©D9©Dl0©Dn  ©  D12  ©  D13  ©  D15  •  •  •  )S 

Because  DnS  is  simply  the  sequence  5  delayed  by  n  bits,  various  terms  in  the  above 
equation  correspond  to  the  following  sequences: 


S 

d3s 

d5s 

D^S 

D9S 

Dl0S 

DUS 

d'2s 

Di3S 

PI5S 

T 


101010100000111 
000101010100000111 
00000101010100000111 
000000101010100000111 
000000000 101010100000111 
000000000010101010000011 1 
00000000000 1010101 00000 1 1 1 
00000000000010101010000011 1 
0000000000000 1010101 00000 1 1 1 
00000000000000010101010000011 1 
101110001101001 


Note  that  the  input  sequence  contains  the  periodic  sequence  10101010  •  •  ,  as  well  as  a 
long  string  of  0s.  The  scrambler  output  effectively  removes  the  periodic  component,  as 
well  as  the  long  string  of  0s.  The  input  sequence  has  15  digits.  The  scrambler  output  up 
to  the  15th  digit  only  is  shown,  because  all  the  output  digits  beyond  15  depend  on  input 
digits  beyond  15,  which  are  not  given. 

We  can  verify  that  the  descrambler  output  is  indeed  5  when  the  foregoing  sequence 
T  is  applied  at  its  input  (Prob.  7.4-1). 


7-5  DIGITAL  RECEIVERS  AND 
REGENERATIVE  REPEATERS 

Basically,  a  receiver  or  a  regenerative  repeater  performs  three  functions.  ( 1 )  reshaping  incoming 
pulses  by  means  of  an  equalizer,  (2)  extracting  the  timing  information  required  to  sample 
incoming  pulses  at  optimum  instants,  and  (3)  making  symbol  detection  decisions  based  on  the 
pulse  samples.  The  repeater  shown  in  Fig.  7.21  consists  of  a  receiver  plus  a  “regenerator.”  A 
complete  repeater  may  also  include  provision  for  separation  of  dc  power  from  ac  signals.  This 
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Figure  7.21 

Regenerative 

repeater. 


is  normally  accomplished  by  transformer-coupling  the  signals  and  bypassing  the  dc  around 
the  transformers  to  the  power  supply  circuitry.* 


7.5.1  Equalizers 

A  pulse  train  is  attenuated  and  distorted  by  the  transmission  medium.  The  attenuation  can 

Chan  Th"?!  '  1  6  Preamplifier,  whereas  the  distortion  is  compensated  by  the  equalizer. 

rrili  •  //  IS  °  l0n  IS  m  '  e  t0rm  °*  dlsPersion’  which  is  caused  by  an  attenuation  of  certain 

COmf°lents  °' the  data  Pu*se  fain.  Theoretically,  an  equalizer  should  have  a 

the  riii  •  rac  ens  lc  1  at  ls  the  inverse  of  that  of  the  transmission  medium.  This  will  restore 

Inh-  nts  the  lqUenCH  TTntS  eHminate  Pulse  disPersion-  Unfortunately,  this  also 

This  unde  irnhle  h  "0,Se  ^  b°°S,ing  itS  comP°"ents  at  these  critical  frequencies. 

Th.s  undesnab  e  phenomenon  .s  known  as  noise  amplification. 

detector  on! v  n>e'cl|  J*S  b°Wever’  comP'ete  equalization  is  really  not  necessary,  because  a 

T  dy  SimP'e  dec*s'ons — such  as  whether  the  pulse  is  positive 

can  be  tolerated.  Pulse  dis^rsionres'ulu' inTsSh711"^0'6,  C°nsiderab,e  pU'Se 
Noise  increase  r^siihinn  f  .u  ,.  n  and  the  consequent  increase  in  error  detection. 

the  detection  error  probably ^  freqUendes)  a'S° 

inevitable  compromise  between  reducing  ISI  and  ^  8"  ■?"  T™""  equalizer  involv^s  an 
oftheeaualiz-ifinnrhar  •  ..  p  landreducing  the  channel  noise.  A  judicious  choice 

systems."  characte™>cs  »  a  central  feature  in  all  well-designed  digital  communication 

Zero-Forcing  Equalizer 

for  all  t.  All  that  Is  needed  ifto 'eliminL"1"11"11^  (interference)  with  neighboring  pulses 
at  their  respective  samn/in  •  °r  mimmize  interference  with  neighboring  pulses 

sample  SSL  ?,”1!,*  “.  "=«iver  dealt  Is  based  o. 

using  the  transversal  filter  srm  aXe  ’  et|ualization  can  be  accomplished  by  equalizers 
8  ransversal  Liter  structure  encountered  earlier.  Unlike  traditioltal  filters  transversal 

induced  by  power  su^es  ^ightmul^SpS^Ltfr  eleclmnics  of  the  regenerator  from  high-voltage  uansients 
into  a  cable  pair  dedicated  to  the  purpose  *  S  °rmer  w,nd,ngs  may  be  provided  to  couple  fault-locate  signals 
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Figure  7.22 

Zero-forcing 

equalizer 

analysis. 


(a) 


PM 


filter  equalizers  are  easily  adjustable  to  compensate  against  different  channels  or  even  slowly 
time-varying  channels.  The  design  goal  is  to  force  the  equalizer  output  pulse  to  have  zero  IS1 
values  at  the  sampling  (decision-making)  instants.  In  other  words,  the  equalizer  output  pulses 
satisfy  the  Nyquist  or  the  controlled  ISI  criterion.  The  time  delay  T  between  successive  taps 
is  chosen  to  be  7* ,  the  interval  between  pulses. 

To  begin,  set  the  tap  gains  co  =  1  and  c*  =  0  for  all  other  values  of  k  in  the  transversal  filter 
in  Fig.  7.22a.  Thus  the  output  of  the  filter  will  be  the  same  as  the  input  delayed  by  NTb.  For  a 
single  pulse  pr(t)  (Fig.  7.22b)  at  the  input  of  the  transversal  filter  with  the  tap  setting  just  given, 
the  filter  output  p„{t)  will  be  exactly  pr(t  -  A (Tb),  that  is,  pr(t)  delayed  by  NTb.  This  delay  has 
no  practical  effect  on  our  communication  system  and  is  not  relevant  to  our  discussion.  Hence, 
for  convenience,  we  shall  ignore  this  delay.  This  means  thatpr(f)  in  Fig.  7.22b  also  represents 
the  filter  output  p„(t)  for  this  tap  setting  (c0  =  1  and  c*  =  0,  k  £  0).  We  require  that  the 
output  pulse  Po (/)  satisfy  the  Nyquist  s  criterion  or  the  controlled  ISI  criterion,  as  the  case  may 
be.  For  the  Nyquist  criterion,  the  output  pulse  pQ(t)  must  have  zero  values  at  all  the  multiples 
of  Tb.  From  Fig.  7.22b,  we  see  that  the  pulse  amplitudes  a\,  a_i,  and  a2  at  Tb,  -Tb,  and  2Tb, 
respectively,  are  not  negligible.  By  adjusting  the  tap  gains  (c*),  we  generate  additional  shifted 
pulses  of  proper  amplitudes  that  will  force  the  resulting  output  pulse  to  have  desired  values  at 
t  =  0,  ±7*,  ±27),,  .... 

The  output  p0(t)  (Fig  7.22c)  is  the  sum  of  pulses  of  the  form  ckpr(t  -  kTb)  (ignoring  the 
delay  of  NTb).  Thus 


N 

p0(t )  =  ^2  CnPr ~ nTb) 
n=-N 


(7.50) 
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The  samples  of  p0{t)  at  t  =  kTb  are 

N 

Po(kTh)  =  ^  c„pr(kTb-  nTh)  k  =  0,  ±1,  ±2,  ±3, . . .  (7.51a) 

n=-N 


By  using  a  more  convenient  notation  pr[k]  to  denote  pr(kTb)  and  Po[k]  to  denote  p0(kTb), 
Eq.  (7.5  la)  can  be  expressed  as 


N 

Polk]  =  J2  cnpr[k  -  n]  k  =  0,  ±1,  ±2,  ±3,...  (7.51b) 

n——N 


Nyquist’s  first  criterion  requires  the  samples  p„[k]  =  0  for  k  ^  0,  and  p(,[k\  =  1  for  k  =0. 
Upon  substituting  these  values  in  Eq.  (7.5  lb),  we  obtain  a  set  of  infinite  simultaneous  equations 
in  terms  ot  -N  +  1  variables.  Clearly,  it  is  not  possible  to  solve  all  the  equations.  However,  if 
we  specify  the  values  of  Po[k]  only  at  IN  +  1  points  as 


Polk]  = 


1  k=  0 

0  k  =  ±1,  ±2 . ±N 


(7.52) 


then  a  unique  solution  exists.  This  assures  that  a  pulse  will  have  zero  interference  at  sampling 
ins  ants  o  piece  mg  and  N  succeeding  pulses.  Because  the  pulse  amplitude  decays  rapidly, 
interference  beyond  the  Afth  pulse  is  not  significant  for  N  >  2.  in  general.  Substitution  of  the 

vari  ,h!°n  Th  C  ?'  (  5,b)  yields  3  Set  of  2/v  +  •  simultaneous  equations  for  2 N  +  I 
doubles.  These  -N  +  1  equations  can  be  rewritten  in  the  matrix  form  of 


'  0  " 

0 

1 

”  PrlO] 

Prl  1] 

Prl-l] 

Prl  0] 

Prl  2N  +  1] 
•••  Prl~2N  +  2] 

Prl~2N] 

Prl~2N+\] 

I 

0 

'•  : 

0 

— 1 

Pr\2N  -  1] 
 Prl2N] 

Prl2N  -  2] 
PA2N-1] 

••  Prl  0] 

••  Prl  1] 

Prl~  1] 

PrlO] 

P, 


C-N 
C-N+ 1 

C- 1 

co 

C\ 


(7.53) 


the diago^UneT Sut^h Tm  +  l)x(2N+l)  matrix  P r  has  identical  entries  along al 

in  'S  k"°Wn  3S  the  ToeP'itz  matrix  and  is  commonly  encounterec 

and  fim  cdumn  fits  t  ™ A  Toep'itZ  matrix  »  fa»y  determined  by  its  first  rov 

inverse  (see  eg  the  meth  ii  aicl,ProP®It,es  anc*  admits  simpler  algorithms  for  computing  its 
(see,  e.g.,  the  method  by  Trench’).  The  tap  gain  c*  can  be  obtained  by  solving  this  * 
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of  equations  by  taking  the  inverse  of  the  matrix  P, 

c  =  Pr'  P„ 


Example  7.3  For  the  received  pulse  pr(t)  in  Fig.  7.22b,  let 


MO]  =  1 

MU  =  -0.3  M2]  =0.1 

M-U  =  -0.2  M-2]  =  0.05 


Design  a  three-tap  (N  =  \)  equalizer. 

Substituting  the  foregoing  values  in  Eq.  (7.53),  we  obtain 


"  0  “ 

1  -0.2  0.05  ' 

”  c_  l 

I 

= 

-0.3  1  -0.2 

CO 

0 

0.1  -0.3  1 

c  1 

Solution  of  this  set  yields  c_i  =0.210,  co  =  1.13,  andci  =0.318.  This  tap  setting  assures 
us  that  /?o[0]  =  1  and/?o[-l]=Po(l  ]  =  0.  The  ideal  output  p„(t)  is  sketched  in  Fig.  7.22c. 


Note  that  the  equalizer  determined  from  Eq.  (7.53)  can  guarantee  only  the  zero  IS1  con¬ 
dition  of  Eq.  (7.52).  In  other  words,  ISI  is  zero  only  for  k  =  0,  ±1,  . . . ,  ±N.  In  fact,  for  k 
outside  this  range,  it  is  quite  common  that  the  samples p0(kTb)  ^  0,  indicating  some  residual 
ISI.  For  instance,  consider  the  equalizer  problem  in  Example  7.3.  The  samples  of  the  equalized 
pulse  has  zero  ISI  for  k  =  —  1 ,  0,  1 .  However,  from 

N 

PoW  =  cnpr[k  -  n] 
n=-N 

we  can  see  that  the  three-tap  zero-forcing  equalizer  parameters  will  lead  to 

p„[- 3]  =  0.010  Pol- 2]  =  0.0145  p„[2\  =  0.0176 
p0[i]  =  0.0318  M*]=0  *  =  0,  ±1,  ±4,  ... 

It  is  therefore  clear  that  not  all  the  ISI  has  been  removed  because  ot  these  lour  nonzero  samples 
of  the  equalizer  output  pulse.  In  fact,  because  we  only  have  IN  +  1  (N  =  I  in  Example  7.3) 
parameters  in  the  equalizer,  it  is  impossible  to  force  pdk]  =  0  tor  all  k  unless  N  =  oo.  This 
means  that  we  will  not  be  able  to  design  a  practical  finite  tap  equalizer  that  achieves  perfect 
zero  ISI.  Still,  when  N  is  sufficiently  large,  then  typically  the  residual  nonzero  sample  values 
will  be  small,  indicating  that  most  of  the  ISI  has  been  suppressed. 
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Minimum  Mean  Square  Error  (MMSE)  Method 

In  practice,  an  alternative  approach  is  to  minimize  the  mean  square  difference  between  the 
equalizer  output  response  p„[k ]  and  the  desired  zero  ISI  response.  This  is  known  as  the  mini¬ 
mum  mean  square  error  (MMSE)  method  for  designing  transversal  filter  equalizers.  The  MMSE 
method  does  not  try  to  force  the  pulse  samples  to  zero  at  IN  points.  Instead,  we  minimize  the 
squared  errors  averaged  over  a  set  of  output  samples.  This  method  involves  more  simultaneous 
equations.  Thus  we  must  find  the  equalizer  tap  values  to  minimize  the  average  (mean)  square 
error  over  a  larger  window  [-tf,  K]: 


MSE  *  ^  Y. 


k=-K 


where  we  use  a  function  known  as  the  Kronecker  delta 

m  = 


1  k=  0 
0  kjLO 


The  solution  to  this  minimization  problem  can  be  better  represented  in  matrix  form  as 

C  =  Pjpo 

where  Prf  represents  the  Moore-Penrose  pseudo-inverse  of  the  nonsquare  matrix  Pr  of  size 

^  ^  MMSE  design  often  leads  to  a  more  robust  equalizer  for  the 

reduction  of  ISI. 

Adaptive  Equalization  and  Other  More  General  Equalizers:  The  equalizer  filter 
structure  that  is  described  here  has  the  simplest  form.  Practical  digital  communication  systems 
often  apply  much  more  sophisticated  equalizer  structures  and  more  advanced  equalization 
algorithms.  Because  of  the  probabilistic  tools  needed,  we  will  defer  detailed  coverage  on  the 
specialized  topic  of  equalization  to  Chapter  13. 


7 .5.2  Timing  Extraction 

auhe^e^eiveHn'13'  '°  **  Sampled  at  Precise  instants.  This  requires  a  clock  signal 

nization)  delavedlT  ^  the  dOCk  S'gnal  at  the  transmitter  (symbol  or  bit  synchro¬ 

nization),  delayed  by  the  channel  response.  Three  general  methods  of  synchronization  exist: 

a^mastert^ming*  source)^ ^  3  SeC°ndary  Standard  <e8’  transmitter  and  receiver  slaved  to 
2.  Transmitting  a  separate  synchronizing  signal  (pilot  clock). 

3'  ,^ynChr0nizati0n’  Where  timing  information  is  extracted  from  the  received  signal 


communication  ^ 'systems  Th^  'S  suitable  for  lar8e  volumes  of  data  and  high-spe 

to ZS5S? ^  Teth0d’  "  Which.  part  of  the  channel  capacity  is  us 

to  the  data  rate  and  when  aHHV^  ^i'*3  ^  Whe°  the  available  capacity  is  large  in  comparis 

ltional  transmission  power  can  be  spared.  The  third  method 
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figure  7.23 

Timing 

extraction. 


a  very  efficient  method  of  timing  extraction  or  clock  recovery  because  the  timing  is  derived 
from  the  received  message  signal  itself.  An  example  of  the  self-synchronization  method  will 
be  discussed  here. 

We  have  already  shown  that  a  digital  signal,  such  as  an  on-off  signal  (Fig.  7.3a),  contains 
a  discrete  component  of  the  clock  frequency  itself  (Fig.  7.3c).  Hence,  when  the  on-off  binary 
signal  is  applied  to  a  resonant  circuit  tuned  to  the  clock  frequency,  the  output  signal  is  the 
desired  clock  signal. 

Not  all  the  binary  signals  contain  a  discrete  component  of  the  clock  frequency.  For  example, 
a  bipolar  signal  has  no  discrete  component  of  any  frequency  [see  Eq.  (7.2 1 )  or  Fig.  7.9].  In  such 
cases,  it  may  be  possible  to  extract  timing  by  using  a  nonlinear  device  to  generate  a  frequency 
tone  that  is  related  to  the  timing  clock.  In  the  bipolar  case,  for  instance,  a  simple  rectification 
converts  a  bipolar  signal  to  an  on-off  signal,  which  can  readily  be  used  to  extract  timing. 

Small  random  deviations  of  the  incoming  pulses  from  their  ideal  location  (known  as  timing 
jitter)  are  always  present,  even  in  the  most  sophisticated  systems.  Although  the  source  emits 
pulses  at  the  right  instants,  subsequent  operations  during  transmission  (e.g.,  Doppler  shift) 
tend  to  cause  pulses  to  deviate  from  these  original  positions.  The  Q  of  the  tuned  circuit  used 
for  timing  extraction  must  be  large  enough  to  provide  an  adequate  suppression  of  timing  jitter, 
yet  small  enough  to  meet  the  stability  requirements.  During  the  intervals  in  which  there  are 
no  pulses  in  the  input,  the  oscillation  continues  because  of  the  flywheel  effect  of  the  high-Q 
circuit.  But  still  the  oscillator  output  is  sensitive  to  the  pulse  pattern;  for  example,  during  a 
long  string  of  Is  the  output  amplitude  will  increase,  whereas  during  a  long  string  of  Os  it  will 
decrease.  This  introduces  additional  jitter  in  the  timing  signal  extracted. 

The  complete  timing  extractor  and  time  pulse  generator  for  a  polar  case  in  shown  in 
Fig.  7.23.  The  sinusoidal  output  of  the  oscillator  (timing  extractor)  is  passed  through  a  phase 
shifter  that  adjusts  the  phase  of  the  timing  signal  so  that  the  timing  pulses  occur  at  the  maximum 
points.  This  method  is  used  to  recover  the  clock  at  each  of  the  regenerators  in  a  PCM  system.  The 
jitter  introduced  by  successive  regenerators  adds  up,  and  after  a  certain  number  of  regenerators 
it  is  necessary  to  use  a  regenerator  with  a  more  sophisticated  clock  recovery  system  such  as  a 
phase-locked  loop. 


(Mir  'Wvv 


416 


PRINCIPLES  OF  DIGITAL  DATA  TRANSMISSION 


Figure  7.24 

Error  probability 
in  threshold 
detection. 


Timing  Jitter 

Variations  of  the  pulse  positions  or  sampling  instants  cause  timing  jitter.  This  results  from 
several  causes,  some  of  which  are  dependent  on  the  pulse  pattern  being  transmitted,  whereas 
others  are  not.  The  former  are  cumulative  along  the  chain  of  regenerative  repeaters,  since  all 
the  repeaters  are  affected  in  the  same  way,  whereas  the  other  forms  of  jitter  are  random  from 
regenerator  to  regenerator  and  therefore  tend  to  partially  cancel  out  their  mutual  effects  over 
a  long-haul  link.  Random  forms  of  jitter  are  caused  by  noise,  interference,  and  mistuning  of 
the  clock  circuits.  Pattern-dependent  jitter  results  from  clock  mistuning,  amplitude-to-phase 
conversion  in  the  clock  circuit,  and  ISI,  which  alters  the  position  of  the  peaks  of  the  input 
signal  according  to  the  pattern.  The  rms  value  of  the  jitter  over  a  long  chain  of  N  repeaters  can 
be  shown  to  increase  as  y/N. 

Jitter  accumulation  over  a  digital  link  may  be  reduced  by  buffering  the  link  with  an  elastic 
store  and  clocking  out  the  digit  stream  under  the  control  of  a  highly  stable  phase-locked  loop. 
Jitter  reduction  is  necessary  about  every  200  miles  in  a  long  digital  link  to  keep  the  maximum 
jitter  within  reasonable  limits. 

7.5.3  Detection  Error 

Once  the  transmission  has  passed  through  the  equalizer,  detection  can  take  place  at  the  detector 
that  samples  the  received  signal  based  on  the  clock  provided  by  the  timing  extractor.  The  signal 
received  at  the  detector  consists  ot  the  equalized  pulse  train  plus  a  random  channel  noise.  The 
noise  can  cause  error  in  pulse  detection.  Consider,  for  example,  the  case  of  polar  transmission 
using  a  basic  pulse  p(t)  (Fig.  7.24a).  This  pulse  has  a  peak  amplitude  Ap.  A  typical  received 
pulse  train  is  shown  in  Fig.  7.24b.  Pulses  are  sampled  at  their  peak  values.  If  noise  were  absent, 
the  sample  ot  the  positive  pulse  (corresponding  to  1)  would  be  A„  and  that  of  the  negative 
pulse  (corresponding  to  0)  would  be  -APS  Because  of  noise,  these  samples  would  be  ±Ap+n 
where  n  is  the  random  noise  amplitude  (see  Fig.  7.24b).  From  the  symmetry  of  the  situation, 
the  detection  threshold  is  zero;  that  is,  if  the  pulse  sample  value  is  positive,  the  digit  is  detected 
as  1;  it  the  sample  value  is  negative,  the  digit  is  detected  as  0 


*  This  assumes  zero  ISI. 
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The  detector  s  decision  of  whether  to  declare  1  or  0  could  be  made  readily  from  the  pulse 
sample,  except  that  the  noise  value  n  is  random,  meaning  that  its  exact  value  is  unpredictable. 
It  may  have  a  large  or  a  small  value,  and  it  can  be  negative  as  well  as  positive.  It  is  possible 
that  1  is  transmitted  but  n  at  the  sampling  instant  has  a  large  negative  value.  This  will  make 
the  sample  value  Ap  +  n  small  or  even  negative.  On  the  other  hand,  if  0  is  transmitted  and  n 
has  a  large  positive  value  at  the  sampling  instant,  the  sample  value  —Ap  +  n  can  be  positive 
and  the  digit  will  be  detected  wrongly  as  1.  This  is  clear  from  Fig.  7.24b. 

The  performance  of  digital  communication  systems  is  typically  specified  by  the  average 
number  of  detection  errors.  For  example,  if  two  cellphones  (receivers)  in  the  same  spot  are 
attempting  to  detect  the  same  transmission  from  a  cellular  tower,  the  cellphone  with  the  lower 
number  of  detection  errors  is  the  better  receiver.  It  is  likely  to  have  fewer  dropped  calls  and  less 
trouble  receiving  clear  speech.  However,  because  noise  is  random,  sometimes  one  cellphone 
may  be  better  while  other  times  the  other  cellphone  may  have  fewer  errors.  The  real  measure 
of  receiver  performance  is  therefore  the  average  ratio  of  the  number  of  errors  to  the  total 
number  of  transmitted  data.  Thus,  the  meaningful  performance  comparison  is  the  likelihood 
of  detection  error,  or  the  detection  error  probability. 

Because  the  precise  analysis  and  evaluation  of  this  error  likelihood  require  the  knowledge 
and  tools  from  probability  theory,  we  will  postpone  error  analysis  until  after  the  introduction 
of  probability  in  Chapter  8.  Later,  in  Chapter  11,  we  will  discuss  fully  the  error  probability 
analysis  of  different  digital  communication  systems  for  different  noise  models  as  well  as  system 
designs  against  different  noises.  For  example,  Gaussian  noise  can  generally  characterize  the 
random  channel  noise  from  thermal  effects  and  intersystem  cross  talk.  Optimum  detectors 
can  be  designed  to  minimize  the  error  likelihood  against  Gaussian  noise.  However,  switching 
transients,  lightning  strikes,  power  line  load  switching,  and  other  singular  events  cause  very 
high  level  noise  pulses  of  short  duration  to  contaminate  the  cable  pairs  that  carry  digital  signals. 
These  pulses,  collectively  called  impulse  noise,  cannot  conveniently  be  engineered  away,  and 
they  constitute  the  most  prevalent  source  of  errors  from  the  environment  outside  the  digital 
systems.  Errors  are  virtually  never,  therefore,  found  in  isolation,  but  occur  in  bursts  of  up  to 
several  hundred  at  a  time.  To  correct  error  burst,  we  use  special  burst  error  correcting  codes 
described  in  Chapter  15. 


7.6  EYE  DIAGRAMS:  AN  IMPORTANT  TOOL 

In  the  last  section,  we  studied  the  effect  of  noise  and  channel  ISI  on  the  detection  of  digital 
transmissions.  We  also  described  the  design  of  equalizers  to  compensate  the  channel  dis¬ 
tortion  and  explained  the  timing-extraction  process.  We  now  present  a  practical  engineering 
tool  known  as  the  eye  diagram.  The  eye  diagram  is  easy  to  generate  and  is  often  applied 
by  engineers  on  received  signals  because  it  makes  possible  the  visual  examination  of  sever¬ 
ity  of  the  ISI,  the  accuracy  of  timing  extraction,  the  noise  immunity,  and  other  important 
factors. 

We  need  only  a  basic  oscilloscope  to  generate  the  eye  diagram.  Given  a  baseband  signal 
at  the  channel  output 


y(t)  =  akp(t  -  kTb) 


it  can  be  applied  to  the  vertical  input  of  the  oscilloscope.  The  time  base  of  the  scope  is  triggered 
at  the  same  rate  1  /Tb  as  that  of  the  incoming  pulses,  and  it  yields  a  sweep  lasting  exactly  Tb, 
the  interval  of  one  transmitted  data  symbol  ak.  The  oscilloscope  shows  the  superposition  of 
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Figure  7.25 

The  eye 
diagram. 
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many  traces  ot  length  7/,  from  the  channel  output  y ( f ) .  What  appears  on  the  oscilloscope  is 
simply  the  input  signal  (vertical  input)  cut  up  every  Tb  and  then  superimposed  on  top  of  one 
another.  The  resulting  pattern  on  the  oscilloscope  looks  like  a  human  eye,  hence  the  name  eye 
diagram.  More  generally,  we  can  also  apply  a  time  sweep  that  lasts  m  symbol  intervals,  or  mTb. 

'  osclllosc°Pe  pattern  is  simply  the  input  signal  (vertical  input)  cut  up  every  mTb  and  then 
su|  enmposed  on  top  ot  one  another.  The  oscilloscope  will  then  display  an  eye  diagram  that  is 
mrh  wide  and  has  the  shape  of  in  eyes  in  a  horizontal  row. 

We  now  present  an  example.  Consider  the  transmission  of  a  binary  signal  by  polar  NRZ 
pulses  (Fig.  7..5a).  Its  eye  diagrams  are  shown  in  Fig.  7.25b  for  the  time  base  of  Tb  and 

;  h'  [espect,ve  y- In  thls  example.  the  channel  has  infinite  bandwidth  to  pass  the  NRZ  pulse 
an  t  ce  is  no  c  annel  distortion.  Hence,  we  obtain  eye  diagrams  with  totally  open  eye(s). 
We  can  also  consider  a  channel  output  using  the  same  polar  line  code  and  a  different  (RZ) 
hi  rhi  S  mS  °Wn  The  resulting  eye  diagrams  are  shown  in  Fig.  7.25d. 

timin  ^  ,•  C  ^i?  'S  Wlde  °pen  °rdy  at  midpoint  of  the  pulse  duration.  With  proper 

the  the  reCeiVtr  Sh0Uld  Sample  the  received  si8nal  right  at  the  midpoint  where 

a  e  the  best  noise  immunity  at  the  decision  point  (Sec.  7.5.3). 
wher>  the  nid  ^  °  i™  .P°int  °*  tbe  ^  represents  the  best  sampling  instant  of  each  pulse, 

(zero  ISO  ^  “  maximum  without  interference  from  any  other  neighboring  pulse 

through  'this  n(!nV|de[  auhann,el  !hat  is  dis,orti  ve  or  has  finite  bandwidth,  or  both.  After  passing 
^  7  25e  The  '  'h  **  NRZ  P°,ar  ^  <*  Rg.  7.25a  becomes  the  waveform  of 
and  spread  out  The'^  ,r"IU  f>ldses  are  no  longer  rectangular  but  are  be  rounded,  distorted, 
case  the  'sii  no^ T  "0t  fU"y  °pen  ai«  a*  shown  in  Fig.  7.25f.  In  this 
from  the  full-sc  d  -  !T  ,  H™’  pulse. vaIues  at  their  respective  sampling  instants  will  deviate 

partially  closed  eye  pattern.  Y  *  Varymg  am°Um  traCC'  CaUsing  blurS’  resulting  ‘ 

will  cause  DroDort^nnat  Cye  WiU  tend  to  cI°se  in  all  cases.  Weaker  noise 

P  P  ately  less  closing.  The  decision  threshold  with  respect  to  which  symbol 
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Figure  7.26 

Reading  an  eye 
diagram. 


Optimum  sampling 
instant  at  maximum 


Noise  margin  at  the 
best  sampling  point 


!  Level  crossing 


Distortion  at 
best  sampling  point 


ambiguity  shows 
timing  jitter 


Error-free  sampling  region 


(1  or  0)  was  transmitted  is  the  midpoint  of  the  eye.*  Observe  that  for  zero  ISI,  the  system  can 
tolerate  noise  of  up  to  half  the  vertical  opening  of  the  eye.  Any  noise  value  larger  than  this 
amount  can  cause  a  decision  error  if  its  sign  is  opposite  to  the  sign  of  the  data  symbol.  Because 
ISI  reduces  the  eye  opening,  it  clearly  reduces  noise  tolerance.  The  eye  diagram  is  also  used 
to  determine  optimum  tap  settings  of  the  equalizer.  Taps  are  adjusted  to  obtain  the  maximum 
vertical  and  horizontal  eye  opening. 

The  eye  diagram  is  a  very  effective  tool  for  signal  analysis  during  real-time  experiments. 
It  not  only  is  simple  to  generate,  it  also  provides  very  rich  and  important  information  about  the 
quality  and  susceptibility  of  the  received  digital  signal.  From  the  typical  eye  diagram  given  in 
Fig.  7.26,  we  can  extract  several  key  measures  regarding  the  signal  quality. 

•  Maximum  opening  point.  The  eye  opening  amount  at  the  sampling  and  decision  instant 
indicates  that  amount  of  noise  the  detector  can  tolerate  without  making  an  error.  The  quantity 
is  known  as  the  noise  margin.  The  instant  of  maximum  eye  opening  indicates  the  optimum 
sampling  or  decision-making  instant. 

•  Sensitivity  to  timing  jitter.  The  width  of  the  eye  indicates  the  time  interval  over  which  correct 
decision  can  still  be  made,  and  it  is  desirable  to  have  an  eye  with  the  maximum  horizontal 
opening.  If  the  decision-making  instant  deviates  from  the  instant  when  the  eye  has  a  maximum 
vertical  opening,  the  margin  of  noise  tolerance  is  reduced.  This  causes  higher  error  probability 
in  pulse  detection.  The  slope  of  the  eye  shows  how  fast  the  noise  tolerance  is  reduced  and, 
hence,  the  sensitivity  of  the  decision  noise  tolerance  to  variation  of  the  sampling  instant.  It 
demonstrates  the  effects  of  timing  jitter. 

•  Level-crossing  (timing)  jitter.  Typically,  practical  receivers  extract  timing  information  about 
the  pulse  rate  and  the  sampling  clock  from  the  (zero)  level  crossing  of  the  received  signal 
waveform.  The  variation  of  level  crossing  can  be  seen  from  the  width  of  the  eye  comers. 
This  measure  provides  information  about  the  timing  jitter  such  a  receiver  is  expected  to 
experience. 

Finally,  we  provide  a  practical  eye  diagram  example  for  a  polar  signaling  waveform.  In 
this  case,  we  select  a  cosine  roll-off  pulse  that  satisfy  Nyquist’s  first  criterion  of  zero  ISI. 
The  roll-off  factor  is  chosen  to  be  r  =  0.5.  The  eye  diagram  is  shown  in  Fig.  7.27  for  a  time 
base  of  2Th.  In  fact,  even  for  the  same  signal,  the  eye  diagrams  may  be  somewhat  different 
for  different  time  offset  (or  initial  point)  values.  Figure  7.27a  illustrates  the  eye  diagram  of 
this  polar  signaling  waveform  for  a  display  time  offset  of  Tb/2,  whereas  Fig.  7.27b  shows  the 


*  This  is  true  for  a  two-level  decision  (e.g.,  when  p(t)  and  -p(t)  are  used  for  1  and  0,  respectively].  For  a  three-level 
decision  (e.g.,  bipolar  signaling),  there  will  be  two  thresholds. 
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Figure  7.27 

Eye  diagrams  of 
a  polar  signaling 
system  using  a 
raised  cosine 
pulse  with  roll-off 
factor  0.5: 

(a)  over  2  symbol 
periods  27/,  with 
a  time  shift  7/,/ 2; 

(b)  without  time 
shift. 


Sampling  and  decision  instant 


Sampling  and  decision  instants 


(a) 


(b) 


normal  eye  diagram  when  the  display  time  offset  value  is  zero.  It  is  clear  from  comparison 
that  these  two  diagrams  have  a  simple  horizontal  circular  shift  relationship.  By  observing  the 
maximum  eye  opening,  we  can  see  that  this  baseband  signal  has  zero  IS  I,  confirming  the  basic 
feature  of  the  raised-cosine  pulse.  On  the  other  hand,  because  Nyquist’s  first  criterion  places  no 
requirement  on  the  zero  crossing  of  the  pulse,  the  eye  diagram  indicates  that  timing  jitter  would 
be  likely. 


7.7  PAM:  M-ARY  BASEBAND  SIGNALING  FOR 
HIGHER  DATA  RATE 

Regardless  of  which  line  code  is  used,  binary  baseband  modulations  have  one  thing  in  common: 
they  all  transmit  one  bit  of  information  over  the  interval  of  Th  second,  or  at  the  bit  rate  of  1/7* 
bit  per  second.  It  the  transmitter  would  like  to  send  bits  at  a  much  higher  rate,  Th  may  be 
shortened.  For  example,  to  increase  the  bit  rate  by  M,  Th  must  be  reduced  by  the  same  factor 

°f  u  h<TeiVfr’ thCre  'S  3  hCaVy  price  to  be  Paid  in  bandwidth.  As  we  demonstrated  in  Fig.  7.9, 
t  le  ant  width  of  baseband  modulation  is  proportional  to  the  pulse  rate  1/7*.  Shortening  h 
y  a  actor  of  M  will  certainly  increase  the  required  channel  bandwidth  by  M .  Fortunately. 

ii  UCin^  ''  IS,  n0t  *be  0rdy  vvu->  'ncrease  data  rate.  A  very  effective  practical  solution  is  to 
allow  each  pulse  to  carry  multiple  bits.  We  explain  this  concept  here. 

or  each  symbol  transmission  within  the  time  interval  of  Th  to  carry  more  bits,  there 
must  be  more  than  two  symbols  to  choose  from.  By  increasing  the  number  of  symbols  to  M. 

?  7  m!°mat,0n  ,ransmitted  by  each  symbol  will  also  increase  with  M.  For 

for  rnmmW  w  (xz  (  °V  quaternary)’  we  have  four  basic  symbols,  or  pulses,  available 

4  svmho^Th"1  •  k'  a)- A  sequence  of  two  binary  digits  can  be  transmitted  by  just  one 
fvi7  If  III  m  'S  ",:CaUSe  3  seL*Uence  °*  two  bits  can  form  only  four  possible  sequences 
If  L  f  ’  ’  V?  ecause  we  have  four  dist'nct  symbols  available,  we  can  assign  one 

time  duration  of  7S  A  y  ***  COmbinations  <Fig-  7-28a).  Each  symbol  now  occupies  a 
“7  0t  Ts:  A  Sl?nahng  example  for  a  short  sequence  is  given  in  Fig.  7.28b  and  the 
4-ary  eye-diagram  is  shown  in  Fig.  7.28c.  B 

Henc?  m  tmnsmif  ^7*  ^  ^  pair  °f  bits  by  one  4-ary  pulse  (Fig.  7.28b). 

i  '  r  "7  °nly  72)  4-ary  pulses.  This  means  one  4-ary  symbol  can 

combinations,  a  'nary  dlgltS'  Als°’  because  three  bits  can  form  2x2x2 


combinations  a  „r  x,  :  5 '  Because  three  bits  can  form  2xzx/- 

•  g  p  Of  three  bits  can  be  transmitted  by  one  8-ary  symbol.  Similarly,  a  group 


Figure  7.28 

4-Ary  PAM 
signaling:  (a) 
four  RZ  symbols; 

(b)  baseband 
transmission; 

(c)  the  4-ary  RZ 
eye  diagram. 
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Figure  7.28 

4-Ary  PAM 
signaling:  (a) 
four  RZ  symbols; 

(b)  baseband 
transmission; 

(c)  the  4-ary  RZ 
eye  diagram. 
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of  four  bits  can  be  transmitted  by  one  1 6-ary  symbol.  In  general,  the  information  lM  transmitted 
by  an  A/  -ary  symbol  is 


h i  =  log2  M  bits  (7.55) 

This  means  we  can  increase  the  rate  of  information  transmission  by  increasing  A/. 

This  special  Af-ary  signaling  is  known  as  the  pulse  amplitude  modulation  (PAM) 
because  the  data  information  is  conveyed  by  the  varying  pulse  amplitude.  We  should  note 
here  that  pulse  amplitude  modulation  is  only  one  of  many  possible  choices  of  A/  -ary  signaling. 
There  are  an  infinite  number  of  such  choices.  Still,  only  a  limited  few  are  truly  effective  in 
combating  noise  and  efficient  in  saving  bandwidth  and  power.  A  more  detailed  discussion  of 
other  M  -ary  signaling  schemes  will  be  presented  a  little  later,  in  Sec.  7.9. 

As  in  most  system  designs,  there  are  always  prices  to  pay  for  every  possible  gain.  The 
price  paid  by  PAM  to  increase  data  rate  is  power.  As  M  increases,  the  transmitted  power  also 
increases  as  A/.  This  is  because  to  have  the  same  noise  immunity,  the  minimum  separation 
between  pulse  amplitudes  should  be  comparable  to  that  of  binary  pulses.  Therefore,  pulse 
amplitudes  increase  with  M  (Fig.  7.28).  It  can  be  shown  that  the  transmitted  power  increases 
as  A/2  (Prob.  7.7-5).  Thus,  to  increase  the  rate  of  communication  by  a  factor  of  log-,  A/,  the 
power  required  increases  as  A/2.  Because  the  transmission  bandwidth  depends  only  on  the 
pulse  rate  and  not  on  pulse  amplitudes,  the  bandwidth  is  independent  of  A/.  We  will  use  the 
following  example  of  PSD  analysis  to  illustrate  this  point. 


Sample  7.4  Determine  the  PSD  of  the  quaternary  (4-ary)  baseband  signaling  in  Fig.  7.28  when  the  message 
bits  1  and  0  are  equally  likely. 

The  4-ary  line  code  has  four  distinct  symbols  corresponding  to  the  four  different 
combinations  of  two  message  bits.  One  such  mapping  is 


cik 


-3  message  bits  00 
—  1  message  bits  01 
-1-1  message  bits  10 
4-3  message  bits  1 1 


(7.56) 


Therefore,  all  four  values  of  are  equally  likely,  each  with  a  chance  of  1  in  4.  Recall  that 


a 


2 

k 
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Within  the  summation,  1/4  of  the  a*  will  be  ±1,  and  ±3.  Thus, 


Ro 


1  [N 

=  lim  —  —  (— 
N-+ooN  |_4 


N 


N 


3r  +  -(-ir  +  +-dr  +  + 


4  4 

On  the  other  hand,  for  n  >  0,  we  need  to  determine 


N  i\ 

4«>  J  = 


Rn  —  ClkOk+n 

k 

To  find  this  average  value,  we  build  a  table  with  all  the  possible  values  of  the  product 
akak+n'- 


Possible  Values  of 


From  the  foregoing  table  listing  all  the  possible  products  of  we  see  that  each 

product  in  the  summation  can  take  on  any  of  the  following  six  values  ±  1 ,  i3,  i9. 

First,  (±1,  ±9)  are  equally  likely  (1  in  8).  On  the  other  hand,  ±3  are  equally  likely  (1  in 
4).  Thus,  we  can  show  that 


R" = »lm„  s  + £<-»  +  ~(+n  +  ji-3) + £<+»]  =  o 


As  a  result. 


Sx(f)  =  - 


Sy(f)  =  -  \P(f)\ 2 

*  s 


Thus,  the  W-ary  line  code  generates  the  same  PSD  shape  as  binary  polar  signaling, 
e  only  difference  is  that  it  utilizes  5  times  the  original  signal  power. 


Although  most  terrestrial  digital  telephone  network  uses  binary  encoding,  the  subscriber 
loop  portion  of  the  integrated  services  digital  network  (ISDN)  uses  the  quaternary  code,  2B1Q 

J-2®*,!1  USeS  NRZ  pulses  t0  ^nsmits  160  kbit/s  of  data  at  a  baud  rate  (pulse 
rate)  of  80kbit/s.  Of  the  various  line  codes  examined  by  the  ANSI  standards  committee,  2B  IQ 

provided  the  greatest  baud  rate  reduction  in  the  noisy  and  cross-talk-prone  local  cable  plant 
environment.  r 

ni™St  Shap‘nR  and  Eye  Vagrams  in  PAM:  In  this  case,  we  can  use  the  Nyquist  criterion 

can  be  correct!  dT  P  hk  ^  ^  ™  * the  Sample  points’ and-  therefore,  their  amplitudes 

( Dartia^resnon si  r  by  sampling  at  the  pulse  centers.  We  can  also  use  the  controlled  ISI 

(partial  response  signaling)  for  M -ary  signaling.8 
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Figure  7.29 

Eye  diagrams  of 
a  4-ary  PAM 
signaling  system 
using  a 
raised-cosine 
pulse  with  roll-off 
factor  0.5: 

(a)  over  two 
symbol  periods 
27),  with  time 
offset  7),/2; 

(b)  without  time 
offset. 


Sampling  and  decision  instant 


Sampling  and  decision  instants 


(a) 


(b) 


Eye  diagrams  can  also  be  generated  for  M -ary  PAM  by  using  the  same  method  used  for 
binary  modulations.  Because  of  multilevel  signaling,  the  eye  diagram  should  have  M  levels 
at  the  optimum  sampling  instants  even  when  ISI  is  zero.  Here  we  generate  the  practical  eye 
diagram  example  for  a  four-level  PAM  signal  that  uses  the  same  cosine  roll-off  pulse  with 
roll-off  factor  r  =  0.5  that  was  used  in  the  eye  diagram  of  Fig.  7.27.  The  corresponding  eye 
diagrams  with  time  offsets  of  7^/2  and  0  are  given  in  Fig.  7.29a  and  b,  respectively.  Once 
again,  no  ISI  is  observed  at  the  sampling  instants.  The  eye  diagrams  clearly  show  four  equally 
separated  signal  values  without  ISI  at  the  optimum  sampling  points. 

7.8  DIGITAL  CARRIER  SYSTEMS 

Thus  far,  we  have  discussed  baseband  digital  systems,  where  signals  are  transmitted  directly 
without  any  shift  in  frequency.  Because  baseband  signals  have  sizable  power  at  low  frequencies, 
they  are  suitable  for  transmission  over  a  pair  of  wires  and  coaxial  cables.  Much  of  the  modern 
communication  is  conducted  this  way.  However,  baseband  signals  cannot  be  transmitted  over  a 
radio  link  or  satellites  because  this  would  necessitate  impractically  large  antennas  to  efficiently 
radiate  the  low-frequency  spectrum  of  the  signal.  Hence,  for  these  applications,  the  signal 
spectrum  must  be  shifted  to  a  high-frequency  range.  A  spectrum  shift  to  higher  frequencies 
is  also  required  to  transmit  several  messages  simultaneously  by  sharing  the  large  bandwidth 
of  the  transmission  medium.  As  seen  in  Chapter  4,  the  spectrum  of  a  signal  can  be  shifted 
to  a  higher  frequency  by  applying  the  baseband  digital  signal  to  modulate  a  high-frequency 
sinusoid  (carrier). 

In  transmitting  and  receiving  digital  carrier  signals,  we  need  a  modulator  and  demodulator 
to  transmit  and  receive  data.  The  two  devices,  modulator  and  demodulator  are  usually  packaged 
in  one  unit  called  a  modem  for  two-way  (duplex)  communications. 

7.8. 1  Basic  Binary  Carrier  Modulations 

There  are  two  basic  forms  of  carrier  modulation:  amplitude  modulation  and  angle  modulation. 
In  amplitude  modulation,  the  carrier  amplitude  is  varied  in  proportion  to  the  modulating  signal 
(i.e.,  the  baseband  signal  ).  This  is  shown  in  Fig.  7.30.  An  unmodulated  carrier  cos  coct  is  shown 
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Figure  7.30 

(a)  The  carrier 
cos  coct .  (b)  The 
modulating 
signal  m{t). 

(c)  ASK:  the 
modulated  signal 
m(t)  cos  coct . 
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Figure  7.31 

(a)  The 
modulating 
signal  m(t). 

(b)  PSK:  the 
modulated  signal 
m(t)  cos  coct. 

(c)  FSK:  the 
modulated 
signal. 


(b) 


in  Fig.  7.30a.  The  on-ott  baseband  signal  m(t)  (the  modulating  signal)  is  shown  in  Fig.  7.30b. 
It  can  be  written  according  to  Eq.  (7. 1 )  as 


m(t)  =  aup(t  -  AT*), 


where  p(t)  =  n  ^ 


The  line  code  ak  —  0,  I  is  on-off.  When  the  carrier  amplitude  is  varied  in  proportion  to  m(t), 
we  can  write  the  carrier  modulated  signal  as 


<PaskU)  =  m(t)  cos  o)ct  (7.57) 

shown  in  Fig.  7.30c.  Note  that  the  modulated  signal  is  still  an  on-off  signal.  This  modulation 
scheme  of  transmitting  binary  data  is  known  as  on-off  keying  (OOK)  or  amplitude  shift 
keying  (ASK). 

Ot  course,  the  baseband  signal  m(t)  may  utilize  a  pulse  p(t)  different  from  the  rectangular 
one  shown  in  the  example  of  Fig.  7.30.  This  will  generate  an  ASK  signal  that  does  not  have  a 
constant  amplitude  during  the  transmission  of  1  (ak  =  1). 

It  the  baseband  signal  m(t)  were  polar  (Fig.  7.31a),  the  corresponding  modulated  signal 
»Ht)  cos  wet  would  appear  as  shown  in  Fig.  7.31b.  In  this  case,  if  p{t)  is  the  basic  pulse,  we 
are  transmitting  1  by  a  pulse  p(t)  cos  coct  and  0  by  -p(t)  cos  coct  =  p(t)  cos  (coct  +  7t).  Hence, 
the  two  pulses  are  jt  radians  apart  in  phase.  The  information  resides  in  the  phase  or  the  sign 
o  t  ie  pulse.  For  this  reason  this  scheme  is  known  as  phase  shift  keying  (PSK).  Note  that 
the  transmission  is  still  polar.  In  fact,  just  like  ASK,  the  PSK  modulated  carrier  signal  has  the 

C  timp  rnrm  ° 


<Ppsk  (f)  -  »'(t)  cos  a )ct  m(t)  =  ^  akp(t  -  kTh) 


(7.58) 


with  the  difference  that  the  line  code  is  polar  a*  =  ±1 

kevi“  3re  '™smi«5d  ^  varying  the  frequency,  we  have  the  case  of  frequency  shift 

transmitted  hvi^  i  °Wf  l"  ^  ^  *c‘ A  ® 's  transmitted  by  a  pulse  of  frequency  coco,  and  1  's 
.  ‘  .  .  -  a  Pulse  ol  frequency  .  The  information  about  the  transmitted  data  resides  m 

one  rhClt!fTy  Tre  FSK  Signa‘  may  bC  Vi£Wed  -  a  ™  of  two  interleaved  ASK  signal, 
with  a  modulating  frequency  coCfl,  and  the  other  with  a  modulating  frequency  euC| .  We  can 
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Figure  7.31 

(a)  The 
modulating 
signal  m(t). 

(b)  PSK:  the 
modulated  signal 
m(/)cos  coct. 

(c)  FSK:  the 
modulated 
signal. 


(b) 


(c) 


use  the  binary  ASK  expression  of  Eq.  (7.57)  to  write  the  FSK  signal  as. 

<PFSK  W  =  X  °kP^  ~  kTh )  COS  "Cl '  +  D'  "  ak^P(t  -  kTb>  COS  toco1 

where  a*  =  0,  1  is  on-off.  Thus  the  FSK  signal  is  a  superposition  of  two  AM  signals  with 
different  carrier  frequencies  and  different  but  complementary  amplitudes. 

In  practice,  ASK  as  an  on-off  scheme  is  commonly  used  today  in  optical  fiber  communi¬ 
cations  in  the  form  of  laser-intensity  modulation.  PSK  is  commonly  applied  in  digital  satellite 
communications  and  was  also  used  in  earlier  telephone  modems  (2400  and  4800  bit/s).  As  for 
FSK,  AT&T  in  1962  developed  one  of  the  earliest  telephone-line  modems  called  1 03  A;  it  uses 
FSK  to  transmit  300  bit/s  at  two  frequencies,  1070  and  1270  Hz,  and  receives  FSK  at  2025 
and  2225  Hz. 


7.8.2  PSD  of  Digital  Carrier  Modulation 

We  have  just  shown  that  the  binary  carrier  modulations  of  ASK,  PSK,  and  FSK  can  all  be 
written  into  some  forms  of  m(t)  cos  a>ct.  To  determine  the  PSD  of  the  ASK,  PSK,  and  FSK 
signals,  it  would  be  helpful  for  us  to  first  find  the  relationship  between  the  PSD  of  m(t)  and 
the  PSD  of  the  modulated  signal 


<p(t)  =  m(t)  cos  u)ct 


Recall  from  Eq.  (3.81 )  that  the  PSD  of  (p(t)  is 


S<p(f)  = 


lim 

T— too 


l*r(/~)l2 

T 
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where  is  the  Fourier  transform  of  the  truncated  signal 

Figure  7.32 

PSD  of  (a)  ASK, 

<PT(t)  =  cp(t)[u{t  +  T/ 2)  -  u(t  -  77 2)] 

=  m(t)[u(t  +  T / 2)  —  u(t  —  T /2)]  cos 

=  w^COcos  (7.59) 

|b)  PSK,  and 
(c)  FSK. 

Here  mr(t)  is  the  truncated  baseband  signal  with  Fourier  transform  Mr(f ).  Applying  the 
frequency  shift  property  [see  Eq.  (3.37)],  we  have 

*r(/~)  =  \  [ Mrif  ~fc)  +  +fc)  1 

As  a  result,  the  PSD  of  the  modulated  carrier  signal  <p(t)  is 

r— ►oo  4  7 

Because  A/  (/*)  is  a  baseband  signal,  Mj{f  +/c)  and  — /c)  have  zero  overlap  as  T  ->  oo 

as  long  as/r  is  larger  than  the  bandwidth  of  M  (f).  Therefore,  we  conclude  that 

s„<0-  lim  'JWrif+m'  mif-fM*] 

r- OO  4  L  T  T 

=  (/^  +/c)  +  -/c)  (7.60) 

In  other  words,  tor  an  appropriately  chosen  carrier  frequency,  modulation  causes  a  shift  in  the 
baseband  signal  PSD. 

Now,  the  ASK  signal  in  Fig.  7.30c,  fits  this  model,  with  m(t)  being  an  on-off  signal  (using 
a  tull-v\  idth  or  NRZ  pulse).  Hence,  the  PSD  of  the  ASK  signal  is  the  same  as  that  of  an  on-oft 
signal  (Fig.  7.4b)  shitted  to  as  shown  in  Fig.  7.32a.  Remember  that  by  using  a  full-width 

rectangular  pulse  p(r), 

In  this  case,  the  baseband  on-off  PSD  has  no  discrete  components  except  at  dc  in  Fig.  7.30b. 
Theretore,  the  ASK  spectrum  has  discrete  component  only  at  coc. 

The  PSK  signal  also  tits  this  modulation  description  where  m(t )  is  a  polar  signal  using 
a  full-width  NRZ  pulse.  Therefore,  the  PSD  of  a  PSK  signal  is  the  same  as  that  of  the  polar 
baseband  signal  shitted  to  ±a,c,  as  shown  in  Fig.  7.32b.  Note  that  this  PSD  has  the  same  shape 
(wit  ia  d  fterent  scaling  factor)  as  the  PSD  of  the  ASK  minus  its  discrete  components. 
a  J'nMy;  WC  haVC  shown  that  the  FSK  s'gnal  may  be  viewed  as  a  sum  of  two  interleaved 
signals  using  the  full-width  pulse.  Hence,  the  spectrum  of  FSK  is  the  sum  of  two  ASK 
spectra  at  frequencies  coC()  and  coc, ,  as  shown  in  Fig.  7.32c.  It  can  be  shown  that  by  properly 
choosing  (oco  and  wC]  and  by  maintaining  phase  continuity  during  frequency  switching,  discrete 
components  can  be  eliminated  at  <uCo  and  a>Cl.  Thus,  no  discrete  components  appear  in  this 
spectruiiL  It  is  important  to  note  that  the  bandwidth  of  FSK  is  higher  than  that  of  ASK  or  PSK 

As  observed  earlier,  polar  signaling  is  the  most  power-efficient  scheme.  The  PSK.  being 

?  rctluircs  -  dB  less  power  than  ASK  (or  FSK)  tor  the  same  noise  immunity,  that  is,  t°r 

the  same  error  probability  in  pulse  detection. 
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Figure  7.32 

PSD  of  (a)  ASK, 

(b)  PSK,  and 

(c)  FSK. 


Of  course,  we  can  also  modulate  bipolar,  or  any  other  scheme  discussed  earlier.  Also,  note 
that  the  use  of  the  NRZ  rectangular  pulse  in  Fig.  7.30  or  7.3 1  is  for  the  sake  of  illustration  only. 
In  practice,  baseband  pulses  may  be  spectrally  shaped  to  eliminate  ISI. 


7.8.3  Connections  between  Analog  and  Digital 
Carrier  Modulations 

There  is  a  natural  and  clear  connection  between  ASK  and  AM  because  the  message  infor¬ 
mation  is  directly  reflected  in  the  varying  amplitude  of  the  modulated  signals.  Because  of  its 
nonnegative  amplitude,  ASK  is  essentially  an  AM  signal  with  modulation  index  //  =  1.  There 
is  a  similar  connection  between  FSK  and  FM.  FSK  is  simply  an  FM  signal  with  only  limited 
number  of  instantaneous  frequencies. 

The  connection  between  PSK  and  analog  modulation  is  a  bit  more  subtle.  For  PSK,  the 
modulated  signal  can  be  written  as 

V’psk  W  =  ^  cos  +  Ok)  kTb  <  t  <  kT),  +  Tb 

It  can  therefore  be  connected  with  PM.  However,  a  closer  look  at  the  PSK  signal  reveals  that 
because  of  the  constant  phase  Ok ,  its  instantaneous  frequency,  in  fact,  does  not  change.  In  fact. 
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we  can  rewrite  the  PSK  signal 

^psk  (0  =  ^  cos  cos  —  A  sin  0*  sin  coct 

=  at  cos  (oct  +  bk  sin  coct  kTb  <  t  <  kTb  +  Tb  (7.61) 

by  letting  =  A  cos  0*  and  =  —A  sin  0*.  From  Eq.  (7.61),  we  recognize  its  strong  resem¬ 
blance  to  the  QAM  signal  representation  in  Sec.  4.4.  Therefore,  the  digital  PSK  modulation  is 
closely  connected  with  the  analog  QAM  signal.  In  particular,  0  =  0,  ;r  for  binary  PSK.  Thus, 
binary  PSK  can  be  written  as 


±A  cos  a)ct 

This  is  effectively  a  digital  manifestation  of  the  DSB-SC  amplitude  modulation.  In  fact,  as  will 
be  discussed  later,  by  letting  take  on  multilevel  values  while  setting  b^  =  0  we  can  generate 
another  digital  carrier  modulation  known  as  the  pulse  amplitude  modulation  (or  PAM),  which 
can  carry  multiple  bits  during  each  modulation  time-interval  7/>. 

As  we  have  studied  in  Chapter  4,  DSB-SC  amplitude  modulation  is  more  power  efficient 
than  AM.  Binary  PSK  is  therefore  more  power  efficient  than  ASK.  In  terms  of  bandwidth 
utilization,  we  can  see  from  their  connection  to  analog  modulations  that  ASK  and  PSK  have 
identical  bandwidth  occupation  while  FSK  requires  larger  bandwidth.  These  observations 
intuitively  corroborate  our  PSD  results  of  Fig.  7.32. 


7.8.4  Demodulation 

Demodulation  ot  digital-modulated  signals  is  similar  to  that  of  analog-modulated  signals. 
Because  of  the  connections  between  ASK  and  AM,  between  FSK  and  FM.  and  between  PSK  and 
QAM  (or  DSB-SC  AM),  different  demodulation  techniques  used  for  the  analog  modulations 
can  be  directly  applied  to  their  digital  counterparts. 


Figure  7.33 

(a)  Noncoherent 
detection  of  FSK. 

(b)  Coherent 
detection  of  FSK. 


Figure  7.34 

Coherent  binary 
FSK  detector 
(similar  to  a 
DSB-SC 
demodulator). 


ASK  Detection 

Just  like  AM.  ASK  (Fig.  7.30c),  can  be  demodulated  both  coherently  (for  synchronous  detec¬ 
tion)  or  noncoherently  (for  envelope  detection).  The  coherent  detector  requires  more  elaborate 
equipment  and  has  superior  performance,  especially  when  the  signal  power  (hence  SNR)  is 
Urn  For  higher  SNR.  the  envelope  detector  performs  almost  as  well  as  the  coherent  detector. 

ence,  coherent  detection  is  not  often  used  for  ASK  because  it  will  defeat  its  very  purpose 
(the  simplicity  of  detection).  If  we  can  avail  ourselves  of  a  synchronous  detector,  we  might  as 
well  use  PSK.  which  has  better  power  efficiency  than  ASK 


FSK  Detection 

Once  again,  the  binary  FSK  can  be  viewed  as  two  interleaved  ASK  signals  with  carrier  fre¬ 
quencies  o>c  and  ov, ,  respectively  (Fig.  7.32c).  Therefore,  FSK  can  be  detected  coherently  or 
noncoherently.  In  noncoherent  detection,  the  incoming  signal  is  applied  to  a  pair  of  filters  tuned 

T|  , '  ,m  ,l°'  f  J"esPectlve*y-  Each  filter  is  followed  by  an  envelope  detector  (see  Fig.  7.33a)- 
The  outputs  ot  the  two  envelope  detectors  are  sampled  and  compared.  If  a  ()  is  transmitted  by 

Pmc  l  Jlv  nnqUenCy  aV,P  the"  thiS  pulse  Wi"  appear  at  the  output  of  the  filter  tuned  to 
Practically  no  signal  appears  at  the  output  of  the  filter  tuned  to  m  Hence  the  sample  of  the 

envelope  detector  output  following  the  <oCo  filter  will  be  greater  than  the  sample  of  the  envelope 
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Figure  7.33 

(a)  Noncoherent 
detection  of  FSK. 

(b)  Coherent 
detection  of  FSK. 
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Figure  7.34 

Coherent  binary 
PSK  detector 
(similar  to  a 
DSB-SC 
demodulator). 


detector  output  following  the  a>Ct  filter,  and  the  receiver  decides  that  a  0  was  transmitted.  In 
the  case  of  a  1,  the  opposite  happens. 

Of  course,  FSK  can  also  be  detected  coherently  by  generating  two  references  of  frequencies 
coCo  and  (DCy ,  for  the  two  demodulators,  to  demodulate  the  signal  received  and  then  comparing 
the  outputs  of  the  two  demodulators  as  shown  in  Fig.  7.33b.  Thus,  coherent  FSK  detector 
must  generate  two  carriers  in  synchronization  with  the  modulation  carriers.  Once  again,  this 
complex  demodulator  defeats  the  purpose  of  FSK,  which  is  designed  primarily  for  simpler, 
noncoherent  detection.  In  practice,  coherent  FSK  detection  is  not  in  use. 


PSK  Detection 

In  binary  PSK,  a  1  is  transmitted  by  a  pulse  A  cos  coct  and  a  0  is  transmitted  by  a  pulse 
-A  cos  ioct  (Fig.  7.3  lb).  The  information  in  PSK  signals  therefore  resides  in  the  carrier  phase. 
Just  as  in  DSB-SC,  these  signals  cannot  be  demodulated  via  envelope  detection  because  the 
envelope  stays  constant  for  both  1  and  0  (Fig.  7.3  lb).  The  coherent  detector  of  the  binary  PSK 
modulation  is  shown  in  Fig.  7.34.  The  coherent  detection  is  similar  to  that  used  for  analog 
signals.  Methods  of  carrier  acquisition  have  been  discussed  in  Sec.  4.8. 


Differential  PSK 

Although  envelope  detection  cannot  be  used  for  PSK  detection,  it  is  still  possible  to  exploit  the 
finite  number  of  modulation  phase  values  for  noncoherent  detection.  Indeed,  PSK  signals  may 
be  demodulated  noncoherently  by  means  of  an  ingenious  method  known  as  differential  PSK, 
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Figure  7.35 

(a)  Differential 
encoding; 

(b)  encoded 
signal; 

(c)  differential 
PSK  receiver. 
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or  DPSK.  The  principle  of  differential  detection  is  for  the  receiver  to  detect  the  relative  phase 
change  between  successive  modulated  phases  9k  and  6k— i-  Since  the  phase  value  in  PSK  is 
hnite  (equaling  to  0  and  n  in  binary  PSK),  the  transmitter  can  encode  the  information  data  into 
t  e  phase  difference  0*  —  0*-i.  For  example,  a  phase  difference  of  zero  represents  0  whereas 
a  phase  difference  of  n  signifies  1. 

This  technique  is  known  as  differential  encoding  (before  modulation).  In  one  differential 
ct  e,  a  ,s  encoded  by  the  same  pulse  used  to  encode  the  previous  data  bit  (no  transition),  and 
a  is  encoded  by  the  negative  of  the  pulse  used  to  encode  the  previous  data  bit  (transition). 
•  '  Crf",,a‘  e"‘-'oding  ,s  simP,e  t0  implement,  as  shown  in  Fig.  7.35a.  Notice  that  the  addition 
is  modulo-2  .The  encoded  signal  is  shown  in  Fig.  7.35b.  Thus  a  transition  in  the  line  code  pulse 
sequence  indicates  1  and  no  transition  indicates  0.  The  modulated  signal  consists  of  pulses 


A  cos  ( o)ct  -I -  0* )  =  ±A  cos  coct 

mdsest!!  bi:‘S  (VhC  PreSem  PUlSC  and  the  previous  pulse  ^ve  the  same  polarity  or  phase;  both 
nuke  are  r>f  ^  COS  or  ^  cos  o>ct.  It  the  data  bit  is  1,  the  present  pulse  and  the  previou: 
pulse  are  of  oppos.te  polarities  or  phases;  if  the  present  pulse  is  A  cos  co(;.  the  previous  puls, 
is  -A  cos  (oct,  and  vice  versa. 

ine  that  the'  (p'»-  735c),  we  avoid  generation  of  a  local  carrier  by  observ 

ambiauitv  For  H  ^  ^  Signa'  ^  is  3  Carrier  <±^cos  <**)  with  a  possible  sip 

iTn  one  hi t  in  I  ?!'  m  P'aCe  °f  th£  Carrier’  use  the  received  signal  delay* 
uct  v(t)  -  *W  '  If  th^  received  pulse  is  identical  to  the  previous  pulse,  the  prod 

W^mLilrlr  2  {A  /2Kl  +  C°S  ^  a"d  «he  low-pass  filter  output  ;(r)  =  M 
We  immediately  detect  the  present  bit  as  0.  If  the  received  pulse  and  the  previous  pulse  are  o 
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TABLE  7.3 

Differential  Encoding  and  Detection  of  Binary  DPSK 


Time  k 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

4 

1 

0 

1 

0 

0 

1 

1 

1 

0 

0 

Qk 

0 

1 

1 

0 

0 

0 

1 

0 

1 

1 

1 

Line  code 

-1 

1 

1 

-1 

-1 

-1 

1 

-1 

1 

1 

1 

h 

IT 

0 

0 

n 

IT 

7T 

0 

7T 

0 

0 

0 

Ok  ~  Ok- 1 

71 

0 

IT 

0 

0 

7T 

7 r 

7T 

0 

0 

Detected  bits 

1 

0 

1 

0 

0 

1 

1 

1 

0 

0 

opposite  polarity,  y(t )  =  -A2  cos2  coct  and  z(r)  =  - A2/2 ,  and  the  present  bit  is  detected  as 
0.  Table  7.3  illustrates  a  specific  example  of  the  encoding  and  decoding. 

Thus,  in  terms  of  demodulation  complexity,  ASK,  FSK,  and  DFSK  can  all  be  nonco- 
herently  detected  without  a  synchronous  carrier  at  the  receiver.  On  the  other  hand,  PSK 
must  be  coherently  detected.  Noncoherent  detection,  hov/ever,  comes  with  a  price  in  terms 
of  noise  immunity.  From  the  point  of  view  of  noise  immunity,  coherent  PSK  is  superior  to  all 
other  schemes.  PSK  also  requires  smaller  bandwidth  than  FSK  (see  Fig.  7.32).  Quantitative 
discussion  of  this  topic  can  be  found  in  Chapter  11. 


7.9  M-ARY  DIGITAL  CARRIER  MODULATION 

The  binary  digital  carrier  modulations  of  ASK,  FSK,  and  PSK  all  transmit  one  bit  of  information 
over  the  interval  of  Tf,  second,  or  at  the  bit  rate  of  1/7),  bit/s.  Similar  to  digital  baseband 
transmission,  higher  bit  rate  transmission  can  be  achieved  either  by  reducing  Tf,  or  by  applying 
M -ary  signaling;  the  first  option  requires  more  bandwidth;  the  second  requires  more  power. 
In  most  communication  systems,  bandwidth  is  strictly  limited.  Thus,  to  conserve  bandwidth, 
an  effective  way  to  increase  transmission  data  rate  is  to  generalize  binary  modulation  by 
employing  M  -ary  signaling.  Specifically,  we  can  apply  M -level  ASK,  M  -frequency  FSK,  and 
M  -phase  PSK  modulations. 

M  -ary  ASK  and  Noncoherent  Detection 

M  -ary  ASK  is  a  very  simple  generalization  of  binary  ASK.  Instead  of  sending  only 
(pit)  =  0  for  0  and  (pit)  =  A  cos  coct  for  1 
M -ary  ASK  can  send  log2  M  bits  each  time  by  transmitting,  for  example. 


(pit)  =  0,  A  cos  a>ct,  2 A  cos  coct ,  . . . ,  iM  -  1  )A  cos  (oct 


This  is  still  an  AM  signal  that  uses  M  different  amplitudes  and  a  modulation  index  of  pi  = 
1.  Its  bandwidth  remains  the  same  as  that  of  the  binary  ASK,  while  its  power  is  increased 
proportionally  with  A/2.  Its  demodulation  would  again  be  achieved  via  envelope  detection  or 
coherent  detection. 

M- ary  FSK  and  Orthogonal  Signaling 

M -FSK  is  similarly  generated  by  selecting  one  sinusoid  from  the  set  {A  cos  2nf,t, 
i  =  |  m  }  to  transmit  a  particular  pattern  of  log2  M  bits.  Generally  for  FSK,  we  can 
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design  a  frequency  increment  8f  and  let 


fm  =/t  +  (rn  -  l)8f  m=  1,2 . M 


For  this  FSK  with  equal  frequency  separation,  the  frequency  deviation  (in  analyzing  the  FM 
signal)  is 


A/  =  =  '-(M  -  1)5/ 


It  is  therefore  clear  that  the  selection  of  the  frequency  set  {/} }  determines  the  performance  and 
the  bandwidth  of  the  FSK  modulation.  If  8f  is  chosen  too  large,  then  the  M  -ary  FSK  will  use 
too  much  bandwidth.  On  the  other  hand,  if  8f  is  chosen  too  small,  then  over  the  time  interval 
ot  7),  second,  different  FSK  symbols  will  show  virtually  no  difference  and  the  receiver  will  be 
unable  to  distinguish  the  different  symbols  reliably.  Thus  large  8f  leads  to  bandwidth  waste, 
whereas  small  8f  is  prone  to  detection  error  due  to  transmission  noise  and  interference. 

The  task  ot  M  -ary  FSK  design  is  to  determine  a  small  enough  8f  that  each  FSK  sym¬ 
bol  A  cos  (Oft  is  highly  distinct  from  all  other  FSK  symbols.  One  solution  to  this  problem 
ot  FSK  signal  design  actually  can  be  found  in  the  discussion  of  orthogonal  signal  space  in 
Sec.  2.7.2.  It  we  can  design  FSK  symbols  to  be  orthogonal  in  7^  by  selecting  a  small  <5/ 

(or  A/ ),  then  the  FSK  signals  will  be  truly  distinct  over  T b,  and  the  bandwidth  consumption 
will  be  small. 

To  find  the  minimum  8f  that  leads  to  an  orthogonal  set  of  FSK  signals,  the  orthogonality 
condition  according  to  Sec.  2.7.2  requires  that 


fTh 

I  A  cos  {2nfmt)A  cos  (2jrfnt)  dt  =  0  m^n 
J  o 


(7.62) 


We  can  use  this  requirement  to  find  the  minimum  8f.  First  of  all. 


=  —t  SU1  2jT^m  A2  sin  2n(fm  -fn)Th 

'  2  b  2n(fm  +fn)Tb  +  ~2  h  -fjh 

(7.63) 


A2  sin  2n{fm  -f„)Tb 
2  2t r{fm  -/„) 


figure  7.36 

Binary  FSK 


Because/,,,  _/,+(,„_  1)5/,  for  mutual  orthogonality  we  have 


symbols  in  the 
^dimensional 


sin  [2z r(m  -  n)8jTb]  =  0  m  £  n 
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This  choice  of  minimum  frequency  separation  is  known  as  the  minimum  shift  FSK.  Since  it 
forms  an  orthogonal  set  of  symbols,  it  is  often  known  as  orthogonal  signaling. 

We  can  in  fact  describe  the  minimum  shift  FSK  geometrically  by  applying  the  concept  of 
orthonormal  basis  functions  in  Section  2.7.  Let 


=  J  —  cos  2jt 


(*+W) 


t  i  =  1,2 . M 


It  can  be  simply  verified  that 


f 

Jo 


I'miO'l'nV)  dt  = 


1  m  =  n 
0  m  ^  n 


Thus,  each  of  the  FSK  symbol  can  be  written  as 


/  T 

A  cos  2 nfmt  =  Ayj  m  =  1,  2,  . . . ,  M 


The  geometrical  relationship  of  the  two  FSK  symbols  for  M  =2  is  easily  captured  by  Fig.  7.36. 

The  demodulation  of  A7-ary  FSK  signals  follows  the  same  approach  as  the  binary  FSK 
demodulation.  Generalizing  the  binary  FSK  demodulators  of  Fig.  7.33  we  can  apply  a  bank  of 
M  coherent  or  noncoherent  detectors  to  the  M  -ary  FSK  signal  before  making  a  decision  based 
on  the  strongest  detector  branch. 

Earlier  in  the  PSD  analysis  of  baseband  modulations,  we  showed  that  the  baseband  digital 
signal  bandwidth  at  the  symbol  interval  of  7/,  can  be  approximated  by  1  /7/,.  Therefore,  for  the 
minimum  shift  FSK,  A/  =  (M  —  l)/(47/,),  and  its  bandwidth  according  to  Carson’s  rule  is 
approximately 


M  -3 

2(4/+8)=_ 

In  fact,  it  can  be  in  general  shown  that  the  bandwidth  of  an  orthogonal  M  -ary  scheme 
is  M  times  that  of  the  binary  scheme  [see  Sec.  11.7,  Eq.  (11.123)].  Therefore,  in  an  M  -ary 
orthogonal  scheme,  the  rate  of  communication  increases  by  a  factor  of  log2  M  at  the  cost  of 
M-fold  transmission  bandwidth  increase.  For  a  comparable  noise  immunity,  the  transmitted 
power  is  practically  independent  of  M  in  the  orthogonal  scheme.  Therefore,  unlike  M  -ary  ASK, 
M -ary  FSK  does  not  require  more  transmission  power.  However,  its  bandwidth  requirement 
increases  almost  linearly  with  M  (compared  with  binary  FSK  or  M -ary  ASK). 


figure  7.36 

Binary  FSK 
symbols  in  the 
^^dimensional 
orthogonal 
s,9nal  space. 


Mt) 


♦ 


0 
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Figure  7.37 

M -ary  PSK 
symbols  in  the 
orthogonal 
signal  space: 
(a)  M  =  2; 

(bj  M  =  4; 

(c)  M  =  8. 


M  -ary  PSK,  PAM,  and  QAM 

By  making  a  small  modification  to  Eq.  (7.61),  PSK  signals  in  general  can  be  written  into  the 
format  of 


^psk  (0  =  am 


coct 


0  <t  <Th  (7.64a) 


in  which  am  =  ,4  cos  0m  and  bm  =  —A  sin  6m.  In  fact,  based  on  the  analysis  in  Sec.  2.7, 
y/T/Tfj  cos  coct  and  \/2/7/,  sin  a)ct  are  orthogonal  to  each  other.  Furthermore,  they  are  nor¬ 
malized  over  [0,  7),].  As  a  result,  we  can  represent  all  PSK  symbols  in  a  two-dimensional 
signal  space  with  basis  functions 


such  that 


<P?sk  (0  =  (0  +  bmfc(t)  (7.64b) 

We  can  geometrically  illustrate  the  relationship  of  the  PSK  symbols  in  the  signal  space 
(  tfr-  -  )•  Equation  (7.64)  means  that  PSK  modulations  can  be  represented  as  QAM  sig- 
na  n  tact,  because  the  signal  is  PSK,  the  signal  points  must  meet  a  special  requirement 


am  +  —  A~  cos~  6m  +  (—A)2  sin2  0n 

2 

=  A  =  constant 


(7.64c) 


In  other  words,  all  the  signal  points  must  stay  on  acircle  of  radius  A.  In  practice,  all  the  signal 
•  S^e  C  osen  to  e  equally  spaced  in  the  interest  of  obtaining  the  best  immunity  against 
n°  Se'  Therefore’  ^  A/-ary  PSK  signaling,  the  angles  are  typically  chosen  uniformly  as 


2tt 


0m-6o+  ~{m-  1)  m  =  1,  2,  ....  M 

:''LTC'a!  PSK  *'fnaling  wi,h  M  =  4  is  «n  extremely  popular  and  powerful  digi¬ 
tal  modulation  format  *  It  ,n  fact  is  a  summation  of  two  binary  PSK  signals,  one  using  d* 


*  QPSK  has  several  effective  variations  including  the  offset  QPSK. 
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(in-phase)  earner  of  cos  wct  while  the  other  uses  the  (quadrature)  carrier  of  sin  (oct  of  the  same 
frequency.  For  this  reason,  it  is  also  known  as  quadrature  PSK  (QPSK).  We  can  transmit  and 
receive  both  of  these  signals  on  the  same  channel,  thus  doubling  the  transmission  rate. 

To  further  generalize  the  PSK  to  achieve  higher  data  rate,  we  can  see  that  the  PSK  repre¬ 
sentation  of  Eq.  (7.64)  is  a  special  case  of  the  quadrature  amplitude  modulation  (QAM)  signal 
discussed  in  in  Chapter  4  (Fig.  4. 19).  The  only  difference  lies  in  the  requirement  by  PSK  that 
the  modulated  signal  have  a  constant  magnitude  (modulus)  A.  In  fact,  the  much  more  flexible 
and  general  QAM  signaling  format  can  be  conveniently  used  for  digital  modulation  as  well. 
The  signal  transmitted  by  an  M  -ary  QAM  system  can  be  written  as 


Pi(t)  =  a,p(t)  cos  (oct  -|-  bip(t)  sin  coct 

=  npit)  cos  ( (oct  -0j)  i  =  1,2 . M 


where 


Oj  =  tan  1  — 


and 


(7.65) 


and p(t)  is  a  properly  shaped  baseband  pulse.  The  simplest  choice  of p{!)  would  be  a  rectangular 
pulse 


Certainly,  better  pulses  can  also  be  applied  conserve  bandwidth. 

Figure  7.38a  shows  the  QAM  modulator  and  demodulator.  Each  of  the  two  signals  m\(t) 
and  m2(t)  is  a  baseband  s/Af-ary  pulse  sequence.  The  two  signals  are  modulated  by  two 
carriers  of  the  same  frequency  but  in  phase  quadrature.  The  digital  QAM  signal  p,(/)  can  be 
generated  by  means  of  QAM  by  letting  m\(t)  =  «//>(/)  and  m2(f)  =  b,p{t).  Both  m\ (/)  and 
m2(t)  are  baseband  PAM  signals.  The  eye  diagram  of  the  QAM  signal  consists  of  the  in-phase 
component  m\  (r)  and  the  quadrature  component  m2(t).  Both  exhibit  the  M -ary  baseband  PAM 
eye  diagram,  as  discussed  earlier  in  Sec.  7.6. 

The  geometrical  representation  of  M  -ary  QAM  can  be  extended  from  the  PSK  signal 
space  by  simply  removing  the  constant  modulus  constraint  Eq.  (7.64c).  One  very  popular  and 
practical  choice  of  r,  and  0;  for  M  =  16  is  shown  graphically  in  Fig.  7.38b.  The  trans¬ 
mitted  pulse  pi(t)  can  take  on  16  distinct  forms,  and  is,  therefore,  a  16-ary  pulse.  Since 
M  =  16,  each  pulse  can  transmit  the  information  of  log2  16  =  4  binary  digits.  This  can 
be  done  as  follows:  there  are  16  possible  sequences  of  four  binary  digits  and  there  are  16 
combinations  (a„  bj)  in  Fig.  7.38b.  Thus,  every  possible  four-bit  sequence  is  transmitted  by 
a  particular  (a„  bj)  or  (r„  0,).  Therefore,  one  signal  pulse  np(t)  cos  (coct  -  0,  )  transmits  four 
bits.  Compared  with  binary  PSK  (or  BPSK),  the  16-ary  QAM  bit  rate  is  quadrupled  without 
increasing  the  bandwidth.  The  transmission  rate  can  be  increased  further  by  increasing  the 
value  of  M . 

Modulation  as  well  as  demodulation  can  be  performed  by  using  the  system  in  Fig.  7.38a. 
The  inputs  are  m  i  (0  =  a,p(t)  dndm2(t)  =  btp(t).  The  two  outputs  at  the  demodulator  are  a,p(t) 
and  bip(t).  From  knowledge  of  (a„  bj),  we  can  determine  the  four  transmitted  bits.  Further 
analysis  of  1 6-ary  QAM  on  a  noisy  channel  is  carried  out  in  Sec.  1 1 .6  (Eq.  ( 1 1 . 1 04)].  The  practi¬ 
cal  value  of  this  1 6-ary  QAM  signaling  becomes  fully  evident  when  we  consider  its  broad  range 
of  applications.  In  fact,  16-QAM  is  used  in  the  V.32  telephone  data/fax  modems  (9600  bit/s), 
in  high-speed  cable  modems,  and  in  modem  satellite  digital  television  broadcasting. 
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Figure  7.38 

(a)  QAM  or 
quadrature 
multiplexing  and 

(b)  16-point 
QAM  (M  =  16). 


(b) 


Note  that  1 1  we  disable  the  data  stream  that  modulates  sin  coc t  in  QAM,  then  all  the  signaling 
points  can  be  reduced  to  a  single  dimension.  Upon  setting  m2(t)  =  0,  QAM  becomes 

Pi(t)  =  ciip(t)  cos  wct,  t  e  [0,  Tb] 

This  degenerates  into  the  pulse  amplitude  modulation  or  PAM.  Comparison  of  the  signal 

of  rtTn«R °  Wlt,h  ',he  anal°g  DSBSC  si8nal  makes  it  clear  that  PAM  is  the  digital  version 
y  ,  "  Sl*nd  *  ust  as  ana,°g  QAM  is  tormed  by  the  superposition  of  two  DSB-SC 

^m  Xf‘  a',IO,IS  ,ln  pta“  <|uadra,ure-  OAM  consists  of  two  PAM  signals,  each 

GAM  paS  'I*  relationship  between  analog  DSB-SC  and 

kss^M  ™  Ke<l“'reS  T.'  am0'1",  °f  ba"dwid,h  “  QAM  does.  However  PAM  is  much 
OA^“T  XT*  r* M  m0d'"Mi0n  **“»•* leTCl*  «*  dimension,  whereas 
QAM  requires  only  vhW  signaling  levels  in  each  of  the  Iwo  onhogonal  QAM  dimensions. 

Trading  Power  and  Bandwidth 

In  Chapter  11  we  shall  discuss  several  other  types  of  M- ary  signaling.  The  nature  of  the 

on^hechoiceof M-'a  C  tra"sm,s‘p0n  bandwid‘b  and  the  transmitted  power  (or  SNR)  depends 
practically  indeoenfl  °[ example; in  orthogonal  signaling,  the  transmitted  poweris 

P  Cn  °  M  but  tbe  transmission  bandwidth  increases  with  M .  Contrast  this 
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to  the  PAM  case,  where  the  transmitted  power  increases  roughly  with  M 2  while  the  bandwidth 
remains  constant.  Thus,  M  -ary  signaling  allows  us  great  flexibility  in  trading  signal  power 
(or  SNR)  for  transmission  bandwidth.  The  choice  of  the  appropriate  system  will  depend  upon 
the  particular  circumstances.  For  instance,  it  will  be  appropriate  to  use  QAM  signaling  if  the 
bandwidth  is  at  a  premium  (as  in  telephone  lines)  and  to  use  orthogonal  signaling  when  power 
is  at  a  premium  (as  in  space  communication). 


7. 1 0  MATLAB  EXERCISES 


In  this  section,  we  provide  MATLAB  programs  to  generate  the  eye  diagrams.  The  first  step  is 
to  specify  the  basic  pulse  shapes  in  PAM.  The  next  four  short  programs  are  used  to  generate 
NRZ,  RZ,  half-sinusoid,  and  raised-cosine  pulses. 


%  (pnrz.m) 

%  generating  a  rectangular  pulse  of  width  T 

%  Usage  function  pout=pnrz (T) ; 

function  pout=prect (T) ; 

pout=ones (1, T) ; 

end 


%  (prz.m) 

%  generating  a  rectangular  pulse  of  width  T/2 
%  Usage  function  pout=prz(T); 
function  pout=prz (T) ; 

pout= [zeros ( 1 , T/4 )  ones(l,T/2)  zeros (1 , T/4 )] ; 
end 


%  (psine.m) 

%  generating  a  sinusoid  pulse  of  width  T 
% 

function  pout=psine (T) ; 
pout  =  s in (pi*  [0 :T-1] /T)  ; 
end 


%  (prcos.m) 

%  Usage  y=prcos (rollfac, length,  T) 
function  y=prcos (rollfac, length,  T) 

%  rollfac  =  0  to  1  is  the  rolloff  factor 

%  length  is  the  onesided  pulse  length  in  the  number  of  T 
%  length  =  2T+1; 

%  T  is  the  oversampling  rate 

y=rcosfir ( rollfac ,  length,  T,l,  'normal  ); 

end 
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The  first  program  (binary  eye  .m)  uses  the  four  different  pulses  to  generate  eye 
diagrams  of  binary  polar  signaling. 


%  (binary_eye.m) 

%  generate  and  plot  eyediagrams 
% 

clear ;clf ; 

data  =  sign (randn (1,400)  )  ;  %  Generate  400  random  bits 


Tau=64 ; 


%  Define  the  symbol  period 


dataup-upsample (data,  Tau) ;  %  Generate  impulse  train 
yrz“conv (dataup , prz (Tau) ) ;  %  Return  to  zero  polar  signal 

yrz=yrz ( 1 : end-Tau+1 ) ; 

ynrz=conv (dataup, pnrz (Tau) ) ;  %  Non-return  to  zero  polar 
ynrz=ynrz (1 : end-Tau+1) ; 

ysine=conv (dataup, psine (Tau));  %  half  sinusoid  polar 

ysine=ysine(l: end-Tau+1) ; 


Td=4  ; 


%  truncating  raised  cosine  to  4  periods 


yrcos=conv (dataup, prcos (0.5,Td,Tau) ) ;  %  rolloff  factor  =  0.5 

VrcOS=Vrcos(2*TH*Taii.on^_o*^  +  q, *  0 


The  second  program  (Mary_eye.m)  uses  the  four  different  pulses  to  generate  eye 
diagrams  of  four-level  PAM  signaling. 


%  (Mary_eye.m) 

%  generate  and  plot  eyediagrams 
% 

% 

clear;clf ; 
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PROBLEMS 


7.2- 1  Consider  a  pulse  shape 


p\(t)  =  A(t/Th) 


(a)  Find  PSDs  for  the  polar,  on-off,  and  bipolar  signaling. 

(b)  Sketch  roughly  the  PSDs  and  compare  their  bandwidths. 

7.2-2  If  the  pulse  shape  is 

P2(t)=  n  (o^) 


(a)  Find  PSDs  for  the  polar,  on-off.  and  bipolar  signaling. 

(b)  Sketch  roughly  these  PSDs  and  find  their  bandwidths.  For  each  case,  compare  the  bandwidth 
to  the  PSD  obtained  in  Prob.  7.2-1. 


7.2-3  If  the  pulse  shape  is 


pjU)  = 


\t\  <  Tb/2 
\t\  >  Tb/2 


(a)  Find  PSDs  for  the  polar,  on-off,  and  bipolar  signaling. 

(b)  Sketch  roughly  these  PSDs  and  find  their  bandwidths.  For  each  case,  compare  the  bandwidth 
to  the  PSD  obtained  in  Prob.  7.2-1. 

7.2-4  (a)  A  random  binary  data  sequence  110100101  is  transmitted  by  means  of  a  Manchester 
(split-phase)  line  code  with  the  pulse  p(t)  shown  in  Fig.  7.7a.  Sketch  the  waveform  y(r). 

(b)  Derive  Sy(f),  the  PSD  of  a  Manchester  (split-phase)  signal  in  part  (a)  assuming  1  and  0  to 
be  equally  likely.  Roughly  sketch  this  PSD  and  find  its  bandwidth. 
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7.2- 5  Use  differential  code  with  the  pulse  of  Prob.  7.2-3  to  derive  the  PSD  for  a  binary  signal  and 

determine  the  PSD  Sy(f). 

7.2- 6  The  duobinary  line  coding  proposed  by  Lender  is  also  ternary  like  bipolar,  but  requires  only 

half  the  bandwidth  of  bipolar.  In  practice,  duobinary  coding  is  indirectly  realized  by  using  a 
special  pulse  shape  as  discussed  in  Sec.  7.3  (see  Fig.  7.18).  In  this  code,  a  0  is  transmitted  by 
no  pulse,  and  a  1  is  transmitted  by  a  pulse  /?(/)  or  - p(t )  according  to  the  following  rule.  A  1 
is  encoded  by  the  same  pulse  as  that  used  for  the  previous  1  if  there  are  even  number  of  Os 
between  them.  It  is  encoded  by  a  pulse  of  opposite  polarity  if  there  are  an  odd  number  of  Os 
between  them.  A  number  0  is  considered  to  be  an  even  number.  Like  bipolar,  this  code  also 
has  single-error-detection  capability,  because  correct  reception  implies  that  between  successive 
pulses  of  the  same  polarity,  an  even  number  of  Os  must  occur,  and  between  successive  pulses  of 
opposite  polarity,  an  odd  number  of  Os  must  occur. 

(a)  Using  a  half-cosine  pulse  of  Prob.  7.2-3,  sketch  the  duobinary  signal  v(0  for  the  random 
binary  sequence 

010100110111010  •• 

(b)  Determine  Rq,  R j,  and  /?2  for  this  code.  Assume  (or  you  may  show)  that  Rn  =  0  for  all 
n  >  2.  Find  and  sketch  the  PSD  for  this  line  code  (assuming  half-width  pulse).  Show  that 
its  bandwidth  is  Rb/2  Hz,  independent  of  the  pulse  shape. 


7.3- 1  A  binary  data  stream  needs  to  be  transmitted  at  5  Mbit/s  by  means  of  binary  signaling.  To  reduce 
ISI,  a  cosine  roll-off  pulse  of  roll-off  factor  r  =  0.25  will  be  used.  Determine  the  minimum 
required  bandwidth  for  this  transmission. 


7.3-2  Repeat  Prob.  6.2-9  if  Nyquist’s  criterion  pulses  with  roll-off  factor  r  =  0.2  are  used. 


3  Repeat  Prob.  7.3-2  if  M  4  pulse  levels  are  transmitted  such  that  each  transmission  of  a  pulse 
with  a  distinct  level  represents  2  bits.  Generalize  the  results  when  M  =  2m  pulse  levels  are  used 
in  transmission. 


7.3-4  Consider  the  PSD  of  the  on-off  signaling  in  Eq.  (7.19b).  This  PSD  may  contain  impulses  at 
frequencies 


/  =  - 
n 


Since  these  frequency  impulses  do  not  carry  message  information,  it  is  more  power  efficient  to 
eliminate  them  in  transmission. 


(a)  Determine  the  conditions  needed  for  the  pulse-shaping  filter  P(f)  to  eliminate  all  impulses 


domain hC  L°nd','°ns  ol  part  (a)  tor  find  the  equivalent  conditions  for  p(t )  in  time 


<C>  fromTvo!0".-"^ aPt  <b>  iS  a  dUa' t0  thC  Pr°blem  of  Nyquist’s  first  criterion.  Use  the  results 

t  elnT  S  7?"°"  ‘°  de'ennine  3  Cl3SS  of  P (0  in  the  time  domain  that  can  nullify 

tne  impulses  m  part  (a). 


”'5  ISKr,*?"  'y,“ !h°W" i,F* 7  13sa.isflc.he  N, quisle*. do..  If/,  -  *» 

via  the  Nvauist  Crit^  *  e!?I7nine  the  rale  at  which  binary  data  can  be  transmitted  by  this  pulse 
via  tne  Nyquist  criterion.  What  is  the  roll-off  factor? 


Figure  P.7.3-6 


Figure  P.7.3-7 
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7.3-6  The  Fourier  transform  P(f)  of  the  basic  pulse  p(t)  used  in  a  certain  binary  communication 
system  is  shown  in  Fig.  P7.3-6. 

(a)  From  the  shape  of  P(f),  explain  at  what  pulse  rate  this  pulse  would  satisfy  Nyquist’s  first 
criterion. 

(b)  Find  p(t)  and  verify  that  whether  or  not  this  pulse  satisfies  Nyquist  first  criterion. 

(c)  Based  on  part  (a),  show  what  the  roll-off  factor  is. 

(d)  Show  how  rapidly  the  pulse  decays  as  a  function  of  t. 


7.3-7  Four-level  transmission  at  the  pulse  rate  of  2  Mbit/s  is  to  be  transmitted  by  means  of  Nyquist 
first  criterion  pulses  with  P(f)  shown  in  Fig.  PI. 3-1.  The  frequencies  f\  and  fi  (in  Hz)  of  this 
spectrum  are  adjustable.  The  channel  available  for  transmission  of  this  data  has  a  bandwidth  of 
650  kHz.  Determine  f\  and/2,  an^  the  roll-off  factor. 


7.3-8  (a)  Show  that  the  inverse  Fourier  transform  of  P(f)  (the  raised-cosine  pulse  spectrum  in 
Eq.  (7.35)  is  the  pulse  p(t)  given  in  Eq  (7.36). 

(b)  Show  that  the  resulting  time  domain  pulse  decays  at  the  rate  of  1/f3. 

Hint:  Use  the  time-shifting  property  of  the  Fourier  transform  and  Eq.  (7.35). 


7.3- 9  Show  that  the  inverse  Fourier  transform  of  P(f)  in  Eq.  (7.39)  is  indeed  the  second  Nyquist 

criterion  pulse  p(t)  given  in  Eq.  (7.38). 

Hint:  Use  Eq.  (3.32)  to  find  the  inverse  transform  of  P(f)  in  Eq.  (7.39)  and  express  sine  ( x )  in 
the  form  sin  x/x. 

7.3- 10  Show  that  there  exists  one  (and  only  one)  pulse  p(t)  ot  bandwidth  Rb/2  Hz  that  satisfies  the 

criterion  for  a  second  criterion  pulse  [Eq.  (7.37)].  Show  that  this  pulse  is  given  by 


p(t)  =  {sinc(;r/V)  +  sinc[jr/?/,(/  -  Tb)\\  = 


sin  (7 zRbt) 
nRbt(\  —  Rbt) 


and  its  Fourier  transform  is  P(f)  given  in  Eq.  (7.39). 
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Hint:  For  a  pulse  of  bandwidth  /fy/2,  the  Nyquist  interval  is  1  /Rh  =  Th,  and  conditions  (7.37) 


Figure  P.7.4-3 


give  the  Nyquist  sample  values  at  /  =  ±nTh.  Use  the  interpolation  formula  [Eq.  (6.10)]  with 

B  =  Rh/ 2,  Ts  =  Tfj  to  construct  p(t).  In  determining  P(f ),  recognize  that  (1  +  e^2^)  - 
e-jxfTb^nJTb  +e-jxJTb  2) 

7.3-11  In  a  duobinary  transmission,  sample  values  of  the  received  pulses  were  read  as  follows: 


—  1020020  — 200220  — 20220  — 2 


(a)  Explain  whether  there  is  any  error  in  detection. 

(b)  Decode  the  data  by  looking  for  a  sequence  with  the  fewest  possible  detection  errors. 

7.3-12  In  a  duobinary  data  transmission  using  differential  encoding,  the  binary  data  sequence  is  as 
follows: 


10010001011101101 


(a)  Specify  the  differential  encoder  output  sequence. 

(b)  Specify  the  sampled  duobinary  transmission  signal. 

(c)  Determine  the  detection  rule  and  confirm  the  correct  output  sequence. 

7.3-13  (a)  For  a  modified  duobinary  transmission,  design  the  corresponding  differential  encoder  to 
eliminate  error  propagation  at  detection. 


(b)  Repeat  Prob.  7.3-12  for  the  modified  duobinary  system  in  part  (a). 

Consider  the  scrambler  and  the  descrambler  in  Fig.  7.20. 

(a)  Find  the  scrambler  output  T  when  the  input  sequence  is  S  =  101110001101001. 

(b)  Confirm  that  the  descrambler  output  recovers  the  scrambler  input  5  when  the  output  T  in 

rvirt  l<il  ic  _ li  • 


part  (a)  is  the  desrambler  input. 


Figure  P.7.4-2 


5 


T 


7.4-3  Repeat  Prob.  7.4-2  if  the  scrambler  shown  in 
Figure  P7.4-2  to  form  a  composite  scrambler. 


in  Fig.  P7.4-3  is  concatenated  with  the  scrambler  in 
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Figure  P.7.4-3  - 

v  r~ 

s-h— q-^5 


7.5- 1  In  a  certain  binary  communication  system  that  uses  (the  first)  Nyquist  criterion  pulses,  a  received 

pulse  pr(t)  (Fig.  7.22a)  has  the  following  nonzero  sample  values: 

M  0)  =  1 

Pr(Tb)  =  -0.1  Pr(-Tb)  =  0.6  =  -0.07 

(a)  Determine  the  tap  settings  of  a  three-tap,  zero-forcing  equalizer. 

(b)  Using  the  equalizer  in  part  (a),  find  the  residual  nonzero  1SI. 

7.5- 2  For  the  same  communication  system  of  Prob.  7.5- 1 ,  do  the  following. 

(a)  Design  a  three-tap  MMSE  equalizer:  (Note  that  K  should  be  chosen  as  large  as  possible  for 
which  p0[±K\  is  nonzero.) 

(b)  Determine  the  resulting  MSE  of  the  three-tap  MMSE  equalizer; 

(c)  Compare  the  MSE  of  part  (b)  with  the  resulting  MSE  of  the  three-tap  ZF  equalizer  used  in 
Prob.  7.5-1. 

7.5- 3  For  the  same  communication  system  of  Prob.  7.5- 1 , 

(a)  Design  a  five-tap  ZF  equalizer. 

(b)  Design  a  five-tap  MMSE  equalizer. 

(c)  Determine  the  resulting  MSE  of  the  five-tap  MMSE  equalizer. 

(d)  Determine  the  resulting  MSE  of  the  five-tap  ZF  equalizer. 

(e)  Compare  the  resulting  MSE  with  the  results  in  Prob.  7.5-2. 

7.5- 4  Repeat  Prob.  7.5-2  for  a  system  with  the  following  nonzero  sample  values: 

M  0)  =  1 

Pr(Tb)  =  —0.1  Pr(-Tb)  =  0.6 

Pr(2Tb)  =  -0.2  pr(—2Tb)  =  0. 1 7 

7.5- 5  Repeat  Prob.  7.5-3  for  a  system  with  the  following  nonzero  sample  values: 


M0)  =  1 

Pr(Tb )  =  -0. 1  Pr(-Tb)  =  0.6 

pr(2Tb)  =  —0.2  pr(—2Tb)  =  0.17 


PRINCIPLES  OF  DIGITAL  DATA  TRANSMISSION 


7.6-1  For  binary  signaling  of  rate  1/7),,  sketch  the  eye  diagram  when  the  following  pulse  shape  is 
applied: 


P(t)  =  n  ( — — ) 

V  37^/4/ 

and  the  line  code  is  (a)  polar;  (b)  on/off;  (c)  bipolar;  (d)  duobinary. 

7.6- 2  Repeat  Prob. 7.6-1  when  the  following  pulse  shape  is  applied: 

p(0  =  Afe) 

7.6- 3  Repeat  Prob. 7. 6-1  when  the  following  pulse  shape  is  applied: 


p(t)  =  U  [  1  +  COS  (2nt/Tb)\  If  I  <  Tb/2 
1°  |/|  >  Tb/ 2 


7.6-4  Repeat  Prob.7.6- 1  when  the  following  pulse  shape  is  applied: 

1 


PU)  = 


2  (1  +cos  (nt/Tb)] 
0 


I'l  <  Tb 

I'l  >  Tb 


7.7-1  In  a  PAM  scheme  with  M  =  16,  do  the  following. 

(a)  Sketch  the  eye  diagram  for  a  transmission  pulse  shape  of 


Pit)  = 


^[l+cosOrf/7*)]  |/|  <  Tb 

°  I'l  >  Tb 


(b)  Show  whether  the  pulse  shape  satisfies  the  first  Nyquist  criterion  of  zero  ISI. 

7.7-2  In  a  PAM  scheme  with  M  =  1 6,  do  the  following. 

n  ivuT!!16/  ^'C  !lm'lmUm  Iransnl'ss'°n  bandwidth  required  to  transmit  data  at  a  rate  of 
1 2.000  bit/s  with  zero  ISI. 

(b)  Determine  the  transmission  bandwidth  if  Nyquist  criterion  pulses  with  a  roll-off  factor 
r  =  0.2  are  used  to  transmit  data. 

11  3  Quantized  "tiT'  °'  bandwidth  12  kHz  is  sampled  at  a  rate  25%  above  the  Nyquist  rate  and 
SSLS  T  T"  error  »  not  ,0  exceed  0.5%  of  the  signal  peak  amplitude.  The 
result, ng  quant, zed  samples  are  now  coded  and  transmitted  by  4-ary  pulsed 

(a)  Determine  the  minimum  number  of  4-ary  pulses  required  to  encode  each  sample. 

<  b>  Determ, ne  the  mmimum  transmission  bandwidth  required  to  transmit  this  data  with  zero  ISI- 

<cl  SZ.“ if  Nwist pul“5  wi,h  25,1 

7  M  °,er  ■  “"ai"  cto™'  *' »  —  *»  bi*.  To  reduce  .he 

bandwidth,  „  ,s  deeded  to  transmit  this  data  using  16-ary  PAM  signaling. 
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(a)  By  what  factor  is  the  bandwidth  reduced? 

(b)  By  what  factor  is  the  transmitted  power  increased,  assuming  minimum  separation  between 
pulse  amplitudes  to  be  the  same  in  both  cases? 

Hint:  Take  the  pulse  amplitudes  to  be  ±4/2,  ±34/2,  ±54/2,  ±74/2 . ±154/2,  so  that 

the  minimum  separation  between  various  amplitude  levels  is  4  (same  as  that  in  the  binary  case 
pulses  ±4/2).  Assume  all  the  16  levels  to  be  equally  likely.  Recall  also  that  multiplying  a  pulse 
by  a  constant  k  increases  its  energy  k2- fold. 

7.7-5  Consider  a  case  of  binary  transmission  using  polar  signaling  that  uses  half-width  rectangular 
pulses  of  amplitudes  4/2  and  —4/2.  The  data  rate  is  R/,  bit/s. 

(a)  What  is  the  minimum  transmission  bandwidth  and  the  transmitted  power? 

(b)  This  data  is  to  be  transmitted  by  M  -ary  rectangular  half-width  pulses  of  amplitudes 


Note  that  to  maintain  about  the  same  noise  immunity,  the  minimum  pulse  amplitude  separation 
is  4.  If  each  of  the  M  -ary  pulses  is  equally  likely  to  occur,  show  that  the  transmitted  power  is 

p  =  (A/2  -  \W 
24  log2  M 

Also  determine  the  transmission  bandwidth. 

7.7- 6  An  audio  signal  of  bandwidth  1 5  kHz  is  sampled  at  a  rate  of  44. 1  kHz,  quantized  into  256  levels, 

and  coded  by  means  of  M-ary  PAM  pulses  satisfying  Nyquist’s  criterion  with  a  roll-off  factor 
r  =  0.25.  A  30  kHz  bandwidth  is  available  to  transmit  the  data.  Determine  the  best  value  of  M . 

7.8- 1  Figure  P7.8-1  shows  a  binary  data  transmission  scheme.  The  baseband  signal  generator  uses 

full-width  pulses  and  polar  signaling.  The  data  rate  is  1  Mbit/s. 

(a)  If  the  modulator  generates  a  PSK  signal,  what  is  the  bandwidth  of  the  modulated  output? 

(b)  If  the  modulator  generates  FSK  with  the  difference  fc\  -  f( o  =  100  kHz  (Fig.  7.32c), 
determine  the  modulated  signal  bandwidth. 


fi9ure  P.7.8-1 


7.8- 2  Repeat  Prob.  7.8- 1(a)  if  instead  of  full-width  pulses.  Nyquist  criterion  pulses  with  r  =  0.2  are 

used. 

7.8- 3  If  the  input  to  a  binary  differential  PSK  modulation  system  is  0  1  1  0  0  1  0  1  0  0  0  1  1  1,  tabulate 

the  following: 

(a)  differential  encoder  output 

(b)  modulated  phase  value  0* 
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(c)  modulated  phase  difference  0*  —  j 

(d)  decision  rule  of  the  detector 

7.9-1  We  now  replace  the  modulator  in  Figure  P7.8- 1  with  an  M  -ary  baseband  modulation  scheme.  The 
baseband  signal  generator  uses  full-width  pulses  and  polar  signaling.  The  data  rate  is  1  Mbit/s. 

(a)  If  the  modulator  generates  a  PSK  signal,  what  is  the  bandwidth  of  the  modulated  output? 

(b)  If  the  modulator  generates  a  PAM  signal,  what  is  the  bandwidth  of  the  modulated  output? 

(c)  If  the  modulator  generates  a  QAM  signal,  what  is  the  bandwidth  of  the  modulated  output? 

(d)  If  the  modulator  generates  FSK  with  minimum  frequency  separation  of  1  (X)  kHz  (Fig.  7.32c), 
what  is  the  modulated  signal  bandwidth? 


8  FUNDAMENTALS  OF 
PROBABILITY  THEORY 


Thus  far,  we  have  been  studying  signals  whose  values  at  any  instant  t  are  determined 
by  their  analytical  or  graphical  description.  These  are  called  deterministic  signals, 
implying  complete  certainty  about  their  values  at  any  moment  t.  Such  signals,  which 
can  be  specified  with  certainty,  cannot  convey  information.  It  will  be  seen  in  Chapter  14  that 
information  is  inherently  related  to  uncertainty.  The  higher  the  uncertainty  about  a  signal  (or 
message)  to  be  received,  the  higher  its  information  content.  If  a  message  to  be  received  is 
specified  (i.e.,  if  it  is  known  beforehand),  then  it  contains  no  uncertainty  and  conveys  no  new 
information  to  the  receiver.  Hence,  signals  that  convey  information  must  be  unpredictable. 
In  addition  to  information-bearing  signals,  noise  signals  that  perturb  information  signals  in  a 
system  are  also  unpredictable  (otherwise  they  can  simply  be  subtracted).  These  unpredictable 
message  signals  and  noise  waveforms  are  examples  of  random  processes  that  play  key  roles 
in  communication  systems  and  their  analysis. 

Random  phenomena  arise  either  because  of  our  partial  ignorance  of  the  generating  mech¬ 
anism  (as  in  message  or  noise  signals)  or  because  the  laws  governing  the  phenomena  may 
be  fundamentally  random  (as  in  quantum  mechanics).  Yet  in  another  situation,  such  as  the 
outcome  of  rolling  a  die,  it  is  possible  to  predict  the  outcome  provided  we  know  exactly  all 
the  conditions:  the  angle  of  the  throw,  the  nature  of  the  surface  on  which  it  is  thrown,  the 
force  imparted  by  the  player,  and  so  on.  The  exact  analysis,  however,  is  so  complex  and  so 
sensitive  to  all  the  conditions  that  it  is  impractical  to  carry  it  out,  and  we  are  content  to  accept 
the  outcome  prediction  on  an  average  basis.  Here  the  random  phenomenon  arises  from  our 
unwillingness  to  carry  out  the  exact  and  full  analysis  because  it  is  impractical  to  amass  all  the 
conditions  precisely  or  not  worth  the  effort. 

We  shall  begin  with  a  review  of  the  basic  concepts  of  the  theory  of  probability,  which 
forms  the  basis  for  describing  random  processes. 


CONCEPT  OF  PROBABILITY 

To  begin  the  discussion  of  probability,  we  must  define  some  basic  elements  and  important  terms. 
The  term  experiment  is  used  in  probability  theory  to  describe  a  process  whose  outcome  cannot 
be  fully  predicted  because  the  conditions  under  which  it  is  performed  cannot  be  predetermined 
with  sufficient  accuracy  and  completeness.  Tossing  a  coin,  rolling  a  die,  and  drawing  a  card 
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Figure  8.2 

Representation  of 

(a)  complement, 

(b)  union,  and 

(c)  intersection  of 
events. 


from  a  deck  are  some  examples  of  such  experiments.  An  experiment  may  have  several  sepa¬ 
rately  identifiable  outcomes.  For  example,  rolling  a  die  has  six  possible  identifiable  outcomes 
( 1 , 2,  3, 4, 5,  and  6).  An  event  is  a  subset  of  outcomes  that  share  some  common  characteristics. 
An  event  occurs  if  the  outcome  of  the  experiment  belongs  to  the  specific  subset  of  outcomes 
defining  the  event.  In  the  experiment  of  rolling  a  die,  for  example,  the  event  “odd  number  on  a 
throw"  can  result  from  any  one  of  three  outcomes  (viz.,  1,  3,  and  5).  Hence,  this  event  is  a  set 
consisting  of  three  outcomes  (1,3,  and  5).  Thus,  events  are  groupings  of  outcomes  into  classes 
among  which  we  choose  to  distinguish.  The  ideas  of  experiment,  outcomes,  and  events  form 
the  basic  foundation  of  probability  theory.  These  ideas  can  be  better  understood  by  using  the 
concepts  of  set  theory. 

We  define  the  sample  space  S  as  a  collection  of  all  possible  and  separately  identifiable 
outcomes  of  an  experiment.  In  other  words,  the  sample  space  S  specifies  the  experiment.  Each 
outcome  is  an  element,  or  sample  point,  of  this  space  S  and  can  be  conveniently  represented 
by  a  point  in  the  sample  space.  In  the  experiment  of  rolling  a  die,  for  example,  the  sample  space 
consists  of  six  elements  represented  by  six  sample  points  fa  fa  ft,  £1,  £5,  and  fa  where  ft 
represents  the  outcome  “a  number  i  is  thrown”  (Fig.  8.1).  The  event,  on  the  other  hand,  is  a 
subset  of  S.  The  event  “an  odd  number  is  thrown,”  denoted  by  A0 ,  is  a  subset  of  S  (or  a  set  of 
sample  points  ft ,  fa  and  fa.  Similarly,  the  event  Ae,  “an  even  number  is  thrown,”  is  another 
subset  of  S  (or  a  set  of  sample  points  fa  fa  and  fa: 

A°  =  (Ci.  ft,  ft)  Ae  =  (ft,  ft,  ft) 

Let  us  denote  the  event  “a  number  equal  to  or  less  than  4  is  thrown”  as  B.  Thus, 


B  =  (ft.  ft.  ft,  ft) 

These  events  are  clearly  marked  in  Fig.  8. 1 .  Note  that  an  outcome  can  also  be  an  event,  because 
an  outcome  is  a  subset  of  S  with  only  one  element. 

The  complement  of  any  event  A.  denoted  by  is  the  event  containing  all  points  not  in 
/l.  Thus,  for  the  event  B  in  Fig.  8.1.  ^  =  (ft,  ft),  =  Af  and  ^  =  ^  An  event  that  has 

no  sample  points  is  a  null  event,  which  is  denoted  by  0  and  is  equal  to  Sc. 

The  union  of  events  4  and  B,  denoted  by  4  U  B,  is  the  event  that  contains  all  points  in 
A  and  B.  This  is  the  event  stated  as  having  “an  outcome  of  either  Aoy  B"  For  the  events  in 
Fig.  8.1, 


Aa\JB  =  (ft,  ft,  ft,  ft,  ft) 
Ae  U  B  ~  (ft,  ft,  ft.  Cl ,  ft) 


8.1  Concept  of  Probability 
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Representation  of 

(a)  complement, 

(b)  union,  and 


(c)  intersection  of 
events. 


5 


(a) 


A\JB  =  B\JA 
(b) 


AB  and  ACB 
(c) 


Observe  that  the  union  operation  commutes: 


AVB  =  B\JA 


(8.1) 


The  intersection  of  events  A  and  B ,  denoted  by  A  OB  or  simply  by  AB ,  is  the  event  that  contains 
points  common  to  A  and  B.  This  is  the  event  that  “outcome  is  both  A  and  B ,”  also  known  as 
the  joint  event  AD  B.  Thus,  the  event  AeB ,  “a  number  that  is  even  and  equal  to  or  less  than  4 
is  thrown,”  is  a  set  (ft,  ft),  and  similarly  for  A()B, 


AoB  =  (Su  ft) 


AeB  =  (ft,  ft) 


Observe  that  the  intersection  also  commutes 


(8.2) 


AKB=BHA 


All  these  concepts  can  be  demonstrated  on  a  Venn  diagram  (Fig.  8.2).  If  the  events  A  and 
B  are  such  that 


AC\B  =  V) 


(8.3) 


then  A  and  B  are  said  to  be  disjoint,  or  mutually  exclusive,  events.  This  means  events  A  and 
B  cannot  occur  simultaneously.  In  Fig.  8.1  events  Ae  and  A()  are  mutually  exclusive,  meaning 
that  in  any  trial  of  the  experiment  if  Ae  occurs,  A0  cannot  occur  at  the  same  time,  and  vice 
versa. 

Relative  Frequency  and  Probability 

Although  the  outcome  of  an  experiment  is  unpredictable,  there  is  a  statistical  regularity  about 
the  outcomes.  For  example,  if  a  coin  is  tossed  a  large  number  of  times,  about  half  the  times 
the  outcome  will  be  “heads,”  and  the  remaining  half  of  the  times  it  will  be  “tails  ”  We  may 
say  that  the  relative  frequency  of  the  two  outcomes 4  heads  or  tails  is  one-halt.  This  relative 
frequency  represents  the  likelihood  of  a  particular  event. 

Let  A  be  one  of  the  events  of  interest  in  an  experiment.  If  we  conduct  a  sequence  of  N 
independent  trials*  of  this  experiment,  and  if  the  event  A  occurs  in  N(A)  out  of  these  N  trials, 
then  the  fraction 


f{A)  = 


(8.4) 


Trials  conducted  under  similar  discernible  conditions. 
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is  called  the  relative  frequency  of  the  event  A.  Observe  that  for  small  N,  the  fraction  N(A) / N 
may  vary  widely  with  N.  As  N  increases,  the  fraction  will  approach  a  limit  because  of  statis¬ 
tical  regularity. 

The  probability  of  an  event  has  the  same  connotations  as  the  relative  frequency  of  that 
event.  Hence,  we  estimate  the  probability  of  each  event,  as  the  relative  frequency  of  that  event.* 
Therefore,  to  an  event  A,  we  assign  the  probability  P(A)  as 


P(A)  =  lim 


(8.5) 


From  Eq.  (8.5),  it  follows  that 


0  <  P(A)  <  1 


(8.6) 


Example  8. 1  Assign  probabilities  to  each  of  the  six  outcomes  in  Fig.  8.1. 


Because  each  of  the  six  outcomes  is  equally  likely  in  a  large  number  of  independent  trials, 
each  outcome  will  appear  in  one-sixth  of  the  trials.  Hence, 

P^i)  =  g  i=  1,  2,  3,  4,  5,  6  (8.7) 


Consider  now  the  two  events  A  and  B  of  an  experiment.  Suppose  we  conduct  N  independent 
trials  ot  this  experiment  and  events  A  and  B  occur  in  N(A)  and  N(B )  trials,  respectively.  If  A 
and  B  are  mutually  exclusive  (or  disjoint),  then  if  A  occurs,  B  cannot  occur,  and  vice  versa. 
Hence,  the  event  AUfi  occurs  in  N(A)  +  N(B)  trials  and 


N(A)  +  N(B) 
N 


P(A  Ufi)=  lim 


N  — ►oo  yv 


=  P(A)  +  P(B)  if  A  n  B  =  0 


(8.8) 


This  result  can  be  extended  to  more  than  two  mutually  exclusive  events.  In  other  words,  if 


events  (A,)  are  mutually  exclusive  such  that 


Aj  n  Aj  =  0  /  ^  j 


then 
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Example  8.2  Assign  probabilities  to  the  events  Ae ,  A„,  B,  AeB ,  and  A„B  in  Fig.  8. 1 . 

Because  Ae  (£2  U  £4  U  fg)  where  £2,  £4,  and  {5  are  mutually  exclusive, 

P(Ae)  =  P(£ 2)  +  /»(&)  +  PC6) 

From  Eq.  it  follows  that 


Similarly, 

P(Ao)  =  \ 

PW  =  l 

From  Fig.  8. 1  we  also  observe  that 


AeB  =  &  U  C4 


and 


P{AeB)  =  P(( 2)  +  P(( 4)  =  j 


Similarly, 


1 


P(AoB)  =  - 


(8.9a) 


(8.9b) 

(8.9c) 


(8.10a) 


(8.10b) 


We  can  also  show  that 


P(S)  =  1  (8.11) 

This  result  can  be  proved  by  using  the  relative  frequency.  Let  an  experiment  be  repeated  N 
times  (N  large).  Because  S  is  the  union  of  all  possible  outcomes,  S  occurs  in  every  trial.  Hence, 
N  out  of  N  trials  lead  to  event  S,  and  the  result  follows. 


Example  8.3  Two  dice  are  thrown.  Determine  the  probability  that  the  sum  on  the  dice  is  seven. 

I  For  this  experiment,  the  sample  space  contains  36  sample  points  because  36  possible 
outcomes  exist.  All  the  outcomes  are  equally  likely.  Hence,  the  probability  of  each  outcome 
is  1/36. 
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A  sum  of  seven  can  be  obtained  by  the  six  combinations:  (1,6).  (2,  5),  (3,  4),  (4,  3) 
(5,  2),  and  (6,  1 ).  Hence,  the  event  “a  seven  is  thrown"  is  the  union  of  six  outcomes,  each 
with  probability  1/36.  Therefore, 


_  1  1  1  1  I  1 

P(  a  seven  is  thrown  )  = - 1 - 1 - 1 - 1 - 1 - = 

36  36  36  36  36  36 


1 

6 


Example  8.4 


A  coin  is  tossed  four  times  in  succession.  Determine  the  probability  of  obtaining  exactly  two 
heads. 


A  total  ot  24  —  16  distinct  outcomes  are  possible,  all  of  which  are  equally  likely  because 
ol  the  symmetry  of  the  situation.  Hence,  the  sample  space  consists  of  16  points,  each  with 
probability  1/16.  The  16  outcomes  are  as  follows: 


HHHH 

TTTT 

HHHT 

TTTH 

HHTH 

TTHT 

HHTT 

— »  TTHH 

HTHH 

THTT 

HTHT 

—*  *•  77/77/ 

HTTH 

— > THHT 

HTTT 

THHH 

Six  out  ot  these  1 6  outcomes  lead  to  the  event  “obtaining  exactly  two  heads”  (arrows). 
Because  all  of  the  six  outcomes  are  disjoint  (mutually  exclusive), 

^(obtaining  exactly  two  heads)  =  —  =  - 

16  8 


In  Example  8.4.  the  method  of  listing  all  possible  outcomes  quickly  becomes  unwieldy 
as  the  number  of  tosses  increases.  For  example,  if  a  coin  is  tossed  just  10  times,  the  total 
number  of  outcomes  is  1024.  A  more  convenient  approach  would  be  to  apply  the  results  of 
combinatorial  analysis  used  in  Bernoulli  trials,  to  be  discussed  shortly. 


vuiiuiumiai  i  i  onumnty  and  Independent  Events 

Conditional  Probability:  It  often  happens  that  the  probability  of  one  event  is  influence 
by  the  outcome  of  another  event.  As  an  example,  consider  drawing  two  cards  in  successio 
rom  a  deck.  Let  A  denote  the  event  that  the  first  card  drawn  is  an  ace.  We  do  not  replace  the  car 
i  rawn  in  the  first  trial  Let  B  denote  the  event  that  the  second  card  drawn  is  an  ace.  It  is  evidet 
the  tiro  3|f  !'  rawm8  an  ace  in  the  second  trial  will  be  influenced  by  the  outcome  c 

•  S  \raVV ■  .  .  e  '7' dr™  does  not  result  in  an  ace,  then  the  probability  of  obtaining  an  ac 
We  now  im  H  '1  The  Probability  of  event  B  thus  depends  on  whether  event  A  occur: 

when Ti  t  ' t<>nd,t,0nal  probahilit.v  P(B\A)  to  denote  the  probability  of  event 

when  known  that  event  A  has  occurred.  P(B\A)  is  read  as  “probability  of  B  given  A." 
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Let  there  be  N  trials  of  an  experiment,  in  which  the  event  A  occurs  n\  times.  Of  these  n\ 
trials,  event  B  occurs  /12  times.  It  is  clear  that  n 2  is  the  number  of  times  that  the  joint  event 
AD  B  (Fig.  8.2c)  occurs.  That  is, 

P(ADB)  =  lim  (77)=  lim  (— )  (— ^ 

N-*oo\N  '  N-*oo\N  '  \m  J 

Note  that  Iimw->oo(ni//V)  =  P{A).  Also,  lim^_>0o(n2/ni)  =  P(B\A)*  because  B  occurs 
«2  of  the  n\  times  that  A  occurred.  This  represents  the  conditional  probability  of  B  given  A. 
Therefore. 


P(ADB)  =  P(A)P(B\A) 


(8.12) 


and 


P(B\A)  = 


P(A  n  B) 
~P(A) 


provided  P(A)  >  0 


Using  a  similar  argument,  we  obtain 


P(A\B)  = 


P(A  n  B) 
~P{B) 


provided  P(B)  >  0 


(8.13a) 


(8.13b) 


It  follows  from  Eqs.  (8.13)  that 


P(A)P(B\A) 
P(A\B)  =  „  „ 

P(B) 

P(B)P(A\B) 

P(B\A)  = 


(8.14a) 

(8.14b) 


Equations  (8. 1 4)  are  called  Bayes"  rule.  In  Bayes’  rule,  one  conditional  probability  is  expressed 
in  terms  of  the  reversed  conditional  probability. 


Example  8.5  An  experiment  consists  of  drawing  two  cards  from  a  deck  in  succession  (without  replacing  the 

first  card  drawn).  Assign  a  value  to  the  probability  of  obtaining  two  red  aces  in  two  draws. 

Let  A  and  be  the  events  “red  ace  in  the  first  draw”  and  “red  ace  in  the  second  draw,” 
respectively.  wish  to  determine  P(A  D  B), 

P(ADB)  =  P(A)P(B\A) 


and  the  relative  frequency  of  A  is  2/52  —  1/26.  Hence. 


P(A)  = 


26 


*  Here  we  are  implicitly  using  the  fact  that  rt\ 
limw-,oo(/t|//V)  /  0,  that  is,  if  P(A )  £  0. 


00  as  N  — ►  00.  This  is  true  provided  the  ratio 
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Also,  P(B\A)  is  the  probability  of  drawing  a  red  ace  in  the  second  draw  given  that  the  first 
draw  was  a  red  ace.  The  relative  frequency  of  this  event  is  1/5 1 ,  so 

rwt>  =  h 


Hence, 


P(A  H  B) 


1 

1326 


Independent  Events:  Under  conditional  probability,  we  presented  an  example  where 
the  occurrence  of  one  event  was  influenced  by  the  occurrence  of  another.  There  are,  of  course, 
many  examples  in  which  two  or  more  events  are  entirely  independent;  that  is,  the  occurrence 
of  one  event  in  no  way  influences  the  occurrence  of  the  other  event.  As  an  example,  we  again 
consider  the  drawing  of  two  cards  in  succession,  but  in  this  case  we  replace  the  card  obtained 
in  the  first  draw  and  shuffle  the  deck  before  the  second  draw.  In  this  case,  the  outcome  of  the 
second  draw  is  in  no  way  influenced  by  the  outcome  of  the  first  draw.  Thus  P(B ),  the  probability 
of  drawing  an  ace  in  the  second  draw,  is  independent  of  whether  the  event  A  (drawing  an  ace 
in  the  first  trial)  occurs.  Thus,  the  events  A  and  B  are  independent.  The  conditional  probability 
P(B\A)  is  given  by  P(B). 

The  event  B  is  said  to  be  independent  of  the  event  A  if  and  only  if 

P(AnB)  =  P(A)P(B)  (8.15a) 

Note  that  if  the  events  A  and  B  are  independent,  it  follows  from  Eqs.  (8. 13a)  and  (8. 15b)  that 

P(B\A)  =P(B)  (8.15b) 

This  relationship  states  that  if  B  is  independent  of  A ,  then  its  probability  is  not  affected  by  the 
event  A.  Naturally,  if  event  B  is  independent  of  event  A ,  then  event  A  is  also  independent  of  B . 
It  can  been  seen  from  Eqs.  (8.14)  that 


P(A\B)  =  P(A)  (8.15c) 

Note  that  there  is  a  huge  difference  between  independent  events  and  mutually  exclusive 
events.  If  A  and  B  are  mutually  exclusive,  then  A  D  B  is  empty  and  P{A  n  B)  =  0.  If  A  and  B 
are  mutually  exclusive,  then  A  and  B  cannot  occur  at  the  same  time.  This  clearly  means  that 
they  are  NOT  independent  events. 

Bernoulli  Trials 

In  Bernoulli  trials,  if  a  certain  event  A  occurs,  we  call  it  a  “success  ”  If  P(A)  =  p,  then  the 
probability  of  success  is  p.  If  q  is  the  probability  of  failure,  then  q  =  1  -  p  We  shall  find  the 
probability  of*  successes  in  n  (Bernoulli)  trials.  The  outcome  of  each  trial  is  independent  of 
the  outcomes  of  the  other  trials.  It  is  clear  that  in  n  trials,  if  success  occurs  in  k  trials,  failure 
occurs  in  n  k  trials.  Since  the  outcomes  of  the  trials  are  independent,  the  probability  of  this 
event  is  clearly  p"(  1  -  />)"“* ,  that  is. 


P(k  successes  in  a  specific  order  in  n  trials)  =  pk(  1  —  p)n~k 
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But  the  event  of  k  successes  in  n  trials”  can  occur  in  many  different  ways  (different  orders). 
It  is  well  known  from  combinatorial  analysis  that  there  are 

/n\  n\ 

U  =  <8I6> 

ways  in  which  k  positions  can  be  taken  from  n  positions  (which  is  the  same  as  the  number  of 
ways  of  achieving  k  successes  in  n  trials). 

This  can  be  proved  as  follows.  Consider  an  urn  containing  n  distinguishable  balls  marked 
1,2,  Suppose  we  draw  k  balls  from  this  urn  without  replacing  them.  The  first  ball  could 

be  any  one  of  the  n  balls,  the  second  ball  could  be  any  one  of  the  remaining  (/?—!)  balls,  and 
so  on.  Hence,  the  total  number  of  ways  in  which  k  balls  can  be  drawn  is 

n(n-  l)(n  —  2).. .  (n  -  k  +  1)  =  — — — 

(n-k)\ 


Next,  consider  any  one  set  of  the  k  balls  drawn.  These  balls  can  be  ordered  in  different  ways. 
We  could  label  any  one  of  the  k  balls  as  number  1,  and  any  one  of  the  remaining  (k  —  1)  balls 
as  number  2,  and  so  on.  This  will  give  a  total  of  k(k  —  l)()t  —  2)  •  •  •  1  =  k\  distinguishable 
patterns  formed  from  the  k  balls.  The  total  number  of  ways  in  which  k  things  can  be  taken 
from  n  things  is  n\/(n  -  k)\  But  many  of  these  ways  will  use  the  same  k  things,  arranged  in 
different  order.  The  ways  in  which  k  things  can  be  taken  from  n  things  without  regard  to  order 
(unordered  subset  k  taken  from  n  things)  is  n\/(n  -  k)\  divided  by  k\ This  is  precisely  defined 
by  Eq.  (8.16). 

This  means  the  probability  of  k  successes  in  n  trials  is 

P(k  successes  in  n  trials)  =  ^  ^  pk{\  -  p)n~k 

=  (8I7> 

Tossing  a  coin  and  observing  the  number  of  heads  is  a  Bernoulli  trial  with  p  =  0.5.  Hence,  the 
probability  of  observing  k  heads  in  n  tosses  is 

P(k  heads  in  n  tosses)  =  (^.  )  (0.5/  (0.5)""A  =  ^  (0.5)” 


Sample  8.6 


A  binary  symmetric  channel  (BSC)  has  an  error  probability  Pe  (i.e.,  the  probability  of  receiving 
0  when  1  is  transmitted,  or  vice  versa,  is  Pe).  Note  that  the  channel  behavior  is  symmetrical 
with  respect  to  0  and  1.  Thus, 


P(0|l)  =  />(1|0)  =  Pe 


and 


/>(0|0)  =  P(1|1)  =  1  -Pe 

where  P(y|jr)  denotes  the  probability  of  receiving  y  when  x  is  transmitted.  A  sequence  of  n 
binary  digits  is  transmitted  over  this  channel.  Determine  the  probability  of  receiving  exactly 
k  digits  in  error. 
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The  reception  each  digit  is  independent  of  the  other  digits.  This  is  an  example  of  a 
Bernoulli  trial  the  probability  of  success  p  =  Pe  (“success"  here  is  receiving  a  digit 
in  error).  Clearly,  the  probability  of  k  successes  in  n  trials  (k  errors  in  n  digits)  is 

/’(receiving  k  out  of  n  digits  in  error)  =  (  )  p£(l  —  Pe)nk 

For  example,  Pe  =  10-5,  the  probability  of  receiving  two  digits  wrong  in  a  sequence 
of  eight  digits 


(  M  (10-5)2(1  -  1(T5)6  ~  r?7T  HT10  =  (2.8)1(T9 
\2 /  2!  6! 


Example  8.7  PCM  Repeater  Error  Probability 

In  pulse  code  modulation,  regenerative  repeaters  are  used  to  detect  pulses  (before  they  are  lost 
in  noise)  and  retransmit  new,  clean  pulses.  This  combats  the  accumulation  of  noise  and  pulse 
distortion. 

A  certain  PCM  channel  consists  of  ti  identical  links  in  tandem  (Fig.  8.3).  The  pulses  are 
detected  at  the  end  of  each  link  and  clean  new  pulses  are  transmitted  over  the  next  link.  If  Pe 
is  the  probability  of  error  in  detecting  a  pulse  over  any  one  link,  show  that  Pe,  the  probability 
of  error  in  detecting  a  pulse  over  the  entire  channel  (over  the  n  links  in  tandem),  is 

P e  —  nPe  nPe  1 


Figure  8.3  1st  link  2st  link 

A  PCM  repeater.  In  • - >  * - »»  > 


♦ 


kih  link 


Out 


The  probabilities  of  detecting  a  pulse  correctly  over  one  link  and  over  the  entire  channel 
(n  links  in  tandem)  are  1  —  Pe  and  1  —  Pe,  respectively.  A  pulse  can  be  detected  correctly 
over  the  entire  channel  if  either  the  pulse  is  detected  correctly  over  every  link  or  errors 
are  made  over  an  even  number  of  links  only. 

1  ~  Pe  =  /’(correct  detection  over  all  links) 

+  P(error  over  two  links  only) 

+  /’(error  over  four  links  only)  q - 

+  /’(error  over  2  |^-  J  links  only) 
where  \a\  denotes  the  largest  integer  less  than  or  equal  to  a. 

Because  pulse  detection  over  each  link  is  independent  of  the  other  links  (see  Example  8.6), 
/’(correct  detection  over  all  n  links)  =  (1  -  Pe)n 


/’(error  over  k  links  only)  = 


k\(n  —  k)\ 


Ped-Pe) 


n—k 


and 


Hence, 
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>n-k 


In  practice,  Pe  <£  1,  so  only  the  first  two  terms  on  the  right-hand  side  of  this  equation  are 
of  significance.  Also,  (I  -  Pe)n~k  ~  1,  and 


If  nPe  1 ,  then  the  second  term  can  also  be  neglected,  and 


\-PE^(\  -Pe)n 

-  1  -  nPe  nPe  <$c  1 


and 


PE  —  nPe 


We  can  explain  this  result  heuristically  by  considering  the  transmission  of  N  (N  -*  oo) 
pulses.  Each  link  makes  NPe  errors,  and  the  total  number  of  errors  is  approximately  nNPe 
(approximately,  because  some  of  the  erroneous  pulses  over  a  link  will  be  erroneous  over 
other  links).  Thus  the  overall  error  probability  is  nPe. 


In  binary  communication,  one  of  the  techniques  used  to  increase  the  reliability  of  a  channel 
is  to  repeat  a  message  several  times.  For  example,  we  can  send  each  message  (0  or  1) 
three  times.  Hence,  the  transmitted  digits  are  000  (for  message  0)  or  111  (for  message 
1).  Because  of  channel  noise,  we  may  receive  any  one  of  the  eight  possible  combinations 
of  three  binary  digits.  The  decision  as  to  which  message  is  transmitted  is  made  by  the 
majority  rule;  that  is,  if  at  least  two  of  the  three  detected  digits  are  0,  the  decision  is  0,  and 
so  on.  This  scheme  permits  correct  reception  of  data  even  if  one  out  of  three  digits  is  in 
error.  Detection  error  occurs  only  if  at  least  two  out  of  three  digits  are  received  in  error. 
If  Pe  is  the  error  probability  of  one  digit,  and  P(e)  is  the  probability  of  making  a  wrong 
decision  in  this  scheme,  then 


=  3/£(l  -Pe)  +  P] 


In  practice.  Pe  «;  I,  and 


P(e)  -  3 P] 
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For  instance,  if  Pe  =  10-4,  P(e)  ~  3  x  10-8.  Thus,  the  error  probability  is  reduced 
from  10-4  to  3  x  I0-8.  We  can  use  any  odd  number  of  repetitions  for  this  scheme  to 
function. 

In  this  example,  higher  reliability  is  achieved  at  the  cost  of  a  reduction  in  the  rate  of 
information  transmission  by  a  factor  of  3.  We  shall  see  in  Chapter  15  that  more  efficient 
ways  exist  to  effect  a  trade-off  between  reliability  and  the  rate  of  transmission  through  the 
use  of  error  correction  codes. 


Figure  8.4 

The  event  of 
interest  B  and  the 
partition  of  S  by 
Mi). 


Multiplication  Rule  for  Conditional  Probabilities 

As  shown  in  Eq.  (8.12),  we  can  write  the  joint  event 


P(ADB)  =  P(A)P(B/A) 


This  rule  on  joint  events  can  be  generalized  for  multiple  events  A  \ ,  A 2,  . . . ,  A„  via  iterations. 
If  A 1  As  ••■An  0,  then  we  have 


F(A,A2--A„) 


P(A|A2--A„)  P(A,A2.-A„-i) 

P(A|A2  •  *  - A„_i )  '  P(A\A2  •  •  •  A„_2) 


P(A|A2) 

P(Ai) 


P(A\) 


(8.18a) 


=  P(A„|A|A2  •  ••  A„_,)  •  P(A„_,  |A,A2  ■  A„_2)  •  •  •  P(A2|A,)  •  P(A,) 

(8.18b) 


Note  that  sinceA|A2  ■  ■  A„jk  0,  every  denominator  in  Eq.  (8. 18a)  is  positive  and  well  defined. 


Example  8.9  Suppose  a  box  of  diodes  consist  of  Ng  good  diodes  and  Nb  bad  diodes.  If  five  diodes  are 
randomly  selected,  one  at  a  time,  without  replacement,  determine  the  probability  of  obtaining 
the  sequence  of  diodes  in  the  order  of  good,  bad,  good,  good,  bad. 


We  can  denote  G,  as  the  event  that  the  ith  draw  is  a  good  diode.  We  are  interested  in  the 
event  of  G1G2G3G4G5. 

P(GiG^G3G4G^)  =  P(Gi)P(G$|Gi)P(G3|GiG£)P(G4|GiG£G3)P(G5|GiG2C3G4) 

_  _ Ng  -  1  Ng  -  2 

Ng+Nb  Ng  +  Nb  -  l'  Nb  +  Ng  -  2  '  Ng  +  Nb  -  3 
Nh-  1 

'  Ng+Nb-4 


To  Divide  and  Conquer:  The  Total  Probability  Theorem 

In  analyzing  a  particular  event  of  interest,  sometimes  a  direct  approach  to  evaluating  its  prob¬ 
ability  can  be  difficult  because  there  can  be  so  many  different  outcomes  to  enumerate.  When 
dealing  with  such  problems,  it  is  often  advantageous  to  adopt  the  divide-and-conquer  approach 
by  separating  all  the  possible  causes  leading  to  the  particular  event  of  interest  B.  The  total 
probability  theorem  provides  a  perfect  tool  for  analyzing  the  probability  of  such  problems. 

We  define  5  as  the  sample  space  of  the  experiment  of  interest.  As  shown  in  Fig.  8.4.  the 
entire  sample  space  can  be  partitioned  into  n  disjoint  events  A|,  . ...  An.  We  can  now  state  the 
theorem: 


Example  8.10 
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Figure  8.4 

The  event  of 
interest  B  and  the 
partition  of  S  by 
f At). 


S 


Total  Probability  Theorem.  Let  n  disjoint  events  A\ . A„  form  a  partition  of  the 


sample  space  5  such  that 


n 


and  Ag  D  Aj  =  0,  if  /  ^  j 


/=! 


Then  the  probability  of  an  event  B  can  be  written  as 


n 


P(B)  =  J2P(B\Ai)P(Ai) 


i=i 


Proof:  The  proof  of  this  theorem  is  quite  simple  based  on  Fig.  8.4.  Since  {/t,  }  form  a  partition 
of  S,  then 


B  =  BnS  =  BC\(A |  LM2  U  •  •  •  LM„) 
=  (A,B)U(A2B)\J---U(A„B) 


Because  {/!,}  are  disjoint,  so  are  {A,B\.  Thus, 


n 


n 


P(B)  =  P(AiB)  =  Y  P(B\Ai)P(Ai) 


1=1 


/=! 


This  theorem  can  simplify  the  analysis  of  the  more  complex  event  of  interest  B  by  iden¬ 
tifying  all  different  causes  Ag  for  B.  By  quantifying  the  effect  of  A,  on  B  through  P(B\Ai ),  the 
theorem  allows  us  to  “divide-and-conquer  ’  a  complex  problem  (of  event  B). 


E\,  £2,  ...  ,£yv  it  encounters.  These  error  patterns  are  mutually  exclusive,  each  with  prob¬ 
ability  P(Ej)  =  pi.  When  the  error  pattern  E\  occurs,  the  data  packet  would  be  incorrectly 
decoded  with  probability  qx.  Find  the  probability  that  the  data  packet  is  incorrectly  decoded. 

I  We  apply  total  probability  theorem  to  tackle  this  problem.  First,  define  B  as  the  event  that 
the  data  packet  is  incorrectly  decoded.  Based  on  the  problem,  we  know  that 


P(B\Ei)  =  qi  and  P(Ei)  =  p{ 
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Furthermore,  the  data  packet  has  been  incorrectly  decoded.  Therefore 

X>  =  1 

1=1 


Applying  the  total  probability  theorem,  we  find  that 

n  n 

P(B)  =  P{B\Ei)P{Ei)  =  Y^qiPi 

i=l  i=l 


Isolating  a  Particular  Cause:  Bayes’  Theorem 

The  total  probability  theorem  facilitates  the  probabilistic  analysis  of  a  complex  event  by  using  a 
divide-and-conquer  approach.  In  practice,  it  may  also  be  of  interest  to  determine  the  likelihood 
of  a  particular  cause  of  an  event  among  many  disjoint  possible  causes.  Bayes’  theorem  provides 
the  solution  to  this  problem. 

Bayes'  Theorem:  Let  n  disjoint  events  A  \ ,  . . . ,  An  form  a  partition  of  the  sample  space 
S.  Let  B  be  an  event  with  P(B)  >  0.  Then  for  j  =  1,  . . . ,  n, 

m.|  B)  =  P(B\Aj)P(Aj)  =  P(B\Aj)P(Aj) 

1  P(B)  £"=1  P(B\Ai)P(Ai) 

The  proof  is  already  given  by  the  theorem  itself. 

Bayes  theorem  provides  a  simple  method  for  computing  the  conditional  probability  of  Aj 
given  that  B  has  occurred.  The  probability  P(Aj\B)  is  often  known  as  the  posterior  probability 
of  event  Aj.  It  describes,  among  n  possible  causes  of  B,  the  probability  that  B  may  be  caused 
by  Aj.  In  other  words.  Bayes'  theorem  isolates  and  finds  the  relative  likelihood  of  each  possible 
cause  to  an  event  of  interest. 


Example  8.11  A  communication  system  always  encounters  one  of  three  possible  interference  waveforms. 

F|,  F2,  or  F3.  The  probability  of  each  interference  is  0.8,  0.16,  and  0.04,  respectively.  The 
communication  system  fails  with  probabilities  0.01,  0.1,  and  0.4  when  it  encounters  F|,  Fl> 
and  F3,  respectively.  Given  that  the  system  has  failed,  find  the  probability  that  the  failure  is  a 
result  of  F\,  F3,  or  F3,  respectively. 

Denote  B  as  the  event  of  system  failure.  We  know  from  the  description  that 
P(F| )  =  0.8  P(F2)  =  0.16  F(F3)  =  0.04 

Furthermore,  the  effect  of  each  interference  on  the  system  is  given  by 

IF(fi|F,)  =  0.01  F(B|F2)  =  0.1  F(B|F3)  =  0.4 

Now  following  Bayes’  theorem,  we  find  that 

P(F,  | B)  =  )P(Fi)  _ _ (0.01)(0,8) _ _  _  0  2 

£/=l  P(B\Fi)P(Fi)  (0.01)(0.8)  +  (0.1)(0. 1 6)  +  (0.4)(di04) 
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P(F2\B) 
P(Fi\B ) 


F(F|F2)F(F2) 
LU  P{B\Fi)P{Fi) 
F(fl|F3)F(F3) 
TLl  P(B\Fi)P(Ft) 


Example  8. 1 1  illustrates  the  major  difference  between  the  posterior  probability  P(Fi\B) 
and  the  prior  probability  P(Fi).  Although  the  prior  probability  P(F3)  =  0.04  is  the  lowest 
among  the  three  possible  interferences,  once  the  failure  event  B  has  occurred,  P(F^\B)  =  0.4  is 
actually  one  of  the  most  likely  events.  Bayes’  theorem  is  an  important  tool  in  communications 
for  determining  the  relative  likelihood  of  a  particular  cause  to  an  event. 


Axiomatic  Theory  of  Probability 

The  relative  frequency  definition  of  probability  is  intuitively  appealing.  Unfortunately,  it  has 
some  serious  mathematical  objections.  Logically  there  is  no  reason  why  we  should  get  the  same 
estimate  of  the  relative  frequency  whether  we  base  it  on  10,000  trials  or  on  20.  Moreover,  in 
the  relative  frequency  definition,  it  is  not  clear  when  and  in  what  mathematical  sense  the  limit 
in  Eq.  (8.5)  exists.  If  we  consider  a  set  of  an  infinite  number  of  trials,  we  can  partition  such 
a  set  into  several  subsets,  such  as  odd  and  even  numbered  trials.  Each  of  these  subsets  (of 
infinite  trials  each)  would  have  its  own  relative  frequency.  So  far,  all  the  attempts  to  prove  that 
the  relative  frequencies  of  all  the  subsets  are  equal  have  been  futile.1  There  are  some  other 
difficulties  also.  For  instance,  in  some  cases,  such  as  Julius  Caesar  having  visited  Great  Britain, 
it  is  an  experiment  for  which  we  cannot  repeat  the  event  an  infinite  number  of  trials.  Thus,  we 
can  never  know  the  probability  of  such  an  event.  We,  therefore,  need  to  develop  a  theory  of 
probability  that  is  not  tied  down  to  any  particular  definition  of  probability.  In  other  words,  we 
must  separate  the  empirical  and  the  formal  problems  of  probability.  Assigning  probabilities  to 
events  is  an  empirical  aspect,  and  setting  up  purely  formal  calculus  to  deal  with  probabilities 
(assigned  by  whatever  empirical  method)  is  the  formal  aspect. 

It  is  instructive  to  consider  here  the  basic  difference  between  physical  sciences  and  mathe¬ 
matics.  Physical  sciences  are  based  on  inductive  logic,  and  mathematics  is  strictly  a  deductive 
logic.  Inductive  logic  consists  of  making  a  large  number  of  observations  and  then  generalizing, 
from  these  observations,  laws  that  will  explain  these  observations.  For  instance,  history  and 
experience  tell  us  that  every  human  being  must  die  someday.  This  leads  to  a  law  that  humans  are 
mortals .  This  is  inductive  logic.  Based  on  a  law  (or  laws)  obtained  by  inductive  logic,  we  can 
make  further  deductions.  The  statement  “John  is  a  human  being,  so  he  must  die  some  day’’  is  an 
example  of  deductive  logic.  Deriving  the  laws  of  the  physical  sciences  is  basically  an  exercise 
in  inductive  logic,  whereas  mathematics  is  pure  deductive  logic.  In  a  physical  science  we  make 
observations  in  a  certain  field  and  generalize  these  observations  into  laws  such  as  Ohm’s  law. 
Maxwell’s  equations,  and  quantum  mechanics.  There  are  no  other  proofs  for  these  inductively 
obtained  laws.  They  are  found  to  be  true  by  observation.  But  once  we  have  such  inductively 
formulated  laws  (axioms  or  hypotheses),  by  using  thought  process,  we  can  deduce  additional 
results  based  on  these  basic  laws  or  axioms  alone.  This  is  the  proper  domain  of  mathematics. 
All  these  deduced  results  have  to  be  proved  rigorously  based  on  a  set  of  axioms.  Thus,  based 
on  Maxwell’s  equations  alone,  we  can  derive  the  laws  of  the  propagation  of  electromagnetic 
waves. 

This  discussion  shows  that  the  discipline  of  mathematics  can  be  summed  up  in  one  apho¬ 
rism:  “This  implies  that.”  In  other  words,  if  we  are  given  a  certain  set  of  axioms  (hypotheses). 
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then,  based  upon  these  axioms  alone,  what  else  is  true?  As  Bertrand  Russell  puts  it:  “Pure  math¬ 
ematics  consists  entirely  of  such  asseverations  as  that,  if  such  and  such  proposition  is  true  of 
anything,  then  such  and  such  another  proposition  is  true  of  that  thing."  Seen  in  this  light,  it  may 
appear  that  assigning  probability  to  an  event  may  not  necessarily  be  the  responsibility  of  the 
mathematical  discipline  of  probability.  Under  mathematical  discipline,  we  need  to  start  with 
a  set  of  axioms  about  probability  and  then  investigate  what  else  can  be  said  about  probability 
based  on  this  set  of  axioms  alone.  We  start  with  a  concept  (as  yet  undefined)  of  probability 
and  postulate  axioms.  The  axioms  must  be  internally  consistent  and  should  conform  to  the 
observed  relationships  and  behavior  of  probability  in  the  practical  and  the  intuitive  sense.  It 
is  beyond  the  scope  of  this  book  to  discuss  how  these  axioms  are  formulated.  The  modem 
theory  of  probability  starts  with  Eqs.  (8.6),  (8.8),  and  (8. 1 1 )  as  its  axioms.  Based  on  these  three 
axioms  alone,  what  else  is  true  is  the  essence  of  modem  theory  of  probability.  The  relative 
frequency  approach  uses  Eq.  (8.5)  to  define  probability,  and  Eqs.  (8.5),  (8.8),  and  (8. 1 1)  follow 
as  a  consequence  of  this  definition.  In  the  axiomatic  approach,  on  the  other  hand,  we  do  not 
say  anything  about  how  we  assign  probability  P(A)  to  an  event  A;  rather,  we  postulate  that  the 
probability  function  must  obey  the  three  postulates  or  axioms  in  Eqs.  (8.6),  (8.8),  and  (8.11). 
The  modern  theory  ot  probability  does  not  concern  itself  with  the  problem  of  assigning  prob¬ 
abilities  to  events.  It  assumes  that  somehow  the  probabilities  were  assigned  to  these  events  a 
priori. 

It  a  mathematical  model  is  to  conform  to  the  real  phenomenon,  we  must  assign  these 
probabilities  in  away  that  is  consistent  with  an  empirical  and  an  intuitive  understanding  of 
probability.  The  concept  ot  relative  frequency  is  admirably  suited  for  this.  Thus,  although  we 
use  relative  frequency  to  assign  (not  define)  probabilities,  it  is  all  under  the  table,  not  a  part  of 
the  mathematical  discipline  of  probability. 


Figure  8.5 

Probabilities  in 
a  coin-tossing 
experiment. 


Example  8. 1 


8.2  RANDOM  VARIABLES 

The  outcome  ot  an  experiment  may  be  a  real  number  (as  in  the  case  of  rolling  a  die),  or  it 
may  be  nonnumerical  and  describable  by  a  phrase  (such  as  “heads”  or  “tail”  in  tossing  a  coin). 
From  a  mathematical  point  of  view,  it  is  simpler  to  have  numerical  values  for  all  outcomes. 
For  this  reason,  we  assign  a  real  number  to  each  sample  point  according  to  some  rule.  If  there 
are  m  sample  points  ,  £2>  ....  then  using  some  convenient  rule,  we  assign  a  real  number 
x(£,)  to  sample  point  £,•  (/'  =1,2,...  ,m).  In  the  case  of  tossing  a  coin,  for  example,  we  may 
assign  the  number  1  for  the  outcome  heads  and  the  number  - 1  for  the  outcome  tails  (Fig.  8.5). 
Thus,  x(d  is  a  function  that  maps  sample  points  ft,  £2,  . . . , into  real  numbers 

f1,  X2 . Xn  *  We  now  have  a  random  variable  x  that  takes  on  values  jci,  . . 

We  shall  use  roman  type  (x)  to  denote  a  random  variable  (RV)  and  italic  type  (e.g.. 

Xl'  Xl . to  denote  the  value  it  takes.  The  probability  of  an  RV  x  taking  a  value  Xi 

is  Px(Xi)  =  Probability  of  “x  = 

Discrete  Random  Variables 

A  random  \ariable  is  discrete  it  there  exists  a  denumerable  sequence  of  distinct  numbers  V 

cnr»r»  thot 


X>*te)  =  i  (gl9) 


*  The  number  m  is  not  necessarily  equal  to  n.  More  than 


one  sample  point  can  map  into  one  value  of  x. 
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Figure  8.5 

Probabilities  in 

a  coin-tossing 
experiment. 

0.5 

-1 

0 

1  - 

Thus,  a  discrete  RV  can  assume  only  certain  discrete  values.  An  RV  that  can  assume  any  value 
over  a  continuous  set  is  called  a  continuous  random  variable. 


Example  8.12  Two  dice  are  thrown.  The  sum  of  the  points  appearing  on  the  two  dice  is  an  RV  x.  Find  the 
values  taken  by  x,  and  the  corresponding  probabilities. 


We  see  that  x  can  take  on  all  integral  values  from  2  through  12.  Various  probabilities  can 
be  determined  by  the  method  outlined  in  Example  8.3. 

There  are  36  sample  points  in  all,  each  with  probability  1/36.  Dice  outcomes  for 
various  values  of  x  are  shown  in  Table  8. 1 .  Note  that  although  there  are  36  sample  points, 
they  all  map  into  11  values  of  x.  This  is  because  more  than  one  sample  point  maps  into 
the  same  value  of  x.  For  example,  six  sample  points  map  into  x  =  7. 

The  reader  can  verify  that  P\(xi)  =  T 


TABLE  8.1 


Value  of  jc/ 

Dice  Outcomes 

PxUi) 

2 

(i,D 

1/36 

3 

(1,2),  (2,1) 

2/36=  1/18 

4 

(1,3),  (2,  2),  (3,  l) 

3/36=  1/12 

5 

(1,4),  (2,  3),  (3,  2).  (4,  1) 

4/36=  1/9 

6 

(1,5),  (2,  4),  (3,  3),  (4, 2),  (5,  1) 

5/36 

7 

(1,6),  (2,5),  (3.  4).  (4, 3),  (5,  2),  (6,  1) 

6/36=1/6 

8 

(2,  6),  (3,  5),  (4, 4),  (5, 3),  (6,  2) 

5/36 

9 

(3, 6),  (4,  5),  (5, 4),  (6,  3) 

4/36=  1/9 

10 

(4.  6),  (5.  5),  (6, 4) 

3/36=  1/12 

11 

(5,  6),  (6,  5) 

2/36=  1/18 

12 

(6,6) 

1/36 

The  preceding  discussion  can  be  extended  to  two  RVs,  x  and  y.  The  joint  probability 
Pxy(Xi,  yj)  is  the  probability  that  “x  =  jc,  and  y  =  Consider,  for  example,  the  case  of  a 
coin  tossed  twice  in  succession.  If  the  outcomes  of  the  first  and  second  tosses  are  mapped  into 
RVs  x  and  y,  then  x  and  y  each  takes  values  1  and  - 1 .  Because  the  outcomes  of  the  two  tosses 
are  independent,  x  and  y  are  independent,  and 


P xy (•*/♦  yj)  —  P X (xi )  Pyiyj) 
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and 

Pxy(U  1)  =  —  1)  —  P xy(—  1 ,  1)  =  PXy(— 1,  —  1)  =  | 

These  probabilities  are  plotted  in  Fig.  8.6. 

For  a  general  case  where  the  variable  x  can  take  values  x\,  xi,  x„  and  the  variable  y 
can  take  values  yi ,  yi,  . . . ,  ym,  we  have 

=  1  (8.2°) 

'■  j 

This  follows  from  the  tact  that  the  summation  on  the  left  is  the  probability  of  the  union  of  all 
possible  outcomes  and  must  be  unity  (a  certain  event). 

Conditional  Probabilities 

It  x  and  y  are  two  RVs,  then  the  conditional  probability  of  x  =  jc,  given  y  =  y,  is  denoted  by 
Px\y(*i\yj)-  Moreover. 


J2  PMy(Xi\yj)  =  Py\x(yj\xi)  =  1  (8.21) 

*'  i 

This  can  be  proved  by  observing  that  probabilities  /\|y(xi|»)  are  specified  over  the  sample 
space  corresponding  to  the  condition  y  =  yj.  Hence,  £.  />8|y(x*|w)  is  the  probability  of  the 
union  of  all  possible  outcomes  of  x  (under  the  condition  y  =  y)  and  must  be  unity  (a  certain 
event).  A  similar  argument  applies  to  £,•  Py „  (y,k).  Also  from  Eq.  (8.12),  we  have 

PXy(xh  yj)  =  Px\y(Xi\yj)Py(yj)  =  Py\x(yj\Xi)Px(Xj)  (8-22) 

Bayes'  rule  follows  from  Eq.  (8.22).  Also  from  Eq.  (8.22),  we  have 

Pxy  (xh  yj)  =  P* |y  (xi  \yj)Py(yj) 

'  I 

=  py(yj)  ^2  Px\y(Xi\yj) 

=  Pyiyj) 


(8.23a) 
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Similarly, 


Px (*/)  =  Y  pxy  (*/.  yj)  (8.23b) 

j 

The  probabilities  Px(Xj)  and  Py(yj)  are  called  marginal  probabilities.  Equations  (8.23)  show 
how  to  determine  marginal  probabilities  from  joint  probabilities.  Results  of  Eqs.  (8.20)  through 
(8.23)  can  be  extended  to  more  than  two  RVs. 


Example  8. 1  3  A  binary  symmetric  channel  (BSC)  error  probability  is  Pe.  The  probability  of  transmitting  1 
is  0  and  that  of  transmitting  0  is  1  -  Q  (Fig.  8.7).  Determine  the  probabilities  of  receiving  1 
and  0  at  the  receiver. 


Figure  8.7 

Binary  symmetric 
channel  (BSC). 


If  x  and  y  are  the  transmitted  digit  and  the  received  digit,  respectively,  then  for  a  BSC, 

Py|X(0|l)  =  />y|x(l|0)  =  P€ 

^y|x(0|0)  =  Py|x(l|l)  =  1  -Pe 


Also, 


PAD  =  Q  and  PX(0)=1-G 
We  need  to  find  Py(l)  and  Fy(0).  From  the  total  probability  theorem, 

P yC Yj)  =  y  ^  P x(xi)P y|x (yj\xi) 

I 


we  find 


Similarly, 


Py(l)  =  Px(0)Py\x(m  +  PADPylxW) 
=  {\-Q)Pe  +  QA~Pe) 


/>y(0)  =  (1-0(1  -Pe)  +  QPe 

These  answers  seem  almost  obvious  from  Fig.  8.7. 

Note  that  because  of  channel  errors,  the  probability  of  receiving  a  digit  1  is  not  the 
same  as  that  of  transmitting  1.  The  same  is  true  of  0. 
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Example  8.14  Over  a  certain  binary  communication  channel,  the  symbol  0  is  transmitted  with  probability  0.4 
and  1  is  transmitted  with  probability  0.6.  It  is  given  that  F(e|0)  =  I0~6  and  F(e|l)  =  10-4, 
where  P(€\xi)  is  the  probability  of  detecting  the  error  given  that  x ,  is  transmitted.  Determine 
P(e),  the  error  probability  of  the  channel. 

If  P(€,  Xj)  is  the  joint  probability  that  jc,  is  transmitted  and  it  is  detected  wrongly,  then  the 
total  probability  theorem  yields 


P(€)  =  ^2P(€\Xi)P(Xi) 
i 

=  Px(0)F(e|0)  +  Px(l)P(e|l) 
=  0.4(10-6)  +  0.6(10~4) 

=  0.604(1  (T4) 


Note  that  P(e|0)  —  10  6  means  that  on  the  average,  one  out  of  1  million  received 
0s  will  be  detected  erroneously.  Similarly,  P(c|l)  =  10~4  means  that  on  the  average,  one 
out  of  10.000  received  Is  will  be  in  error.  But  P(e)  =  0.604(  10-4)  indicates  that  on  the 
average,  one  out  of  1/0.604(10  4)  ~  16,556  digits  (regardless  of  whether  they  are  Is 
or  0s)  will  be  received  in  error. 


Cumulative  Distribution  Function 

The  cumulative  distribution  function  (CDF)  Fx(x)  of  an  RV  x  is  the  probability  that  x  takes 
a  value  less  than  or  equal  to  jc;  that  is. 


Fx( x)  =  P{\  <  x) 

(8.24) 

CDF  Fx(x)  has  the  following  four  properties: 

1.Fx(jc)  >0 

(8.25a) 

2.  Fx(oo)  =  1 

(8.25b) 

3.  Fx(— oo)  =  0 

(8.25c) 

4.  Fx(x)  is  a  nondecreasing  function,  that  is, 

(8.25d) 

Fx(xi)  <  Fx(xi)  for .r,  <  X2 

(8.25e) 

The  firstjjroperty  is  obvious.  The  second  and  third  properties  are  proved  by  observing  that 
from  Eq  (8  24 )~  °°  ^  °°^  =  -  ~°°).  To  prove  the  fourth  property,  we  have, 


Example  8.15 


Figure  8.8 

(a)  Probabilities 
Px(Xi)  and 

(b)  the 

cumulative 
distribution 
Unction  (CDF). 


Fx(*2)  =  P(\  <  X2) 

=  P[(X  <  *1)  U  (*i  <  x  <  JC2 ) J 
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Because  x  <  x\  and  A]  <  x  <  X2  are  disjoint,  we  have 

Fx(*2)  =  P(\  <  *1)  +  P(x i  <  x  <  xj) 

=  Fx<n)  +  P(x\  <  x  <  X2)  (8.26) 

Because  P{x\  <  x  <  X2)  is  nonnegative,  the  result  follows. 


Example  8.15  In  an  experiment,  a  trial  consists  of  four  successive  tosses  of  a  coin.  If  we  define  an  RV  x  as 
the  number  of  heads  appearing  in  a  trial,  determine  Px (jc)  and  Fx(x). 


A  total  of  1 6  distinct  equiprobable  outcomes  are  listed  in  Example  8.4.  Various  probabilities 
can  be  readily  determined  by  counting  the  outcomes  pertaining  to  a  given  value  of  x.  For 
example,  only  one  outcome  maps  into  x=0,  whereas  six  outcomes  map  into  x=2.  Hence. 
Px( 0)  =  1/16  and  Px( 2)  =  6/16.  In  the  same  way,  we  find 

IPx(0)  =  Px(4)  =  1/16 

PAD  =  PxO)  =  4/16=  1/4 
Px(2)  =  6/16  =  3/8 

The  probabilities  Px(xi)  and  the  corresponding  CDF  Fx (a, )  are  shown  in  Fig.  8.8. 


Figure  8.8 

(a)  Probabilities 
px(Xj)  and 

(b)  the 
cumulative 
distribution 
function  (CDF). 


Continuous  Random  Variables 

A  continuous  RV  x  can  assume  any  value  in  a  certain  interval.  In  a  continuum  of  any  range,  an 
uncountably  infinite  number  of  possible  values  exist,  and  Px(x,),  the  probability  that  x  =  a„ 
as  one  of  the  uncountably  infinite  values,  is  generally  zero.  Consider  the  case  of  a  temperature 
T  at  a  certain  location.  We  may  suppose  that  this  temperature  can  assume  any  of  a  range  of 
values.  Thus,  an  infinite  number  of  possible  temperature  values  may  prevail,  and  the  probability 
that  the  random  variable  T  will  assume  a  certain  value  7j  is  zero.  The  situation  is  somewhat 
similar  to  that  described  in  Sec.  3. 1  in  connection  with  a  continuously  loaded  beam  (Fig.  3.5b). 
There  is  a  loading  along  the  beam  at  every  point,  but  at  any  one  point  the  load  is  zero.  The 
meaningful  measure  in  that  case  was  the  loading  (or  weight)  not  at  a  point,  but  over  a  finite 
interval.  Similarly,  for  a  continuous  RV,  the  meaningful  quantity  is  not  the  probability  that 
x  =  x i  but  the  probability  that  x  <  x  <  a  +  Ax.  For  such  a  measure,  the  CDF  is  eminently 
suited  because  the  latter  probability  is  simply  Fx(x  4-  Aa)  —  Fx(x)  [see  Eq.  (8.26)].  Hence,  we 
begin  our  study  of  continuous  RVs  with  the  CDF. 
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Properties  of  the  CDF  [Eqs.  (8.25)  and  (8.26)]  derived  earlier  are  general  and  are  valid 
for  continuous  as  well  as  discrete  RVs. 

Probability  Density  Function:  From  Eq.  (8.26),  we  have 

Fx(x  +  Ax)  =  Fx(x)  +  P(x  <  x  <  *  +  Ax)  (8.27a) 

If  Ax  -*■  0,  then  we  can  also  express  Fx(x  +  Ax)  via  Taylor  expansion  as 


dF 

Fx  (x  +  Ax)  ~  Fx  (x)  +  *  A.v  (8.27b) 

ax 

From  Eqs.  (8.27),  it  follows  that  as  Ax  0, 

dFx(x) 

—j^—Ax  =  P(x  <  x  <  x  +  Ax)  (8.28) 

We  designated  the  derivative  of  Fx(x)  with  respect  to.v  by  px(x)  (Fig.  8.9), 

dFx(x) 

-£T-  =P*(x)  (8.29) 

The  function  px(x)  is  called  the  probability  density  function  (PDF)  of  the  RV  x.  It  follows 
from  Eq.  (8.28)  that  the  probability  of  observing  the  RV  x  in  the  interval  (x,  x  +  Ax)  is 

px(x)Ax  (Ax  — >  0).  This  is  the  area  under  the  PDF  px(x)  over  the  interval  Ax,  as  shown  in 
Fig.  8.9b. 


Figure  8.9 

(a)  Cumulative 
distribution 
function  (CDF). 

(b)  Probability 
density  function 
(PDF). 


(b) 
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From  Eq.  (8.29),  we  can  see  that 


Fx(x)  =  /  px(u)du 


(8.30) 


-00 


Here  we  use  the  fact  that  Fx(— oo)  =  0.  We  also  have  from  Eq.  (8.26) 


P(x i  <  x  <  x2)  =  Fx(x 2)  -  Fx(x i) 

px(x)dx 


Px(x)dx 


(8.31) 


Thus,  the  probability  of  observing  x  in  any  interval  (jt|,  x2)  is  given  by  the  area  under  the 
PDF Px(x)  over  the  interval  (jci,  *2),  as  shown  in  Fig.  8.9b.  Compare  this  with  a  continuously 
loaded  beam  (Fig.  3.5b),  where  the  weight  over  any  interval  was  given  by  an  integral  of  the 
loading  density  over  the  interval. 

Because  Fx(o o)  =  1,  we  have 


(8.32) 


This  also  follows  from  the  fact  that  the  integral  in  Eq.  (8.32)  represents  the  probability  of 
observing  x  in  the  interval  (-00,  00).  Every  PDF  must  satisfy  the  condition  in  Eq.  (8.32).  It 
is  also  evident  that  the  PDF  must  not  be  negative,  that  is. 


Px(x)  >  0 


Although  it  is  true  that  the  probability  of  an  impossible  event  is  0  and  that  of  a  certain  event 
is  1,  the  converse  is  not  true.  An  event  whose  probability  is  0  is  not  necessarily  an  impossible 
event,  and  an  event  with  a  probability  of  1  is  not  necessarily  a  certain  event.  This  may  be 
illustrated  by  the  following  example.  The  temperature  T  of  a  certain  city  on  a  summer  day  is 
an  RV  taking  on  any  value  in  the  range  of  5  to  50°C.  Because  the  PDF pr(T)  is  continuous,  the 
probability  that  T  =  34.56,  for  example,  is  zero.  But  this  is  not  an  impossible  event.  Similarly, 
the  probability  that  T  takes  on  any  value  but  34.56  is  1.  although  this  is  not  a  certain  event. 
In  fact,  a  continuous  RV  x  takes  every  value  in  a  certain  range.  Yet  px(x),  the  probability  that 
x  =  x,  is  zero  for  every  x  in  that  range. 

We  can  also  determine  the  PDF  px(x)  for  a  discrete  random  variable.  Because  the  CDF 
Fx(jc)  for  the  discrete  case  is  always  a  sequence  of  step  functions  (Fig.  8.8),  the  PDF  (the 
derivative  of  the  CDF)  will  consist  of  a  train  of  positive  impulses.  If  an  RV  x  takes  values 
x\ ,  JC2,  . . . ,  x„  with  probabilities  m,  a2,  ■  •  Qm  respectively,  then 


Fx(x)  =a\u(x-x\)  +  a2u(x-  x2)  + - h  a„u(x  -  x„)  (8.33a) 

This  can  be  easily  verified  from  Example  8.15  (Fig.  8.8).  Hence, 

px(x)  =  at  8(x  - x\)  +  a2&(x  - x2)  +  ■  ■  ■  +  a„  8(x  -  x„ ) 


n 


r=  1 


(8.33b) 
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Figure  8.10 

(a)  Gaussian 
PDF.  (b)  Function 
Q(y).  (c)  CDF  of 
the  Gaussian 
PDF. 


It  is,  of  course,  possible  to  have  a  mixed  case,  where  a  PDF  may  have  a  continuous  part  and 
an  impulsive  part  (see  Prob.  8.2-6). 

The  Gaussian  Random  Variable 

Consider  a  PDF  (Fig.  8.10) 


px(x)  =  ~j=e  xl/ 2 
v27T 


(8.34) 


This  is  a  case  of  the  well-known  standard  Gaussian,  or  normal,  probability  density.  It  has 
zero  mean  and  unit  variance.  This  function  was  named  after  the  famous  mathematician  Carl 
Friedrich  Gauss. 

The  CDF  Fx(x)  in  this  case  is 
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This  integral  cannot  be  evaluated  in  a  closed  form  and  must  be  computed  numerically.  It  is 
convenient  to  use  the  function  (?(.),  defined  as2 

Q(y)  =  -4=f  e-^,2dx  (8.35) 

V  2n  Jy 

The  area  under pAx)  from y  to  oo  (shaded  in  Fig.  8.10a)  is*  Q(y).  From  the  symmetry  of /;x(.v) 
about  the  origin,  and  the  fact  that  the  total  area  under  px  (x)  =  1,  it  follows  that 

Q(-y)  =  1  -Q(y)  (8.36) 

Observe  that  for  the  PDF  in  Fig.  8.10a,  the  CDF  is  given  by  (Fig.  8. 10c) 

FAx)  =  1  -  Q(x)  (8.37) 

The  function  Q(x)  is  tabulated  in  Table  8.2  (see  also  later:  Fig.  8. 1 2d).  This  function  is  widely 
tabulated  and  can  be  found  in  most  of  the  standard  mathematical  tables.2, 3  It  can  be  shown 
that,4 


Q(x)  ~ 


for  x  »  1 


(8.38a) 


For  example,  when  x  =  2,  the  error  in  this  approximation  is  18.7%.  But  for  x  =  4  it  is  10.4% 
and  for  x  =  6  it  is  2.3%. 

A  much  better  approximation  to  Q(x)  is 


1 


x  >  2 


(8.38b) 


The  error  in  this  approximation  is  just  within  l%for.v  >  2.15.  For  larger  values  of  a:  the  error 
approaches  0. 

A  more  general  Gaussian  density  function  has  two  parameters  ( m ,  a)  and  is  (Fig.  8. 1 1 ) 

pAx)  =  ‘  (8 .39) 

CTy/ZTT 


For  this  case, 


FAx)  = 


Gy/27T 


f 


?—(x—m)2/2o- 


*  The  function  Q(x)  is  closely  related  to  functions  erf  (x)  and  erfcU). 

erfc  (x)  =  4=  f  e~r  d>  =  2<2(jtn/2) 

Jx 

GW  =  i erfc  (-^)  =  |[i- erf  (^)] 


Therefore, 
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TABLE  8.2' 

Q(x) 


x  0.00  0.01  0.02  0.03  0.04  0.05  0.06  0.07  0.08  0.09 


O.(XXH) 

.5000 

.4960 

.  I(XX) 

.4602 

.4562 

.2000 

.4207 

.4168 

.3000 

.3821 

.3783 

.4(XX) 

.3446 

.3409 

.5  (XX) 

.3085 

.3050 

.6(XX) 

.2743 

.2709 

.7(XX) 

.2420 

.2389 

.80(X) 

.2119 

.2090 

.9(XX) 

.1841 

.1X14 

l.(XX) 

.1587 

.1562 

1.100 

.1357 

.1335 

1.2(X) 

.1151 

.1131 

1.300 

.9680E-01 

.95 1 OE-0 1 

l.4(X) 

.8076E-01 

.7927E-0I 

1.500 

.6681E-0I 

6552E-01 

1.600 

.5480E-0I 

.5370E-0I 

1.7(X) 

4457E-01 

.4363E-0I 

1.8(X) 

.3593E-OI 

.35 1 5E-0I 

1.900 

.2872E-OI 

.2807E-01 

2. (XX) 

.2275E-OI 

.2222E-OI 

2.100 

.I786E-01 

.I743E-01 

2.200 

.  1 390E-0 1 

.I355E-01 

2.300 

.  1 072E-01 

.  1 044E-0 1 

2.4CX) 

.8I98E-02 

.7976E-02 

2.500 

.62 1 0E-02 

.6037E-02 

2.600 

.466  IE-02 

.4527E-02 

2.700 

.3467E-02 

.3364E-02 

2.800 

.2555E-02 

.2477E-02 

2.900 

.  1 866E-02 

.I807E-02 

3. (XX) 

.  1 350E-02 

.  1 306E-02 

3.100 

9676E-03 

.9354E-03 

3.200 

.687 1 E-03 

.663 7 E-03 

3.300 

4834E-03 

.4665 E-03 

3.400 

.3369E-03 

.3248E-03 

3.5(H) 

.2326E-03 

.2241  E-03 

3.660 

.1591  E-03 

.1531  E-03 

3.7(H) 

I078E-03 

.1036E-03 

3.800 

.7235E-04 

.6948E-04 

3.900 

.48I0E-04 

.46I5E-04 

4.000 

.3167E-04 

.3036E-04 

4.KX) 

.20661:  1)4 

.I978E-04 

4.200 

.  1 335E-04 

.  1 277E-04 

4.300 

8540E-05 

.8163E-05 

4.400 

.5413E-05 

.5 1 69E-05 

4.500 

.3398E-05 

.324  IE-05 

4.600 

.21 12E-05 

.2013E-05 

4.700 

.130  IE-05 

.  1 239E-05 

4.800 

.793 3  E-06 

•7547E-06 

4.900 

.4792E-06 

.4554E-06 

5.000 

.2867E-06 

.2722E-06 

5.100 

.  1 698E-06 

.1611 E-06 

.4920 

.4880 

.4840 

.4522 

.4483 

.4443 

.4129 

.4090 

.4052 

.3745 

.3707 

.3669 

.3372 

.3336 

.3300 

.3015 

.2981 

.2946 

.2676 

.2643 

.2611 

.2358 

.2327 

.2296 

.2061 

.2033 

.2005 

.1788 

.1762 

.1736 

.1539 

.1515 

.1492 

.1314 

.1292 

.1271 

.1112 

.1093 

.1075 

.9342E-01 

.9176E-0I 

.90I2E-01 

.7780E-01 

.7636E-01 

.7493E-01 

.6426E-01 

.630  IE-01 

.61 78E-01 

.5262E-01 

.5 155E-OI 

.5050E-01 

.4272E-OI 

.4182E-01 

.4093E-0I 

.3438E-0I 

.3362E-01 

.3288E-01 

.27436-01 

.2680E-01 

.2619E-0I 

.2I69E-0I 

.21 18E-01 

.2068E-01 

.  1 7(H)E-()I 

.  1 659E-0 1 

.161 8E-01 

.132  IE-01 

.1287E-01 

.  1 255E-0 1 

.10176-01 

.9903E-02 

.9642E-02 

.7760E-02 

.7549E-02 

.7344E-02 

58686-02 

.5703E-02 

.5543E-02 

.4396E-02 

.4269E-02 

.4145E-02 

.32646-02 

.3I67E-02 

.3072E-02 

.240  IE-02 

.2327E-02 

.2256E-02 

.17501  Q2 

.1695E-02 

•I64IE-02 

.  1 264E-02 

.  1 223E-02 

.  1 1 83E-02 

.9043E-03 

.8740E-03 

.8447E-03 

.64 1 0E-03 

.6 1 90E-03 

.5976E-03 

.4501  E-03 

.4342E-03 

.4 1 89E-03 

.3131 E-03 

.3018E-03 

.2909E-03 

.2158E-03 

.2078E-03 

.2001  E-03 

.  I473E-03 

.1417E-03 

.1363 E-03 

.996 1 E-04 

.9574E-04 

.9201  E-04 

.6673E-04 

.6407 E-04 

.6152E-04 

.4427E-04 

.4247E-04 

.4074E-04 

.29I0E-04 

.2789E-04 

•2673E-04 

.  1 894E-04 

•  I814E-04 

•1737E-04 

.  1 222E-04 

.  1 1 68E-04 

.1 1 18E-04 

.7801E-05 

.7455E-05 

.7124E-05 

•4935E-05 

.47 1 2E-05 

•4498E-05 

.3092E-05 

.2949E-05 

.2813E-05 

.191 9E-05 

.1828E-05 

.  1 742E-05 

.  1 1 79E-05 

.  1 1 23E-05 

.1069E-05 

.7178E-06 

.6827E-06 

.6492E-06 

4327E-06 

.4111  E-06 

•3906E-06 

.2584E-06 

•2452E-06 

.2328E-06 

.1528E-06 

.1449E-06 

.  1 374E-06 

.4801 

.4761 

.4721 

.4404 

.4364 

.4325 

.4013 

.3974 

.3936 

.3632 

.3594 

.3557 

.3264 

.3228 

.3192 

.2912 

.2877 

.2843 

.2578 

.2546 

.2514 

.2266 

.2236 

.2206 

.1977 

.1949 

.1922 

.1711 

.1685 

.1660 

.1469 

.1446 

.1423 

.1251 

.1230 

.1210 

.1056 

.1038 

.1020 

.8851E-01 

.8691E-0I 

.8534E-01 

.7353E-01 

.7215E-01 

.7078E-01 

.6057E-01 

.5938E-01 

.5821E-01 

.4947E-01 

.4846E-0I 

.4746E-01 

.4006E-0 1 

.3920E-01 

.3836E-01 

.3216E-01 

.3 144E-01 

.3074E-01 

.2559E-01 

.2500E-01 

.2442E-01 

.2018E-01 

.1970E-01 

.  1923E-01 

.1578E-01 

.1539E-01 

.  1 500E-0 1 

.1222E-01 

.1 191E-01 

.1 160E-01 

.9387 E-02 

.9 1 37E-02 

.8894E-02 

.7143E-02 

.6947E-02 

.6756E-02 

.5386E-02 

.5234E-02 

.5085E-02 

.4025 E-02 

.3907E-02 

.3793E-02 

.2980E-02 

.2890E-02 

.2803E-02 

.2186E-02 

.21 18E-02 

.2052E-02 

.1589E-02 

.1538E-02 

.  1489E-02 

.1 144E-02 

.  1 1 07E-02 

.  1070E-02 

.8164E-03 

.7888E-03 

.7622E-03 

.5770E-03 

.5571  E-03 

.5377E-03 

.4041  E-03 

.3897E-03 

.3758E-03 

•2802E-03 

.2701  E-03 

.2602E-03 

.1926E-03 

.  1 854E-03 

.  1 785E-03 

.1311  E-03 

.1261  E-03 

.1213E-03 

.8842E-04 

.8496E-04 

.8162E-04 

.5906E-04 

.5669E-04 

.5442E-04 

•3908E-04 

•3747E-04 

.3594E-04 

.2561  E-04 

.2454E-04 

.2351  E-04 

.  1 662E-04 

.1591  E-04 

.1523E-04 

.  1 069E-04 

.1022E-04 

.9774E-05 

.8807E-05 

•6503E-05 

.62 1 2E-05 

.4294E-05 

.4098E-05 

.391  IE-05 

•2682E-05 

.2558E-05 

.2439E-05 

.  1 660E-05 

1581E-05 

.  1 506E-05 

1017E-05 

.9680E-06 

.92 11  E-06 

.6173E-06 

.5869E-06 

.5580E-06 

.37 11  E-06 

.3525E-06 

.3448E-06 

2209E-06 

•2096E-06 

.  1 989E-06 

.  1 302E-06 

1235E-06 

.1 170E-06 

.4681 

.4641 

.4286 

.4247 

.3897 

.3859 

.3520 

.3483 

.3156 

.3121 

.2810 

.2776 

.2483 

.2451 

.2177 

.2148 

.1894 

.1867 

.1635 

.1611 

.1401 

.1379 

.1190 

.1170 

.1003 

.9853E-0I 

.8379E-01 

.8226E-01 

.6944E-0 1 

.68I1E-0I 

.5705 E-01 

.5592E-0I 

.4648E-0 1 

.455  IE-01 

.3754E-01 

.3673E-01 

.3005E-01 

.2938E-0I 

.2385E-01 

2330E-0I 

.  1 876E-0 1 

.183IE-0I 

.1463E-01 

.I426E-0I 

.1130E-01 

.1 10IE-0I 

.8656E-02 

.8424E-02 

.6569E-02 

.6387E-02 

.4940E-02 

.4799E-02 

.368  IE-02 

.3573E-02 

.2718E-02 

.2635E-02 

.1988E-02 

.1926&02 

.1441E-02 

.  1 395E-02 

.1035E-02 

.1001  E-02 

.7364E-03 

.71 14E-03 

.5 1 90E-03 

.5009E-03 

.3624E-03 

.3495  E-03 

.2507E-03 

.24151 

.171 8E-03 

.  1 653E-03 

.  1 1 66E-03 

.1121  E-03 

.7841  E-04 

.7532E-0* 

.5223E-04 

.5012E-04 

.3446E-04 

.3304E-04 

.2252E-04 

.2I57E-0* 

.1458E-04 

.I395E-0* 

.9345 E-05 

.8934E-05 

.5934E-05 

.5668E-05 

.3732E-05 

.356  IE-05 

.2325E-05 

.22 1 6E-05 

.1434E-05 

.1366E-05 

.8765E-06 

.8339E-06 

.5304E-06 

.5042E-06 

.3 1 79E-06 

.3019E-06 

.1887E-06 

.I790E-06 

.  1 1 09E-06 

.  105  IE-06 

(continued 
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TABLE  8.2 

Continued 


X 

0.00 

0.01 

0.02 

0.03 

0.04 

0.05 

0.06 

0.07 

0.08 

0.09 

5.200 

.9964E-07 

.9442E-07 

.8946E-07 

.8476E-07 

.8029E-07 

.7605E-07 

.7203E-07 

.6821  E-07 

.6459E-07 

.61 I6E-07 

5.300 

.5790E-07 

.548 1 E-07 

.5188E-07 

.49 11  E-07 

.4647E-07 

.4398E-07 

.4 161  E-07 

.3937E-07 

3724E-07 

.35 23 E-07 

5.400 

.3332E-07 

.3151E-07 

.2980E-07 

.2818E-07 

.2664E-07 

.25 1 8E-07 

.2381  E-07 

.2250E-07 

.2 1 27E-07 

.2010E-07 

5.500 

.  1 899E-07 

.  1 794E-07 

.1695E-07 

.1601  E-07 

.151 2E-07 

.1428E-07 

.  1 349E-07 

.  1 274E-07 

.  1 203 E-07 

.  1 135  E-07 

5.600 

.1072E-07 

.101 2E-07 

.9548E-08 

.9010E-08 

.8503E-08 

.8022E-08 

.7569E-08 

.7140E-08 

.6735E-08 

.6352E-08 

5.700 

.5990E-08 

.5649E-08 

.5326E-08 

.5022E-08 

.4734E-08 

.4462E-08 

.4206E-08 

.3964E-08 

.3735E-08 

.35 1 9E-08 

5.800 

.3316E-08 

.3124E-08 

.2942E-08 

.2771  E-08 

.2610E-08 

.2458E-08 

.2314E-08 

.2 1 79E-08 

.205  IE-08 

.193  IE-08 

5.900 

.181 8E-08 

.1711 E-08 

.161 0E-08 

.1515E-08 

.1425E-08 

.1341  E-08 

.1261  E-08 

.  1 1 86E-08 

.111  6E-08 

.1049E-08 

6.000 

.9866E-09 

.9276E-09 

.8721E-09 

.8198E-09 

.7706E-09 

.7242E-09 

.6806E-09 

.6396E-09 

.6009E-09 

.5646E-09 

6.100 

.5303E-09 

.4982E-09 

.4679E-09 

.4394E-09 

.4 1 26E-09 

.3874E-09 

.3637E-09 

.3414E-09 

.3205E-09 

.3008E-09 

6.200 

.2823E-09 

.2649E-09 

.2486E-09 

.2332E-09 

.2188E-09 

.2052E-09 

.1925E-09 

.I805E-09 

.  1 692E-09 

.  1587E-09 

6.300 

.1488E-09 

.1395E-09 

.1308E-09 

.  1 226E-09 

.1 149E-09 

.1077E-09 

.  1 009E-09 

.945  IE- 10 

.8854E-10 

.8294E-I0 

6.400 

.7769E-10 

.7276E-10 

.6814E-10 

.6380E-10 

.5974E-10 

.5593E-10 

.5235E-10 

.4900E-I0 

4586E-10 

.4292E-10 

6.500 

.4016E-10 

.3758E-IO 

.3515E-10 

.3288E-10 

.3077E-10 

.2877E-10 

.2690E-10 

.25I6E-10 

.2352E-10 

.2199E-10 

6.600 

.2056E-10 

.1922E-10 

.  1 796E- 1 0 

.  1678E-10 

.1568E-10 

.1465E-10 

.1369E-10 

.1279E-10 

.1 195E-10 

.111  6E- 1 0 

6.700 

.1042E-10 

.973  IE- 11 

.9086E- 1 1 

.8483E-1 1 

.7919E-1 1 

.7392E-1 1 

.6900E- 1 1 

.6439E-1 1 

.6009E-1 1 

.5607E- 1 1 

6.800 

.523  IE- II 

.4880E-1 1 

.4552E-1 1 

.4246E- 1 1 

.3960E-1 1 

.3692E- 1 1 

.3443E- 1 1 

.32IOE-1 1 

.2993E- 1 1 

.2790E-1 1 

6.900 

-2600E- 1 1 

.2423E-1 1 

.2258E-1 1 

.2104E-1 1 

.  1 960E- 1 1 

.1826E-1 1 

. 1 70 1 E- 1 1 

.1585E-1 1 

.1476E-1 1 

. 1374E-1 1 

7.000 

.1280E-1 1 

.  1 1 92E- 1 1 

.1 109E-1 1 

.1033E-1 1 

.9612E-12 

.8946E- 1 2 

.8325E-12 

.7747E-12 

.7208E-12 

.6706E- 1 2 

7.100 

.6238E-I2 

.5802E-12 

.5396E- 1 2 

.5018E-12 

.4667E- 1 2 

.4339E- 1 2 

.4034E- 1 2 

.3750E-12 

.3486E-I2 

.3240E-12 

7.200 

.301  IE- 12 

.2798E-12 

.2599E- 1 2 

.2415E-12 

.2243E- 1 2 

.2084E- 1 2 

.1935E-12 

.1797E-12 

I669E-I2 

.I550E-I2 

7.300 

•  I439E-12 

.  1 336E- 1 2 

.  1 240E- 1 2 

.  1 1 5 1 E- 1 2 

1068E-12 

.9910E-13 

.9196E-13 

.853  IE- 13 

.79 1 4E- 1 3 

.734  IE- 13 

7.400 

.6809E-I3 

•6315E-13 

.5856E-13 

.5430E- 1 3 

.5034E- 1 3 

.4667E- 1 3 

.4326E- 1 3 

4010E-13 

.37I6E-I3 

.3444E-13 

7.500 

3I9IE-13 

.2956E-13 

.2739E-13 

.2537E-13 

.2350E- 1 3 

.2176E-I3 

.2015E-13 

.  1 866E- 1 3 

.  1 728E- 1 3 

.  1600E-13 

7.600 

•  I48IE-I3 

. 1 370E- 1 3 

. 1 268E- 1 3 

.  1 1 74E- 1 3 

.  1 086E- 1 3 

.1005E-13 

.9297E-14 

.8600E-14 

.7954E-I4 

.7357E-I4 

7.700 

.6803E-14 

.629  IE- 14 

.5816E-14 

.5377E-14 

.497  IE- 14 

.4595E-14 

4246E-14 

.3924E-14 

.3626E-14 

.3350E-14 

7.800 

•3095E-14 

.2859E-14 

.2641E-14 

.2439E-14 

.2253E-14 

.2080E-14 

.  1 92 1 E- 1 4 

.1773E-14 

.  1 637E- 14 

.151 1E-I4 

7.900 

-1395E-I4 

.1287E-14 

.1 188E-14 

.1096E-14 

.1011 E- 14 

.9326E-15 

.8602E-I5 

.7934E-I5 

.7317E-15 

.6747E-15 

8.000 

.622  IE- 15 

.5735E-15 

.5287E-15 

.4874E-15 

.4492E-15 

.4 1 40E- 1 5 

.3815E-15 

.3515E-15 

.3238E-I5 

.2983E-15 

8.100 

•2748E-I5 

.253  IE- 15 

.233  IE- 15 

.2 1 46E- 1 5 

.  1 976E- 1 5 

. 1 820E- 1 5 

.1675E-15 

.  1 542E- 1 5 

.I4I9E-15 

1306E-I5 

8.200 

•  1202E-I5 

.  1 1 06E- 1 5 

.1018E-15 

.936  IE- 16 

.861  IE-16 

.7920E-16 

.7284E-16 

.6698E- 1 6 

.6I59E-I6 

5662E-16 

8.300 

•5206E-I6 

.4785E-16 

.4398E-16 

.4042E-16 

.3715E-16 

.3413E-16 

.3136E-16 

.288  IE- 16 

.2646E-16 

.243  IE- 16 

8.400 

•2232E-16 

.2050E-16 

.1882E-16 

.1728E-16 

.1587E-16 

.1457E-16 

1337E-16 

1227E-16 

.1 126E-I6 

.  1033E-I6 

8.500 

9480E-I7 

.8697E-17 

.7978E-17 

.7317E-17 

.67 1 1 E- 1 7 

.6154E-17 

.5643E-17 

.5 1 74E- 1 7 

.4744E-17 

4348E-I7 

8.600 

•3986E-17 

.3653E-17 

.3348E-17 

.3068E-17 

.281  IE- 17 

.2575E-17 

.2359E-17 

.2161 E- 1 7 

.1979E-17 

.I8I2E-I7 

8.700 

•  1659E-17 

.151 9E- 1 7 

.  1 39 1 E- 1 7 

.1273E-17 

.  1 1 66E- 1 7 

.1067E-17 

.9763E-18 

8933E-18 

.8I74E-18 

.7478E-I8 

8.800 

•684IE-18 

.6257E-18 

.5723E-18 

.5234E- 1 8 

.4786E-18 

.4376E- 1 8 

.400 1 E- 1 8 

.3657E-18 

.3343E-18 

.3055E-18 

8.900 

2792E-I8 

.2552E-18 

.233  IE- 18 

.2130E-18 

.  1 946E- 1 8 

.1777E-18 

.1623E-18 

1483E-18 

I354E-I8 

. 1 236E- 1 8 

9.000 

1129E-I8 

.1030E-18 

.9404E-19 

.8584E-19 

.7834E-19 

.7148E-19 

.6523E-19 

.595  IE- 19 

.5429E-I9 

.4952E-19 

9.100 

■45I7E-I9 

.41 19E-19 

.3756E-19 

.3425E-19 

.3123E-19 

.2847E-19 

.2595E-19 

.2365E-19 

.2155E-I9 

.  1 964E- 1 9 

9.200 

•  I790E-I9 

.  1 63 1 E- 1 9 

.1486E-19 

.1353E-19 

.1232E-19 

.  1 1 22E- 1 9 

.1022E-19 

.9307E-20 

.8474E-20 

.7714E-20 

9.300 

7022E-20 

.6392E-20 

.5817E-20 

.5294E-20 

.4817E-20 

.4382E-20 

.3987E-20 

3627E-20 

.3299E-20 

.3000E-20 

9.400 

•2728E-20 

.248 1 E-20 

.2255E-20 

.2050E-20 

.  1 864E-20 

.1694E-20 

.  1 540E-20 

.  1 399E-20 

.1271  E-20 

.  1 155E-20 

9.500 

•  1049E-20 

.9533E-21 

.8659E-21 

.7864E-21 

.7142E-21 

.6485E-21 

.5888E-21 

.5345E-21 

.4852E-21 

.4404E-21 

9.600 

•3997E-21 

.3627E-21 

.3292E-21 

.2986E-21 

.2709E-21 

.2458E-21 

.2229E-21 

.2022E-21 

.  1 834E-2 1 

.  1663E-21 

9.700 

I507E-21 

.  1 367E-2 1 

.1239E-21 

.  1 123E-21 

.1018E-21 

.9223E-22 

.8358E-22 

.7573E-22 

.6861  E-22 

.62I5E-22 

9.800 

5629E-22 

.5098E-22 

.4617E-22 

.4181E-22 

.3786E-22 

.3427E-22 

.3 102E-22 

.2808E-22 

.2542E-22 

.2300E-22 

9.900 

•208  IE-22 

.1883E-22 

.1704E-22 

.1541 E-22 

.  1 394E-22 

.1261  E-22 

.1 140E-22 

.1031  E-22 

.9323E-23 

.8429E-23 

10.00 

•7620E-23 

.6888E-23 

.6225E-23 

.5626E-23 

.5084E-23 

.4593E-23 

.4150E-23 

3749B-23 

.3386E-23 

.3058E-23 

Notes:  (1)  E-01  should  be  read  as  x  I0_l;  E-02  should  be  read  as  x  10  and  soon. 

(2)  This  table  lists  Q(x)  for  .r  in  the  range  of  0  to  10  in  the  increments  of  0.01 .  To  find  0(5.36)  for  example, 
look  up  the  row  starting  with  x  =  5.3.  The  sixth  entry  in  this  row  (under  0.06)  is  the  desired  value 

0.4161  x  10-7. 
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Figure  8.12 

Error  probability 
in  threshold 
detection: 

(a)  transmitted 
pulse;  (b)  noise 
PDF;  (c)  received 
pulses  with  noise 
(d)  detection 
error  probability. 


Letting  ( x  —  m)/cr  =  z. 


(8.40a) 


Therefore, 


and 


/><*<*) -l-o(i^!) 

p(x>«  =  e(  £zi) 


(8.40b) 


(8.40c) 


The  Gaussian  PDF  is  perhaps  the  most  important  PDF  in  the  field  of  communications.  The 
majority  of  the  noise  processes  observed  in  practice  are  Gaussian.  The  amplitude  n  of  a  Gaussian 
noise  signal  is  an  RV  with  a  Gaussian  PDF.  This  means  the  probability  of  observing  n  in  an 
interval  (n,  n  +  An)  i$pn(n)An,  where  p„(n)  is  of  the  form  in  Eq.  (8.39)  [with  m  =  0]. 


Example  8. 16  Threshold  Detection 

Over  a  certain  binary  channel,  messages  m=  0  and  1  are  transmitted  with  equal  probability  by 
usuij.  a  positive  and  a  negative  pulse,  respectively.  The  received  pulse  corresponding  to  1  is 
p(t)  shown  in  Fig.  8.12a,  and  the  received  pulse  corresponding  to  0  is  -p(t).  Let  the  peak 

amplitude  ot  p(t)  be  Apatt  =  Tp.  Because  of  the  channel  noise  n(f),  the  received  pulses  will 
be  (Fig.  8.12c) 


— r  \m  /  i  v*  / 

To  detect  the  pulses  at  the  receiver,  each  pulse  is  sampled  at  its  peak  amplitude.  In  the  absence 
of  no.se  the  sampler  output  is  either  Ap  (for  m=l)  or  -A„  (for  m=0).  Because  of  the  channel 

("fiTx  nerP  Lrp\;tnUt  ^  ^zAp  +  W^ere  n’  t*1e  no'se  amplitude  at  the  sampling  instant 
(Fig.  8. 1 2b).  is  an  R\ .  For  Gaussian  noise,  the  PDF  of  n  is  (Fig.  8. 1 2b) 


Pn(rt)  = 


o„\/2n 


„-n2/2a„2 


(8.41) 
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Figure  8.12 

Error  probability 
in  threshold 
detection: 

(a)  transmitted 
pulse;  (b)  noise 
PDF;  (c)  received 
pulses  with  noise; 
(d)  detection 
error  probability. 


Because  of  the  symmetry  of  the  situation,  the  optimum  detection  threshold  is  zero;  that 
is,  the  received  pulse  is  detected  as  a  1  or  a  0,  depending  on  whether  the  sample  value  is 
positive  or  negative. 

Because  noise  amplitudes  range  from  -oo  to  oo,  the  sample  value  -Ap  +  n  can 
occasionally  be  positive,  causing  the  received  0  to  be  read  as  1  (see  Fig.  8. 1 2b).  Similarly, 
Ar  +  n  can  occasionally  be  negative,  causing  the  received  1  to  be  read  as  0.  If  0  is 
transmitted,  it  will  be  detected  as  1  if  ~Ap  -f-  n  >  0,  that  is,  it  n  >  Ap. 

If  P(e|0)  is  the  error  probability  given  that  0  is  transmitted,  then 


P(e  |0)  =  P( n  >  Ap) 
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Because  P( n  >  Ap)  is  the  shaded  area  in  Fig.  8.12b  to  the  right  of  Ap ,  from  Eq.  (8.40c) 
[with  m  =  0|  it  follows  that 


(8.42a) 


Figure  8.13 

(a)  Joint  PDF. 

(b)  Conditional 
PDF. 


Similarly, 


P(€\l)  =  P(n  <  -Ap) 

=  =/,(e|0) 


(8.42b) 


and 


Pe  =  Y^j  P(€’  m<  ) 


=  ^  P(mi)P(€\mi) 
i 


The  error  probability  Pe  can  be  found  from  Fig.  8. 1 2d. 


(8.42c) 


Joint  Distribution 

For  two  RVs  x  and  y,  we  define  a  CDF  Fxy( x,  y)  as  follows: 


and  the  joint  PDF  pxy(x,  y)  as 


pxy(x,  y )  —  F(x  <  x  and  y  <  y) 


Pxy(x' y)  =  y) 


(8.43) 


(8.44) 


Arguing  along  lines  similar  to  those  used  for  a  single  variable,  we  can  show  that  as  Ax  0 
and  Ay  ->  0 

Pxy  (x,  y)Ax  Ay  =  P(x  <  x  <  x  +  Ax,  y  <  y  <  y  +  Ay)  (8-45) 

Hence  the  probability  of  observing  the  variables  x  in  the  interval  (x,  x  +  Ax)  and  y  in  the 
interval  (y,  y  +  Ay)  jointly  is  given  by  the  volume  under  the  joint  PDF  pxv(x,  y)  over  the 
region  bounded  by  (x,  x  +  Ax)  and  (y,  y  +  Ay),  as  shown  in  Fig  8  1 3a  ' 

From  Eq.  (8.45),  it  follows  that  * 


P(x]  <xZ*2’y\<y<y2)=  r  r Pxy(x,  y ) dxdy 

Jx,  Jy, 


(8.46) 


*  -  j\ 

Thus,  the  probability  of  jointly  observing  x  in  the  interval  (x,,x2)  and  y  in  the  interval  (vi,  yH 
is  the  volume  under  the  PDF  over  the  region  bounded  by  (x,,  x2)  and  (y,,  y2). 
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The  event  of  observing  x  in  the  interval  (-00,  00)  and  observing  y  in  the  interval  (-00,  00) 
is  a  certainty.  Hence. 


Pxy(x,  y)dxdy  =  I 


(8.47) 


Thus,  the  total  volume  under  the  joint  PDF  must  be  unity. 

When  we  are  dealing  with  two  RVs  x  and  y.  the  individual  probability  densities  px(x) 
and  py(y)  can  be  obtained  from  the  joint  density  pxy(x,  y).  These  individual  densities  are  also 
called  marginal  densities.  To  obtain  these  densities,  we  note  that  px(x)  Ax  is  the  probability 
of  observing  x  in  the  interval  (*,  *  +  Ax).  The  value  of  y  may  lie  anywhere  in  the  interval 
(—00,  00).  Hence, 


lim  px(x) Ax  =  lim  Probability  (x  <  x  <  *  +  Ax,  -00  <  y  <  00) 
Ax—>0  Ax-+0 

rx+Ax  roo 

=  lim  /  /  pXy(x,  y)dxdy 

Ax-+0  Jx  J—oo 

/oo  rx+Ax 

Pxy(x,  y)  dy  I  dx 

•OO  dx 


=  lim  A.v 

A.v->0 


f 


pxy(x,  y)dy 
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The  last  two  steps  follow  from  the  fact  that  pXy  (x,  y)  is  constant  over  (jc,  .v  +  Ax)  because 
Ax  — ►  0.  Therefore, 


Similarly, 

/•OO 

Px(x)  =  /  Pxy(x,y)dy 

J -00 

roo 

(8.48a) 

In  terms  of  the  CDF,  we  have 

Pyiy)  =  I  Pxy(x,  y)dx 

J -00 

(8.48b) 

Fy(y)  =  Fxy  (oo,  y) 

(8.49a) 

Fx(x)  =  Fxy(x,  oo) 

(8.49b) 

These  results  may  be  generalized  for  multiple  RVs  xi,  X2,  . . . ,  x„. 


Conditional  Densities 

The  concept  of  conditional  probabilities  can  be  extended  to  the  case  of  continuous  RVs.  We 
define  the  conditional  PDF  Px\y(x\}’j)  as  the  PDF  of  x  given  that  y  has  the  value  V/.  This  is 
equivalent  to  saying  that  px|y(jc|yy)  Axis  the  probability  of  observing  x  in  the  range  (x,  x+ Ax), 
given  that  y  =yj.  The  probability  density  Px\y(x\yj)  is  the  intersection  of  the  plane  y  =yj  with 
the  joint  PDF  pxy(x,  y)  (Fig.  8.13b).  Because  every  PDF  must  have  unit  area,  however,  we 
must  normalize  the  area  under  the  intersection  curve  C  to  unity  to  get  the  desired  PDF.  Hence, 

C  is  Apx\y(x\y),  where  A  is  the  area  under  C.  An  extension  of  the  results  derived  for  the  discrete 
case  yields 


Px|yC*|y)py(y)  =Px y(x,  y) 

(8.50a) 

PylxCykKW  =  Pxy(x,  y) 

(8.50b) 

and 

P.|yC»M  - 

Pyiy) 

(8.51a) 

Equation  (8.51a)  is  Bayes’  rule  for  continuous  RVs.  When  we 
discrete  and  continuous),  the  mixed  form  of  Bayes’  rule  is 

have  mixed  variables  (i.e-i 

^xiyWyOpyOO  =  PxMpyixCyl*) 

(8.51b) 

where  x  is  a  discrete  RV  and  y  is  a  continuous  RV.* 

Note  that  Px|y  Wy)  is  still,  first  and  foremost,  a  probability  density  function.  Thus, 


P*|y(x|y)  dx  =  LssPyb'  y)dx_  =  Py(y)  =  (8.52) 

00  Py(y)  py(y)  ~ 


Example  8.17 


Figure  8.14 

Derivation  of  the 
Rayleigh  density. 


It  may  be  worth  noting  that  PX|y(jr|y)  is  conditioned 


on  an  event  y  —  y  that  has  probability  zero. 


Independent  Random  Variables 

The  continuous  RVs  x  and  y  are  said  to  be  independent  if 
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/>x|y(*l;y)  =pAx ) 

In  this  case  from  Eqs.  (8.53a)  and  (8.51)  it  follows  that 

Py|x(yW  =py(y) 

This  implies  that  for  independent  RVs  x  and  y, 

Pxy(x,  y)  =Px(x)Py(y) 

Based  on  Eq.  (8.53c),  the  joint  CDF  is  also  separable: 


Fxy(x 


,y)=(  f  Pxy(v,  w)dwdv 

J-oo  J-oo 

=  f  Px(v)dv ■  f  py(w)dw 

J-oo  J-oo 

=  FX(X)  ■  Fy(y) 


Example  8.17  Rayleigh  Density 

The  Rayleigh  density  is  characterized  by  the  PDF  (Fig.  8.14b) 


Pr(r)  = 


_^-rW  r  >  0 

or 


r  <  0 


Figure  8.14 

Derivation  of  the 
Rayleigh  density. 


(8.53a) 


(8.53b) 


(8.53c) 


(8.54) 


(8.55) 
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A  Rayleigh  RV  can  be  derived  from  two  independent  Gaussian  RVs  as  follows.  Let  x  and 
y  be  independent  Gaussian  variables  with  identical  PDFs: 


Px(x)  = 


1 


o\/2n 


—x2 /la1 


PyCv)  =  — \=e  y2,2a 2 

o  \2tz 


Then 


Px y(x,  y )  =  pAx)Py(y)  =  _Le-^+>'2)^2  (8.56) 

The  joint  density  appears  somewhat  like  the  bell-shaped  surface  shown  in  Fig.  8.13.  The 
points  in  the  (.x,y)  plane  can  also  be  described  in  polar  coordinates  as  (r,  0 ),  where 
(Fig.  8.14a) 


r  =  Jx2  +  y2  0  =  tan-1  - 

X 

In  Fig.  8. 14a,  the  shaded  region  represents  r  <  r  <  r  +  dr  and  9  <  0  <  0  +  dO  (where 
clt  and  dO  both  — ►  0).  Hence,  if  pr&(r,  0)  is  the  joint  PDF  of  r  and  0,  then  by  definition 
[Eq.  (8.45)],  the  probability  of  observing  r  and  0  in  this  region  is  /?r0(r,  6)  drdO.  But  we 
^  also  know  that  this  probability  is/?xy(.r,  y )  times  the  area  rdrdO  of  the  shaded  region. 

Hence,  [Eq.  (8.56)] 

__<?-(*2+y2)2<72  rdfde  =  pr@(r>  G)  df.de 

and 


Pre(r,  6)  =  - — -e-(*2+y1)/2o2 


2  no2 


2  no2 


r 2/2a2 


(8.57) 


and  [Eq.  (8.48a)] 


/oo 

Pr«  (r,  0)de 

-00 

Because  0  exists  only  in  the  region  (0,  27r), 

r'lir 

Mr)  =  /  —e-^2 

Jo  ino1 

= 

Note  that  r  is  always  greater  than  0.  In  a  similar  way,  we  find 


d9 


(8.58a) 


Pe(0)  = 


0  <  ©  <  2jt 


8.3  STATI 


0,  otherwise 


(8.58b) 
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IRVs  r  and  0  are  independent  because  pr@(r,  6)  =  pr(r)pe(6).  The  PDF  pt(r)  is  the 
Rayleigh  density  function.  We  shall  later  show  that  the  envelope  of  narrowband  Gaussian 
noise  has  a  Rayleigh  density.  Both  pt(r)  and  />0(6>)  are  shown  in  Fig.  8.14b  and  c. 


8.3  STATISTICAL  AVERAGES  (MEANS) 


Averages  are  extremely  important  in  the  study  of  RVs.  To  find  a  proper  definition  for  the 
average  of  a  random  variable  x,  consider  the  problem  of  determining  the  average  height  of  the 
entire  population  of  a  country.  Let  us  assume  that  we  have  enough  resources  to  gather  data 
about  the  height  of  every  person.  If  the  data  is  recorded  within  the  accuracy  of  an  inch,  then 
the  height  x  of  every  person  will  be  approximated  to  one  of  the  n  numbers  x\ ,  jc2,  . . .  ,x„.  If 
there  are  V,  persons  of  height  jc(,  then  the  average  height  x  is  given  by 


N{x\  +  N2X2  4 - b  Nnx„ 

N 


x  = 


where  the  total  number  of  persons  is  N  =  '%  •  Hence, 


In  the  limit  as  N  00,  the  ratio  NJN  approaches  Px (xj )  according  to  the  relative  frequency 
definition  of  the  probability.  Hence, 


n 


X  =  J2 XipAxi ) 


1=1 


The  mean  value  is  also  called  the  average  value,  or  expected  value,  of  the  RV  x  and  is  denoted 
by  £[x],  Thus, 


(8.59a) 


We  shall  use  both  these  notations,  our  choice  depending  on  the  circumstances  and  convenience. 
If  the  RV  x  is  continuous,  an  argument  similar  to  that  used  in  arriving  at  Eq.  (8.59a)  yields 


(8.59b) 


This  result  can  be  derived  by  approximating  the  continuous  variable  x  with  a  discrete  variable 
by  quantizing  it  in  steps  of  Ax  and  then  letting  Ax  —*■  0. 

Equation  (8.59b)  is  more  general  and  includes  Eq.  (8.59a),  because  the  discrete  RV  can  be 
considered  as  a  continuous  RV  with  an  impulsive  density.  In  such  a  case,  Eq.  (8.59b)  reduces 
to  Eq.  (8.59a). 

As  an  example,  consider  the  general  Gaussian  PDF  given  by  (Fig.  8.11) 


(8.60a) 
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From  Eq.  (8.59b)  we  have 


Figure  8.15 

Random 
sampling  of  a 


x  = 


sine-wave 

generator. 


Changing  the  variable  to  x  =  y  +  m  yields 


The  first  integral  inside  the  bracket  is  zero,  because  the  integrand  is  an  odd  function  of  y.  The 
term  inside  the  square  brackets  is  the  integration  of  the  Gaussian  PDF,  and  is  equal  to  1 .  Hence, 


(8.60b) 


x  =  m 


Mean  of  a  Function  of  a  Random  Variable 

It  is  often  necessary  to  find  the  mean  value  of  a  function  of  a  RV.  For  instance,  in  practice  we 
are  often  interested  in  the  mean  square  amplitude  of  a  signal.  The  mean  square  amplitude  is 
the  mean  of  the  square  of  the  amplitude  x,  that  is,  x2. 

In  general,  we  may  seek  the  mean  value  of  an  RV  y  that  is  a  function  of  the  RV  x;  that  is, 
we  wish  to  find  y  where  y  =  g(x).  Let  x  be  a  discrete  RV  that  takes  values  jci ,  x2,  . . .  ,Jt„with 
probabilities  P\{x i),  P >, (a'2 ) ,  . . . , P x (xn).  respectively.  But  because  y  =  g(x),  y  takes  values 
#C*i).  g(x2),  ...,g(x„)  with  probabilities  />x(-*'i).  P\(x 2),  . . . ,  P^(xn),  respectively.  Hence, 
from  Eq.  (8.59a)  we  have 


n 


y  =  g(x)  =  Y,s(xi)Px(xi) 


(8.61a) 


;=i 


If  x  is  a  continuous  RV,  a  similar  line  of  reasoning  leads  to 


(8.61b) 


Example  8.18  The  output  voltage  of  sinusoid  generator  is  A  cos  cot.  This  output  is  sampled  randomly 


(  lg.  8. 1 5a).  The  sampled  output  is  an  RV  x,  which  can  take  on  any  value  in  the  range  (-4,  A)- 
Determine  the  mean  value  (x)  and  the  mean  square  value  (x2)  of  the  sampled  output  x. 

It  the  output  is  sampled  at  a  random  instant  t,  the  output  x  is  a  function  of  the  RV  t: 


x(r)  =  A  cos  cot 


If  we  let  rut  =  0,  0  is  also  an  RV.  and  if  we  consider  only  modulo-27r  values  of©,  then 
the  RV  0  lies  in  the  range  (0,  2n).  Because  t  is  randomly  chosen,  0  can  take  any  value 
in  the  range  (0,  In)  with  uniform  probability.  Because  the  area  under  the  PDF  must  be 
unity,  p<r)(0)  is  as  shown  in  Fig.  8.15b. 


Figure  8.15 

Random 
sampling  of  a 
sine-wave 
generator. 
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Sine-wave 

generator 


Voltmeter 


(a) 


1 

27 T 

0 

2tt  e — 

(b) 


The  RV  x  is  thus  a  function  of  another  RV,  0, 

x  =  A  cos  0 

Hence,  from  Eq.  (8.61b), 

r2n  j  r2n 

x  =  I  xp®(0)d0  =  —  /  Acos0d0  =  0 
JO  27T  J 0 

and 

_  [2  71  ^2  f2jr 

x2  =  /  x2pe(&)dO=—  I  cos  20d0 

Jo  2tt  Jo 

=  A* 

2 


Similarly,  for  the  case  of  two  variables  x  and  y,  we  have 


s(x,  y)  = 


g(x,  y)Pxy(x,  y)dxdy 


(8.62) 


Mean  of  the  Sum 

y),  g2(x,  y),  ...,£n(x,  y)  are  functions  of  the  RVs  X  and  y,  then 

#i(x,  y)  +  £2(x,  y)  H - hgn(x,  y)=gi(x,  y)+«2(x.  y)+--+£„(x,  y)  (8.63a) 

The  proof  is  trivial  and  follows  directly  from  Eq.  (8.62). 

Thus,  the  mean  (expected  value)  of  the  sum  is  equal  to  the  sum  of  the  means.  An  important 
special  case  is 


x  +  y  =  x  +  y 


(8.63b) 


Equation  (8.63a)  can  be  extended  to  functions  of  any  number  of  RVs. 
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Mean  of  the  Product  of  Two  Functions 

Unfortunately,  there  is  no  simple  result  [as  in  Eq.  (8.63)]  for  the  product  of  two  functions.  For 
the  special  case  where 

#(x,  y)  =gi(x)g2(y)  (8.64a) 

/oo  roo 

/  gi(x)g2(y)Pxy(x,  y)dxdy 
-oo  J  — OO 

If  x  and  y  are  independent,  then  [Eq.  (8.53c)] 

Px yU,  y)  =Px(x)Py(y) 

and 

/OO  r  OO 

8i(x)px(x)  dx  /  g2(y)py(y)dy 
-00  J  —  oo 

=  8\ (x)  82(y)  if  x  and  y  independent  (8.64b) 

A  special  case  of  this  is 

xy  =  x  y  if  x  and  y  independent  (8.64c) 


Moments 

The  /7th  moment  of  an  RV  x  is  defined  as  the  mean  value  of  x".  Thus,  the  nth  moment  of  x  is 


/oo 

x"px(x)dx 

-OO 

The  nth  central  moment  of  an  RV  x  is  defined  as 

/OO 

(x-x)npAx)dx 

-OO 


(8.65a) 


(8.65b) 


The  second  central  moment  ot  an  RV  x  is  of  special  importance.  It  is  called  the  variance  of 
x  and  is  denoted  by  <r2,  where  ctx  is  known  as  the  standard  deviation  (SD)  of  the  RV  x.  By 
definition. 


CTx  =  (X  -  X)2 

=  x2  -  2xx  +  x2  =  ^2  _  2x2  +  -2 

=  x2  —  x2  (8.66) 

Thus,  the  variance  of  x  is  equal  to  the  mean  square  value  minus  the  square  of  the  mean.  When 
the  mean  is  zero,  the  variance  is  the  mean  square;  that  is,  x^  =  cr^ 
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Example  8.19  Find  the  mean  square  and  the  variance  of  the  Gaussian  RV  with  the  PDF  in  Eq.  (8.39)  [see 
Fig.  8.11]. 

We  have 


Changing  the  variable 


— =  i  r 

CTy/27T  J- c 


X2e-(x-n,)>/ 2a2  dx 


to  v  =  (x  —  m)/a  and  integrating,  we  get 


1  2.2 

=  <7  +  m 


(8.67a) 


Also,  from  Eqs.  (8.66)  and  (8.60b), 


=  (a2  +  m2)  -  (m)2 

=  (8.67b) 


Hence,  a  Gaussian  RV  described  by  the  density  in  Eq.  (8.60a)  has  mean  m  and  variance  a2. 
In  other  words,  the  Gaussian  density  function  is  completely  specified  by  the  first  moment 
(x)  and  the  second  moment  (x2). 


Example  8.20  Mean  Square  of  the  Uniform  Quantization  Error  in  PCM 


In  the  PCM  scheme  discussed  in  Chapter  6,  a  signal  band-limited  to  B  Hz  is  sampled  at 
a  rate  of  2 B  samples  per  second.  The  entire  range  (—mp,  mp)  of  the  signal  amplitudes  is 
partitioned  into  L  uniform  intervals,  each  of  magnitude  2 mp/L  (Fig.  8. 1 6a).  Each  sample  is 
approximated  to  the  midpoint  of  the  interval  in  which  it  falls.  Thus,  sample  m  in  Fig.  8. 1 6a 
is  approximated  by  a  value  m,  the  midpoint  of  the  interval  in  which  m  falls.  Each  sample 
is  thus  approximated  (quantized)  to  one  of  the  L  numbers. 

The  difference  q  =  m  -  m  is  the  quantization  error  and  is  an  RV.  We  shall  determine 
q2,  the  mean  square  value  of  the  quantization  error.  From  Fig.  8.16a  it  can  be  seen  that  q 
is  a  continuous  RV  existing  over  the  range  (- mp/L ,  mp/L)  and  is  zero  outside  this  range. 
If  we  assume  that  it  is  equally  likely  for  the  sample  to  lie  anywhere  in  the  quantizing 
interval,*  then  the  PDF  of  q  is  uniform 

pq(q)  =  L/2mp  q  €  (-mp/L  mp/L) 


*  Because  the  quantizing  interval  is  generally  very  small,  variations  in  the  PDF  of  signal  amplitudes  over  the 
interval  are  small  and  this  assumption  is  reasonable. 
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Figure  8.16 

(a)  Quantization 
error  in  PCM 
and  (b)  its  PDF. 


as  shown  in  Fig.  8.16b,  and 


rmp/L 

q- =  /  nPqiq)  dq 
J-mp/L 


-Li. 

2  mp  3 


mp/L 

-ntp/L 


1 

3 


From  Fig.  8. 16b  it  can  be  seen  that  q  =  0.  Hence, 


1 

3 


(8.68a) 


(8.68b) 


Example  8.21  Mean  Square  Error  Caused  by  Channel  Noise  in  PCM 

Quantization  noise  is  one  of  the  sources  of  error  in  PCM.  The  other  source  of  error  is  channel 
noise.  Each  quantized  sample  is  coded  by  a  group  of  n  binary  pulses.  Because  of  channel  noise, 
some  of  these  pulses  are  incorrectly  detected  at  the  receiver.  Hence,  the  decoded  sample  value 
m  at  the  receiver  will  differ  from  the  quantized  sample  value  m  that  is  transmitted.  The  error 

e  =  m  -  m  is  a  random  variable.  Let  us  calculate  €2,  the  mean  square  error  in  the  sample  value 
caused  by  the  channel  noise. 
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To  begin  with,  let  us  determine  the  values  that  6  can  take  and  the  corresponding  probabili¬ 
ties.  Each  sample  is  transmitted  by  n  binary  pulses.  The  value  of  6  depends  on  the  position 
ot  the  incorrectly  detected  pulse.  Consider,  for  example,  the  case  of  L  =  16  transmitted  by 
tour  binary  pulses  ( n  =  4),  as  shown  in  Fig.  1 .5.  Here  the  transmitted  code  1101  represents 
a  value  of  13.  A  detection  error  in  the  first  digit  changes  the  received  code  to  0101,  which 
is  a  value  of  5.  This  causes  an  error  6  =  8.  Similarly,  an  error  in  the  second  digit  gives 
^  Errors  in  the  third  and  the  fourth  digits  will  give  6  =  2  and  6=1,  respectively 
In  general,  the  error  in  the  /th  digit  causes  an  error  6/  =  (2  1 )  1 6.  For  a  general  case,  the 
error  6/  =  (2 ~‘)F,  where  F  is  the  full  scale,  that  is,  2mp ,  in  PCM.  Thus, 


6/  =  (2~l)(2wip)  /=  1,  2 . // 


Note  that  the  error  6  is  a  discrete  RV.  Hence,* 


n 


(8.69) 


Because  Pf(e,)  is  the  probability  that  e  =  e,.  Pf  (<?,-)  is  the  probability  of  error  in  the 
detection  of  the  /th  digit.  Because  the  error  probability  of  detecting  any  one  digit  is  the 
same  as  that  of  any  other,  that  is,  Pe, 


n 


1=1 


n 


=  Pe  £4/^(2-2') 


/=! 

n 


=  4m2pPe^2-2i 


i=l 


This  summation  is  a  geometric  progression  with  a  common  ratio  r  =  2~2.  with  the  first 
term  a\  =  2~2  and  the  last  term  a„  =  2-2".  Hence  (see  Appendix  E.4), 


4mpPe(22n  -  1) 
3(22n) 


(8.70a) 


Note  that  the  magnitude  of  the  error  e  varies  from  2-l(2 mp)  to  2~n(2mp).  The  error  e 
can  be  positive  as  well  as  negative.  For  example,  e  =  8  because  of  a  first-digit  error 
in  1101.  But  the  corresponding  error  e  will  be  -8  if  the  transmitted  code  is  0101.  Of 
course  the  sign  of  e  does  not  matter  in  Eq.  (8.69).  It  must  be  remembered,  however, 
that  €  varies  from  -2~n(2mp)  to  2~n(2mp)  and  its  probabilities  are  symmetrical  about 


*  Here  we  are  assuming  that  the  error  can  occur  only  in  one  of  the  n  digits.  But  more  than  one  digit  may  be  in  error. 
Because  the  digit  error  probability  Pe  <£  1  (on  the  order  10-5  or  less),  however,  the  probability  of  more  than  one 
wrong  digit  is  extremely  small  (see  Example  8.6).  and  its  contribution  is  negligible. 
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e  =  0.  Hence,  e  =  0  and 


Am2pPe(22n  -  1) 
3(22") 


Example  8.2! 


(8.70b) 


Variance  of  a  Sum  of  Independent  Random  Variables 

The  variance  of  a  sum  of  independent  RVs  is  equal  to  the  sum  of  their  variances.  Thus,  if  x 
and  y  are  independent  RVs  and 


z  =  x  +  y 


then 


_2  2  ,  _2 
CTz  =  a%  +  CTy 


(8.71) 


This  can  be  shown  as  follows: 


al  =  (z  -  z)2  =  [x  +  y  -  (x  +  y)]2 


=  [(x-x)  +  (y-y)]2 


=  (x  -  x)2  +  (y  -  y)2  +  2(x  -  x)(y  -  y) 


=  Gl  +  ay  +  2(x  -  x)(y  -  y) 


Because  x  and  y  are  independent  RVs,  (x  -  x)  and  (y  -  y)  are  also  independent  RVs.  Hence, 
from  Eq.  (8.64b)  we  have 


(x  -  x)(y  -  y)  =  (X  -  x)  •  (y  -  y) 


But 


(x-x)  =  x-  x  =  x-  x  =  0 


Similarly, 


(y  -  y)  =  0 


and 


J^gure  8.17 

Gaussian  PDF 
^  standard 
Nations  a  =  i 
a^d  <7  =  3 


This  result  can  be  extended  to  any  number  of  variables.  If  RVs  x  and  y  both  have  zero  means 

(i.e.,x  —  y  —  0),  then  z  =  x  +  y  =  0.  Also,  because  the  variance  equals  the  mean  square  value 

when  the  mean  is  zero,  it  follows  that 


Z2  =  (x  +  y)2  =  x2  +  y2 

provided  x  =  y  =  0.  and  provided  x  and  y  are  independent  RVs. 


(8.72) 
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f'flure  8.17 

Gaussian  PDF 
standard 
deviations  a  =  | 
and  <t  -  3 


22  Total  Mean  Square  Error  in  PCM 

In  PCM,  as  seen  in  Examples  8.20  and  8.21,  a  signal  sample  m  is  transmitted  as  a  quan¬ 
tized  sample  m,  causing  a  quantization  error  q  =  m  —  m.  Because  of  channel  noise,  the 
transmitted  sample  m  is  read  as  m,  causing  a  detection  error  e  =  m  -  in.  Hence,  the  actual 
signal  sample  m  is  received  as  m  with  a  total  error 


m  —  m  =  (m  —  m)  +  (m  —  m)  =  q  +  e 


where  both  q  and  e  are  zero  mean  RVs.  Because  the  quantization  error  q  and  the  channel- 
noise  error  e  are  independent,  the  mean  square  of  the  sum  is  [see  Eq.  (8.72)] 


(m  -  in)2  =  (q  4-  e)2  =  q2  +  e2 


Also,  because  L  =  2", 


(8.73) 


Chebyshev’s  Inequality 

The  standard  deviation  ctx  of  an  RV  x  is  a  measure  of  the  width  of  its  PDF.  The  larger  the  <rx,  the 
wider  the  PDF.  Figure  8.17  illustrates  this  effect  for  a  Gaussian  PDF.  Chebyshev’s  inequality 
is  a  statement  of  this  fact.  It  states  that  for  a  zero  mean  RV  x 


P(|x|  <  kax)  >1-77 


(8.74) 


This  means  the  probability  of  observing  x  within  a  few  standard  deviations  is  very  high.  For 
example,  the  probability  of  finding  |x|  within  3crx  is  equal  to  or  greater  than  0.88.  Thus,  for  a 
PDF  with  ctx  =  1,  P(|x|  <  3)  >  0.88,  whereas  for  a  PDF  with  erx  =  3,  P(|x|  <  9)  >  0.88.  It 
is  clear  that  the  PDF  with  crx  =  3  is  spread  out  much  more  than  the  PDF  with  <rx  =  1 .  Hence, 


f'9ure  8.17 

Gaussian  PDF 
standard 
Aviations  a  =  i 
and  (7  =  3 
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<tx  or  o\  is  often  used  as  a  measure  of  the  width  of  a  PDF.  In  Chapter  10,  we  shall  use  this 
measure  to  estimate  the  bandwidth  of  a  signal  spectrum.  The  proof  of  Eq.  (8.74)  is  as  follows: 

/(X) 

x2px(x)dx 

-00 

Because  the  integrand  is  positive, 

o\  >  /  x 2px(x)dx 

J\x\>kox 

If  we  replace  x  by  its  smallest  value  kox,  the  inequality  still  holds, 

o\  >  k2ol  f  px(x)dx  =  k2olP(\x\  >  kox) 

J\x\>kax 


or 


P(|x|  >  kax)  <  -j-L 

Hence, 

**(1*1  <  kox)  >  1  - 

This  inequality  can  be  generalized  for  a  nonzero  mean  RV  as: 

P( |x  —  x|  <  kcrx)  >1 - L  (8.75) 


Example  8.23  Estimate  the  width,  or  spread,  of  a  Gaussian  PDF  [Eq.  (8.60a)] 

For  a  Gaussian  RV  [see  Eqs.  (8.35)  and  (8.40b)] 

p( lx  -  x|  <  a)  =  1  -  2g(l)  =  0.6826 
P(|x  -  x|  <  2<t)  =  1  -  20(2)  =  0.9546 
p( lx  —  x|  <  3<x)  =  1  —  20(3)  =  0.9974 

This  means  that  the  area  under  the  PDF  over  the  interval  (x  -  3a,  x  +  3a)  is  99.74%  of  the 
total  area.  A  negligible  fraction  (0.26%)  of  the  area  lies  outside  this  interval.  Hence,  the 
width,  or  spread,  of  the  Gaussian  PDF  may  be  considered  roughly  ±3a  about  its  mean, 
giving  a  total  width  of  roughly  6a. 
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8.4  CORRELATION 

Otten  we  are  interested  in  determining  the  nature  of  dependence  between  two  entities,  such 
as  smoking  and  lung  cancer.  Consider  a  random  experiment  with  two  outcomes  described  by 
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RVs  x  and  y- We  conduct  several  trials  of  this  experiment  and  record  values  of  x  and  y  for  each 
trial.  From  this  data,  it  may  be  possible  to  determine  the  nature  of  a  dependence  between  x 
and  y.  The  covariance  of  RVs  x  and  y  is  one  measure  that  is  simple  to  compute  and  can  yield 
useful  information  about  the  dependence  between  x  and  y. 

The  covariance  crxy  of  two  RVs  is  defined  as 

Oxy  =  (x  -  x)(y  -  y)  (8.76) 

Note  that  the  concept  of  covariance  is  a  natural  extension  of  the  concept  of  variance,  which  is 
defined  as 


nx  (x  x)(x  x) 

Let  us  consider  a  case  of  two  variables  x  and  y  that  are  dependent  such  that  they  tend 
to  vary  in  harmony;  that  is,  if  x  increases  y  increases,  and  if  x  decreases  y  also  decreases. 
For  instance,  x  may  be  the  average  daily  temperature  of  a  city  and  y  the  volume  of  soft  drink 
sales  that  day  in  the  city.  It  is  reasonable  to  expect  the  two  quantities  to  vary  in  harmony  for  a 
majority  of  the  cases.  Suppose  we  consider  the  following  experiment:  pick  a  random  day  and 
record  the  average  temperature  of  that  day  as  the  value  of  x  and  the  soft  drink  sales  volume 
that  day  as  the  value  of  y.  We  perform  this  measurement  over  several  days  (several  trials  of 
the  experiment)  and  record  the  data  x  and  y  for  each  trial.  We  now  plot  points  (x,  y)  for  all 
the  trials.  This  plot,  known  as  the  scatter  diagram,  may  appear  as  shown  in  Fig.  8.18a.  The 
plot  shows  that  when  x  is  large,  y  is  likely  to  be  large.  Note  the  use  of  the  word  likely.  It  is  not 
always  true  that  y  will  be  large  if  x  is  large,  but  it  is  true  most  of  the  time.  In  other  words,  in 
a  few  cases,  a  low  average  temperature  will  be  paired  with  higher  soft  drink  sales  owing  to 
some  atypical  situation,  such  as  a  major  soccer  match.  This  is  quite  obvious  from  the  scatter 
diagram  in  Fig.  8. 1 8a. 

To  continue  this  example,  the  variable  x  —  x  represents  the  difference  between  actual 
and  average  values  of  x,  and  y  —  y  represents  the  difference  between  actual  and  average 
values  of  y.  It  is  more  instructive  to  plot  (y  -  y)  vs.  (x  -  x).  This  is  the  same  as  the  scatter 
diagram  in  Fig.  8. 1 8a  with  the  origin  shifted  to  (x,  y ),  as  in  Fig.  8. 1 8b,  which  shows  that  a  day 
with  an  above-average  temperature  is  likely  to  produce  above-average  soft  drink  sales,  and 
a  day  with  a  below-average  temperature  is  likely  to  produce  below-average  soft  drink  sales. 


f'9ure  8.18 
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That  is,  if  jc  -  x  is  positive,  y  -  y  is  likely  to  be  positive,  and  if  .v  -  x  is  negative,  y  -  y 
is  more  likely  to  be  negative.  Thus,  the  quantity  (x  -  x)(v  -  y)  will  be  positive  for  most 
trials.  We  compute  this  product  for  every  pair,  add  these  products,  and  then  divide  by  the 
number  of  trials.  The  result  is  the  mean  value  of  (x  —  x)(y  —  y),  that  is,  the  covariance  crXy  = 
(x  —  x)(y  -  y).  The  covariance  will  be  positive  in  the  example  under  consideration.  In  such 
cases,  we  say  that  a  positive  correlation  exists  between  variables  x  and  y.  We  can  conclude  that  a 
positive  correlation  implies  variation  of  two  variables  in  harmony  (in  the  same  direction,  up  or 
down). 

Next,  we  consider  the  case  of  the  two  variables:  x,  the  average  daily  temperature,  and  z, 
the  sales  volume  of  sweaters  that  day.  It  is  reasonable  to  believe  that  as  x  (daily  average 
temperature)  increases,  z  (the  sweater  sales  volume)  tends  to  decrease.  A  hypothetical  scatter 
diagram  for  this  experiment  is  shown  in  Fig.  8.18c.  Thus,  if  x  —  x  is  positive  (above-average 
temperature),  z  —  z  is  likely  to  be  negative  (below-average  sweater  sales).  Similarly,  when 
x  —  x  is  negative,  z  —  z  is  likely  to  be  positive.  The  product  (x  —  x)(z  —  z)  will  be  negative 
for  most  of  the  trials,  and  the  mean  (x  —  x)(z  —  z)  =  crxz  will  be  negative.  In  such  a  case,  we 
say  that  negative  correlation  exists  between  x  and  y.  It  should  be  stressed  here  that  negative 
correlation  does  not  mean  that  x  and  y  are  unrelated.  It  means  that  they  are  dependent,  but 
when  one  increases,  the  other  decreases,  and  vice  versa. 

Last,  consider  the  variables  x  (the  average  daily  temperature)  and  w  (the  number  of  births). 
It  is  reasonable  to  expect  that  the  daily  temperature  has  little  to  do  with  the  number  of  children 
born.  A  hypothetical  scatter  diagram  for  this  case  will  appear  as  shown  in  Fig.  8. 1 8d.  If  x  —  x 
is  positive,  w  —  w  is  equally  likely  to  be  positive  or  negative.  The  product  (jc  —  x)(w  -  w)  is 
therefore  equally  likely  to  be  positive  or  negative,  and  the  mean  (x  -  x)(w  —  w)  =  tfXw 
be  zero.  In  such  a  case,  we  say  that  RVs  x  and  w  are  uncorrelated. 

To  reiterate,  if  axy  is  positive  (or  negative),  then  x  and  y  are  said  to  have  a  positive  (or 
negative)  correlation,  and  if  axy  =  0,  then  the  variables  x  and  y  are  said  to  be  uncorrelated. 

From  this  discussion,  it  appears  that  under  suitable  conditions,  covariance  can  serve  as 
a  measure  ot  the  dependence  of  two  variables.  It  often  provides  some  information  about  the 
interdependence  of  the  two  RVs  and  proves  useful  in  a  number  of  applications. 

The  covariance  <rxy  may  be  expressed  in  another  way,  as  follows.  By  definition. 


axy  =  (x  —  x)(y  —  y) 

=  xy  -  xy  -  xy  +  xy 
=  xy  —  xy  —  xy  +  xy 

=  xy  —  xv  (8*77) 


From  Eq.  (8.77)  it  follows  that  the  variables  x  and  y  are  uncorrelated  (crxy  =  0)  it 


The  correlation  between  x  and  y  cannot  be  directly  compared  with  the  correlation  between  i 
and  w.  This  is  because  different  RVs  may  differ  in  strength.  To  be  fair,  the  covariance  value 
should  be  normalized  appropriately.  For  this  reason,  the  definition  of  correlation  coefficient 
is  particularly  useful.  Correlation  coefficient  pxy  is  crxy  normalized  by  <Jxay, 


Pxy  = 


tfxy 


GXGy 


(8.79) 
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Thus,  if  x  and  y  are  uncorrelated,  then  pxy  =  0.  Also,  it  can  be  shown  that  (Prob.  8.5-5) 

that 


-l<Pxy<l  (8.80) 

Independence  vs.  Uncorrelatedness 

Note  that  for  independent  RVs  [Eq.  (8.64c)] 


xy  =  xy  and  <rxy  =  0 

Hence,  independent  RVs  are  uncorrelated.  This  supports  the  heuristic  argument  presented 
earlier.  It  should  be  noted  that  whereas  independent  variables  are  uncorrelated,  the  converse 
is  not  necessarily  true — uncorrelated  variables  are  generally  not  independent  (Prob.  8.5-7). 
Independence  is,  in  general,  a  stronger  and  more  restrictive  condition  than  uncorrelatedness. 
For  independent  variables,  we  have  shown  [Eq.  (8.64b)]  that,  when  the  expectations  exist. 


#i(x)#2(y)  =  si(x)g2(y) 

for  any  functions  g|  (•)  and  g2(  ).  whereas  for  uncorrelatedness,  the  only  requirement  is  that 

xy  =  xy 

There  is  only  one  special  case  for  which  independence  and  uncorrelatedness  are  equivalent — 
when  random  variables  x  and  y  are  jointly  Gaussian.  Note  that  when  x  and  y  are  jointly 
Gaussian,  individually  x  and  y  are  also  Gaussian. 

Mean  Square  of  the  Sum  of  Uncorrelated  Variables 

If  x  and  y  are  uncorrelated,  then  for  z  =  x  +  y  we  show  that 

orz=°x+°ry  (8.81) 

That  is,  the  variance  of  the  sum  is  the  sum  of  variances  for  uncorrelated  RVs.  We  have  proved 
this  result  earlier  for  independent  variables  x  and  y.  Following  the  development  after  Eq.  (8.7 1 ), 
we  have 


a\  =  [(x  -  x)  -I-  (y  -  y)l2 

=  (x  -  x)2  +  (y  -  y)2  +  2(x  -  x)(y  -  y) 

=  <rx  +  <?y  +  2<rxy 

Because  x  and  y  are  uncorrelated.  <rxy  =  0,  and  Eq.  (8.81)  follows.  If  x  and  y  have  zero  means, 
then  z  also  has  a  zero  mean,  and  the  mean  square  values  of  these  variables  are  equal  to  their 
variances.  Hence, 

(x  T  y)2  =  x2  -l-y2  (8.82) 

if  x  and  y  are  uncorrelated  and  have  zero  means.  Thus,  Eqs.  (8.81)  and  (8.82)  are  valid  not 
only  when  x  and  y  are  independent,  but  also  under  the  less  restrictive  condition  that  x  and  y 
be  uncorrelated. 
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8.5  LINEAR  MEAN  SQUARE  ESTIMATION 


When  two  random  variables  x  and  y  are  related  (or  dependent),  then  a  knowledge  of  one  gives 
certain  information  about  the  other.  Hence,  it  is  possible  to  estimate  the  value  of  (parameter 
or  signal)  y  from  a  knowledge  of  the  value  of  x.  The  estimate  of  y  will  be  another  random 
variable  y.  The  estimated  random  variable  y  will  in  general  be  different  from  the  actual  y. 
One  may  choose  various  criteria  of  goodness  for  estimation.  Minimum  mean  square  error  is 
one  possible  criterion.  The  optimum  estimate  in  this  case  minimizes  the  mean  square  error  e2 
given  by 


2  =  (y  -  y)2 


€ 


In  general,  the  optimum  estimate  y  is  a  nonlinear  function  of  x.*  We  simplify  the  problem  by 
constraining  the  estimate  y  to  be  a  linear  function  of  x  of  the  form 


y  =  ax 


assuming  that  x  =  0.+  In  this  case, 


62  =  (y  -  y)2  =  (y  -  a\)2 

=  y2  +  a2x2  -  2axy 


To  minimize  €2,  we  have 


—  =  2  ax2  —  2xy  =  0 


Hence, 


*y  =  Kxy 

X2  ^xx 


(8.83) 


where  Rxy  xy,  RA ^  x2,  and  Ryy  —  y2.  Note  that  for  this  constant  choice  of  a , 


Hence, 


*  11  can  shown  that5  the  optimum  estimate  y  is  the  conditional 


mean  of  y  when  x  =  jc,  that  is. 


y  =  £[y  |  x  =  jc] 


In  general,  this  is  a  nonlinear  function  of  x. 

+  Throughout  the  discussion,  the  variables  x,  y, . . .  will  be  assuir 
of  generality.  If  the  variables  have  nonzero  means,  we  can  form 
and  so  on.  The  new  variables  obviously  have  zero  mean  values. 


will  be  assumed  to  have  zero  mean.  This  can  be  done  without  loss 
we  can  form  new  variables  jc'  =  x  -  x  and  /  =  y  -  y. 
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Since  by  definition  xy  =  Rxy  and  xx  =  x2  =  flxx,  we  have 

x?  =  /?xy  -  Rxy  =  0  (8.84) 

The  condition  of  Eq.  (8.84)  is  known  as  the  principle  of  orthogonality.  The  physical  inter¬ 
pretation  is  that  the  data  (x)  used  in  estimation  and  the  (minimum)  error  (6)  are  orthogonal 
(implying  uncorrelatedness  in  this  case)  when  the  mean  square  error  is  minimum. 

Given  the  principle  of  orthogonality,  the  minimum  mean  square  error  is  given  by 

62  =  (y  —  ax)2 

=  (y  —  ax)y  -  a  •  ?x 
=  (y  -  ax) y 
=  y2  —  a  •  yx 

=  ^yy  (8.85) 

Using  n  Random  Variables  to  Estimate  a  Random  Variable 

If  a  random  variable  xo  is  related  to  n  RVs  xi ,  X2,  . . . ,  x„,  then  we  can  estimate  xo  using  a 
linear  combination*  of  \\ ,  X2,  . . . ,  x„: 


n 

xo  =  a\X\  +  aixi  H - b  anxn  =  ^ tf,x,  (8.86) 

i=l 


The  mean  square  error  is  given  by 


€2  =  [x0  -  (tfixi  -f  a2x2  +  •  •  •  +  anxn)]2 


To  minimize  e2,  we  must  set 

d€2  de2  d€2  ^ 

da  i  da2  dan 

that  is, 

^2  ^  _ _ _ 

—  =  —  [x0-(aixi  +  a2\2  +  ■  ■  ■  +  anxn)]2  =  0 
dat  dat 

Interchanging  the  order  of  differentiation  and  averaging,  we  have 

(8.87a) 

(8.87b) 


S€2  - _ 

—  = -2[x0- (flixi  +02*2  +  -  -  +  anxn)]xi  =0 
dai 


Equation  (8.87a)  can  be  written  as 


e  .  x/  =  0  /  =  1 ,  2,  . . . ,  n 


*  Throughout  this  section  as  before,  we  assume  that  all  the  random  variables  have  zero  mean  values.  This  can  be 
done  without  loss  of  generality. 
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It  can  be  rewritten  into  Yule- Walker  equations 


Roi  —  a  1  *it  +  «2*/2  + - 1-  o„Ri„ 


(8.88) 


where 


Rij  =  X/Xj 

Differentiating  e2  with  respect  to  a\,  a2,  •  ■ . .  a„  and  equating  to  zero,  we  obtain  n  simultaneous 
equations  of  the  form  shown  in  Eq.  (8.88).  The  desired  constants  a\ ,  a2,  . . . ,  an  can  be  found 
from  these  equations  by  matrix  inversion 


a\ 

'  *ii 

*12  • 

•*  R\n 

-1 

*01 

a2 

= 

*21 

*22  ' 

’  *  Rln 

*02 

an  _ 

*nl 

*„2  • 

'  *  Rnn 

*0/i 

(8.89) 


Equation  (8.87)  shows  that  e  (the  error)  is  orthogonal  to  data  (xj,  X2,  . . . ,  x„)  for  optimum 
estimation.  This  gives  the  more  general  form  for  the  principle  of  orthogonality  in  mean  square 
estimation.  Consequently,  the  mean  square  error  (under  optimum  conditions)  is 


e2  —  €€  —  e[xo  —  (fl|Xi  +a2X2  -I - banX„)] 

Because  ejq  =  0  (t  =  1,2 . ri). 


e2  =  exo 

=  xofxo  -  (aixi  +  a2X2  + - F  a„x„)] 

=  Roo  —  («i/?oi  -I-  02R02  +  ■  •  •  +  anR()n)  (8.90) 


Example  8.24  In  dillerential  pulse  code  modulation  (DPCM),  instead  of  transmitting  sample  values  directly. 

we  estimate  (predict)  the  value  of  each  sample  from  the  knowledge  of  previous  n  samples. 
The  estimation  error  e/. .  the  difference  between  the  actual  value  and  the  estimated  value  of  the 
A  th  sample,  is  quantized  and  transmitted  (Fig.  8.19).  Because  the  estimation  error  e*  is  smaller 
than  the  sample  value  m*,  for  the  same  number  of  quantization  levels  (the  same  number  ot 
PCM  code  bits),  the  SNR  is  increased.  It  was  shown  in  Sec.  6.5  that  the  SNR  improvement 
is  equal  to  m2/e2,  where  m2  and  e2  are  the  mean  square  values  of  the  speech  signal  and  the 
estimation  error  e,  respectively.  In  this  example,  we  shall  find  the  optimum  linear  second-order 
predictor  and  the  corresponding  SNR  improvement. 

The  equation  of  a  second-order  estimator  (predictor),  shown  in  Fig.  8.19,  is 


m*  =  aim*_!  +  a2m*_2 

where  m*  is  the  best  linear  estimate  of  m*.  The  estimation  error  e*  is  given  by 


Figure  8.1 9 

Second-order 
predictor  in 
Example  8.24. 
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(k  =  m*  -  mk  =  aim*,]  +  a2mk_2  -  m* 
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Example  8.24. 


For  speech  signals,  Jayant  and  Noll5  give  the  values  of  correlations  of  various 
samples  as: 

m*m*  =  m2,  m*m*_i  =  0.825m2,  m;m*_2  =  0.562m2, 

m*m*_3  =  0.308m2,  m^m;_4  =  0.004m2,  =  —0.243m2 

Note  that  Rjj  =  m*m Hence, 


R\ i  =  Rn  —  m2 

R\2  —  Ri\  —  Ro\  —  0.825m2 

Rq2  =  0.562m2 


The  optimum  values  of  a\  and  02  are  found  from  Eq.  (8.89)  as  a\  =  1.1314  and 
CL2  =  -0.3714,  and  the  mean  square  error  in  the  estimation  is  given  by  Eq.  (8.90)  as 


e2  =  [1  -  (0.825tf|  +  0.562</2)]m2  =  0.2753m2 


(8.91) 


The  SNR  improvement  is  10  log10  m2/0.2752m2  =  5.6  dB. 


8.6  SUM  OF  RANDOM  VARIABLES 


In  many  applications,  it  is  useful  to  characterize  the  RV  z  that  is  the  sum  of  two  RVs  x  and  y: 


z  =  x  +  y 


Because  z  =  x  +  y.  y  =  z  —  x  regardless  of  the  value  of  x.  Hence,  the  event  z  <  z  is  the  joint 
event  [y  <  z  -  x  and  x  to  have  any  value  in  the  range  (-00,  00)].  Hence. 


FZ(Z)  =  P( z  <  z)  =  P(x  <  00,  y  <  z  -  x) 
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and 


oo 


pXy(x,  Z  —  X)d.X 


-00 


If  x  and  y  are  independent  RVs,  then 


Pxy  (*,  Z  ~  x)  =  /?x(x)py(z  -  *) 


and 


roo 

Pz(z)=  PxWPyiZ  ~  X)d.X 


(8.92) 


J  -  OO 


The  PDF  pz(z)  is  then  the  convolution  of  PDFs  px(z)  and  Py(z)-  We  can  extend  this  result  to 
a  sum  of  n  independent  RVs  X|,  X2,  . . . ,  x„.  If 


z  —  X|  +  X2  +  •  •  •  +  X„ 


then  the  PDF  pz(z)  will  be  the  convolution  of  PDFs  pX|  (jc),  pX2  (x),  p*„  (*),  that  is. 


Pz(x)  —  pX|  (x)  *pX2(x)  *  ...  *  pXn  (x) 


(8.93) 


Sum  of  Gaussian  Random  Variables 

Gaussian  random  variables  have  several  very  important  properties.  For  example,  a  Gaussian 
random  variable  x  and  its  probability  density  function  /?x(jc)  are  fully  described  by  the  mean 
Px  and  the  variance  <rx.  Furthermore,  the  sum  of  any  number  of  jointly  distributed  Gaussian 
random  variables  is  also  a  Gaussian  random  variable,  regardless  of  their  relationships  (such  as 
dependency).  Again,  note  that  when  the  members  of  a  set  of  random  variables  {x,}  are  jointly 
Gaussian,  each  individual  random  variable  x,  also  has  Gaussian  distribution. 

As  an  example,  we  will  show  that  the  sum  of  two  independent,  zero  mean,  Gaussian 
random  variables  is  Gaussian.  Let  X|  and  X2  be  two  zero  mean  and  independent  Gaussian 
random  variables  with  probability  density  functions 


Let 


y  =  xi  +x2 


The  probability  density  function  of  y  is  therefore 
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Upon  carrying  out  this  convolution  (integration),  we  have 
1 


Py(y)  = 


2no\<J2  J- 

1 


/>(- 


jc 2  (y  —  x )2 

2cr^  2cr2 


dx 


f 

e  2(<rf+aj) 


yflMa]  +  o\) 

By  a  simple  change  of  variable 


V2tt~P^= 

V<Ti+ori 


f 

J-c 


exp  — 


t2/t2 


°7+*2 


n2^ 


jc  — 


2  2^ 
^1+^2 


dx 
(8.94) 


w  = 


[■*  ~ 


y^+of 


we  can  rewrite  the  integral  of  Eq.  (8.94)  as 
1 


PyOO  = 


JlMo]  +  of) 


^  2(af+aj> 


1  f0 

y/2jt  J—  c 


yflirUrf+oi) 


_ X2 

e  2(«,+4»  (8.95) 


By  examining  Eq.  (8.95),  it  can  be  seen  that  y  is  a  Gaussian  RV  with  zero  mean  and  variance: 


al  =  erf  +  a\ 


In  fact,  because  X|  and  X2  are  independent,  they  must  be  uncorrelated.  This  relationship  can 
be  obtained  from  Eq.  (8.81). 

More  generally,5  if  xj  and  X2  are  jointly  Gaussian  but  not  necessarily  independent,  then 
y  =  xi  +  X2  is  Gaussian  RV  with  mean 


y  =  xi  +  x2 

and  variance 

°y  =  axi  +°r*2  +2aXiX2 

Based  on  induction,  the  sum  of  any  number  of  jointly  Gaussian  distributed  RV’s  is  still 
Gaussian.  More  importantly,  for  any  fixed  constants  {fl/,  i  —  1,  . . . ,  aw}  and  jointly  Gaussian 
RVs  {x/,  i  =  1,  ...»  aw}, 

m 

X>x' 

1=1 

remains  Gaussian.  This  result  has  important  practical  implications.  For  example,  if  x*  is  a 
sequence  of  jointly  Gaussian  signal  samples  passing  through  a  discrete  time  filter  with  impulse 
response  |/i,  },  then  the  filter  output 

OO 

y  =  ^2  hi*k-i 
1=0 


(8.96) 
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will  continue  to  be  Gaussian.  The  fact  that  linear  filter  output  to  a  Gaussian  signal  input  will  be 
a  Gaussian  signal  is  highly  significant  and  is  one  of  the  most  useful  results  in  communication 
analysis. 


8.7  CENTRAL  LIMIT  THEOREM 


Under  certain  conditions,  the  sum  of  a  large  number  of  independent  RVs  tends  to  be  a  Gaussian 
random  variable,  independent  of  the  probability  densities  of  the  variables  added.*  The  rigorous 
statement  of  this  tendency  is  what  is  known  as  the  central  limit  theorem.  Proof  of  this  theorem 
can  be  found  in  the  Refs.  6  and  7.  We  shall  give  here  only  a  simple  plausibility  argument. 

The  tendency  toward  a  Gaussian  distribution  when  a  large  number  of  functions  are  con¬ 
volved  in  shown  in  Fig.  8.20.  For  simplicity,  we  assume  all  PDFs  to  be  identical,  that  is,  a 
gate  function  0.5  W(x/2).  Figure  8.20  shows  the  successive  convolutions  of  gate  functions. 
The  tendency  toward  a  bell-shaped  density  is  evident. 

This  important  result  that  the  distribution  of  the  sum  of  n  independent  Bernoulli  random 
variables,  when  properly  normalized,  converges  toward  Gaussian  distribution  was  established 
first  by  A.  de  Moivre  in  the  early  1700s.  The  more  general  proof  for  an  arbitrary  distribution 
was  credited  to  J.  W.  Lindenber  and  P.  Levy  in  the  1920s.  Note  that  the  “normalized  sum”  is 
the  sample  average  (or  sample  mean)  of  n  random  variables. 

Central  Limit  Theorem  (for  the  sample  mean): 

Let  X),  . . . ,  x„  be  independent  random  samples  from  a  given  distribution  with  mean  p  and 
variance  a2  with  0  <  a2  <  oo.  Then  for  any  value  x,  we  have 


(8.97) 


or  equivalently ; 


lim  P 


(8.98) 


Note  that 


_  xi  H - Fx„ 


n 


Figure  8.20 

Demonstration  of 
the  central  limit 
theorem. 
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is  known  as  the  sample  mean.  The  interpretation  is  that  the  sample  mean  of  any  distribu¬ 
tion  with  nonzero  finite  variance  converges  to  Gaussian  distribution  with  fixed  mean  p  and 
decreasing  variance  o2/n.  In  other  words ,  regardless  of  the  true  distribution  of  x,\  J2'i=\  */ 
can  be  approximated  by  a  Gaussian  distribution  with  mean  np  and  variance  no2. 


Example  8.25  Consider  a  communication  system  that  transmits  a  data  packet  of  1024  bits.  Each  bit  can  be 
in  error  with  probability  of  10“2.  Find  the  (approximate)  probability  that  more  than  30  of  the 
1024  bits  are  in  error. 

Define  a  random  variable  X/  such  that  x/  =  1  if  the  rth  bit  is  in  error  and  x,  =  0  if  not. 
Hence 


is  the  number  of  errors  in  the  data  packet.  We  would  like  to  find  P(\  >  30). 

Since  P(Xj  =  1)  =  10-2  and  P(x,  =  0)  =  1  -  10“2,  strictly  speaking  we  would 
need  to  find 

!^/1024\/  1024— m 

P(v  >  30)  =  £  (  m  )(io-2)  (l  - 10—) 

m= 3 1 


This  calculation  is  time-consuming.  We  now  apply  the  central  limit  theorem  to  solve  this 
problem  approximately. 

First,  we  find 

x7=  1(T2  x  (1)  +  (1  -  1(T2)  x  (0)  =  10"2 
x?  =  10-2  x  (l)2  +  (1  —  10-2)  x  (0)  =  10-2 

As  a  result, 

af=7j-  (xj)2  =  0.0099 

Based  on  the  central  limit  theorem,  v  =  £,  x,  is  approximately  Gaussian  with  mean  of 
1024  •  I0-2  =  10.24  and  variance  1024  x  0.0099  =  10.1376.  Since 

v-  10.24 
y  “  s/10.1376 


is  a  standard  Gaussian  with  zero  mean  and  unit  variance. 

30-  10.24' 


P(\  >  30) 


=4 


v/10.1376 
-  p( y  >  6.2061 1) 

=  (9(6.20611) 
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Now  is  a  good  time  to  further  relax  the  conditions  in  the  central  limit  theorem  for  the  sample 


problem: 


mean.  This  highly  important  generalization  is  proved  by  the  famous  Russian  mathematician 
A.  Lyapunov  in  1901. 

Central  Limit  Theorem  (for  the  sum  of  independent  random  variables): 

Let  random  variables  xj ,  . . . ,  x„  be  independent  but  not  necessarily  identically  distributed. 
Each  of  the  random  variable  x,  has  mean  m,  and  nonzero  variance  of  <  oo.  Furthermore, 
suppose  that  each  third-order  central  moment 


|x(  -  M/I3  <oo,  i  =  1,  ...,n 


and  suppose 


Then  random  variable 


converges  to  a  standard  Gaussian  density  as  n  -+  oo,  that  is, 


\}rnoP[y(n)>x]  =  Q(x) 


(8.99) 


The  central  limit  theorem  provides  a  plausible  explanation  for  the  well-known  fact  that 
many  random  variables  in  practical  experiments  are  approximately  Gaussian.  For  example, 
communication  channel  noise  is  the  sum  effect  of  many  different  random  disturbance  sources 
(e  g  sparks,  lightning,  static  electricity).  Based  on  the  central  limit  theorem,  noise  as  the  sum 
of  all  these  random  disturbances  should  be  approximately  Gaussian 
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8.1- 1  A  communication  network  has  26  nodes.  Each  node  may  fail  with  probability  of  p  =  0.05 

independently.  Find  the  probability  that  (a)  2  out  of  26  nodes  fail;  (b)  no  more  than  5  nodes  fail; 
(c)  the  number  of  failed  nodes  is  between  20  and  23. 

8.1- 2  A  bank  customer  has  selected  his  four-digit  personal  identification  number  (PIN).  Find  the 

probabilities  of  the  following  events:  the  sum  of  the  digits  is:  (a)  3;  (b)  7;  (c)  8. 

8.1- 3  A  family  selects  two  phones  from  a  box  containing  cellphones:  four  are  Nokia,  marked  N\ , 

A3,  and  A4,  and  the  other  three  are  Samsung,  marked  Sj,  Sis  and  53.  Two  phones  are  picked 
randomly  in  succession  without  replacement. 

(a)  How  many  outcomes  are  possible?  That  is,  how  many  points  are  in  the  sample  space?  List 
all  the  outcomes  and  assign  probabilities  to  each  of  them. 

(b)  Express  the  following  events  as  unions  of  the  outcomes  in  part  (a)  (i)  one  is  a  Nokia  and 
the  other  is  a  Samsung;  (ii)  both  are  Nokia  phones;  (iii)  both  are  Samsung  phones;  and  (iv) 
both  are  of  the  same  kind.  Assign  probabilities  to  each  of  these  events. 

8.1- 4  Use  Eq.  (8.12)  to  find  the  probabilities  in  Prob.  8.1-3,  part  (b). 

8.1- 5  In  Prob.  8.1-3,  determine  the  probability  that  each  of  the  following  is  true. 

(a)  The  second  pick  is  a  Nokia,  given  that  the  first  pick  is  a  Samsung. 

(b)  The  second  pick  is  a  Nokia,  given  that  the  first  pick  is  also  a  Nokia. 

8.1- 6  In  the  transmission  of  a  string  of  12  bits,  each  bit  is  likely  to  be  in  error  with  probability  of 

p  =  0.4  independently.  Find  probabilities  of  the  following  events,  (a)  there  are  exactly  3  errors 
and  9  correct  bits:  (b)  there  are  at  least  4  errors. 

8.1- 7  In  the  California  lottery  (Lotto),  a  player  chooses  any  6  numbers  out  of  49  numbers  (1 

through  49).  Six  balls  are  drawn  randomly  (without  replacement)  from  the  49  balls  numbered 
1  through  49. 

(a)  Find  the  probability  of  matching  all  6  balls  to  the  6  numbers  chosen  by  the  player. 

(b)  Find  the  probability  of  matching  exactly  5  balls. 

(c)  Find  the  probability  of  matching  exactly  4  balls. 

(d)  Find  the  probability  of  matching  exactly  3  balls. 

8.1- 8  A  network  consists  of  ten  links  s\>  $2*  ...»  $io  *n  cascade  (Fig.  P8. 1-8).  If  any  one  of  the  links 

fails,  the  entire  system  fails.  All  links  are  independent  with  equal  probability  of  failure  p. 

(a)  The  probability  that  a  link  does  not  fail  equals  0.98.  What  is  the  probability  of  failure  of  the 
network? 

Hint:  Consider  the  probability  that  none  of  the  links  fails. 

(b)  The  reliability  of  a  network  is  the  probability  of  not  failing.  If  the  system  reliability  is 
required  to  be  0.99.  what  must  be  the  failure  probability  of  each  link? 

(c)  Repeat  part  (a)  if  link  si  has  a  probability  of  failure  of  0.03  while  other  links  can  fail  with 
equal  probability  of  0.01 . 
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Figure  P.8.1-8 
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8.1-9  Network  reliability  is  improved  if  redundant  links  are  used.  The  reliability  of  the  network  in 
Prob.  8. 1  -8  (Fig.  P8. 1  -8)  can  be  improved  by  building  two  subnetworks  in  parallel  (Fig.  P8. 1  -9). 
Thus,  if  one  subnetwork  fails,  the  other  one  will  still  connect. 

(a)  Use  the  data  in  Prob.  8.1-8  to  determine  the  reliability  of  the  network  in  Fig.  P8.1-9. 

(b)  If  the  reliability  of  this  new  network  is  required  to  be  0.999,  what  must  be  the  failure 
probability  of  each  link? 


Figure  P.8.1-9 


Figure 

P.8.1-10 


8.1-10  Compare  the  reliability  ot  the  two  networks  in  Fig.  P8.1-10,  given  that  the  failure  probability  of 
links  s\  and  S2  is  p  each. 


(a)  (b) 


8.1- 11  Repeat  Prob.  8.1-9  if  we  connect  the  outputs  of  $f-  and  s/,  for  /  =  1,2,  ...»  9. 

8.1- 12  Repeat  Prob.  8.1-9  if  we  only  connect  the  outputs  of  s\  and  s\. 

8.1- 13  In  Sec.  8.1,  Example  8.5,  determine  the  following. 

(a)  P(B ),  the  probability  of  drawing  an  ace  in  the  second  draw. 

(b)  P(A  |#),  the  probability  that  the  first  draw  was  a  red  ace  given  that  the  second  draw  is  an  ace. 

Hinr.  Event  B  can  occur  in  two  ways:  the  first  draw  is  a  red  ace  and  the  second  draw  is  an 

ace,  or  the  first  draw  is  not  a  red  ace  and  the  second  draw  is  an  ace.  This  isAC\BL)AcB  (see 
Fig.  8.2). 


8.1- 14  Abinary  source  generates  digits  1  and  0  randomly  with  probabilities  />(1)  =0.8  and  P(0)  =0.2. 

(a)  What  is  the  probability  that  exactly  two  Is  will  occur  in  a  n-digit  sequence? 

(b)  What  is  the  probability  that  at  least  three  Is  will  occur  in  a  n-digit  sequence? 

8'1'15  p  au,Try  C?mmUniCat'°n  Channel’ the  receiver  detec,s  binary  pulses  with  an  error  probability 
Pe-  What  is  the  probability  that  out  of  100  received  digits,  no  more  than  four  digits  are  in  error? 

8.1- 16  A  PCM  channel  consists  of  15  links,  each  with  a  regenerative  repeater  at  the  end.  If  the  detec- 

k  ieS°fthe  detectors  are  p,,  P2 . Pl5,  determine  the  detection  error 

probability  of  the  entire  channel  if  pj  <£  1. 
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8.1- 17  Example  8.8  considers  the  possibility  of  improving  reliability  by  repeating  a  digit  three  times. 

Repeat  this  analysis  for  four  repetitions. 

8.1- 18  In  a  box  there  are  12  microprocessors.  One  of  them  is  broken.  There  are  12  students  in  a  lab. 

Each  of  the  12  students  selects  a  microprocessor  by  picking  one  from  the  box.  Determine  which 
position  in  the  sequence  favors  drawing  the  broken  processor  the  most  and  which  position  favors 
the  least. 

8.1- 19  In  a  network  with  12  links,  one  of  the  links  has  failed.  The  failed  link  is  randomly  located.  An 

engineer  tests  the  links  one  by  one  until  the  failed  link  is  located. 

(a)  What  is  the  probability  that  he  will  find  the  failed  link  in  the  first  test? 

(b)  What  is  the  probability  that  he  will  find  the  failed  link  in  five  tests? 

8.2-1  For  a  certain  binary  nonsymmetric  channel  it  is  given  that 

Pyjx(0|l)  =  0.1  and  Pyjx(l|0)  =  0.2 


where  x  is  the  transmitted  digit  and  y  is  the  received  digit.  Assume  that  Px{ 0)  =  0.4. 


(a)  Determine  Py(0)  and  Py(l). 

(b)  What  is  the  probability  that  no  Is  will  be  in  the  output  for  an  input  sequence  of  10  digits? 

(c)  What  is  the  probability  that  eight  Is  and  two  0s  will  be  in  the  output  for  an  input  sequence 
of  10  digits? 

(d)  What  is  the  probability  that  at  least  five  0s  will  be  in  the  output  for  an  input  sequence  of  10 
digits? 


8.2- 2  A  binary  symmetric  channel  (Example  8.13)  has  an  error  probability  Pe .  The  probability  of 

transmitting  1  is  Q.  If  the  receiver  detects  an  incoming  digit  as  I,  what  is  the  probability  that  the 
originally  transmitted  digit  was:  (a)  1:  (b)  0? 

Hint:  If  x  is  the  transmitted  digit  and  y  is  the  received  digit,  you  are  given  Py|x(0|l)  = 
P  1,(110)  =  Pe •  Now  use  Bayes'  rule  to  find  Px|y(l|l)  and  Px|y(0|l). 

8.2- 3  We  consider  a  binary  transmission  system  that  is  formed  by  the  cascade  of  n  stages  of  the  same 

binary  symmetric  channel  described  in  Prob.  8.2-2.  Notice  that  a  transmitted  bit  is  in  error  if 
and  only  if  the  number  of  erroneous  stages  is  odd. 


(a)  Show  that  the  error  probability  Pe  (n)  and  the  correct  probability  Pc(n)  after  m  stages  (m  <  n) 
can  be  written  as  a  recursion 


r  PA*)  i  r  i  -  Pe 

\_Pe(n)\  L  p‘ 


Pe  1  r  Pc{n-  1)1 
1  -Pe  J  [Pe(n-  1)J 


where 


1  ~Pe  Pe 

Pe  '~Pe  J 


i[ 


(1 


0 

-2  Pe) 


(b)  Show  that 
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Figure  P.8.2-6 


(c)  Given  part  (b),  show  that 


1"  Pc(n)  1  1  [ 

'  1  +  (1  -  2Pe)n  1 

[M«)J  2  [ 

1-(1  -2Pe)n  J 

(d)  For  Pe  <&  1,  discuss  the  approximation  of  the  probability  of  error  Pe(n). 


8.2-4  The  PDF  of  amplitude  x  of  a  certain  signal  x (/)  is  given  by  px(x)  =  C\x\e  Both  C  and  K 
are  positive  constants. 


(a)  Find  C  as  a  function  of  K. 

(b)  Find  the  probability  that  x  >  -1. 

(c)  Find  the  probability  that  -1  <  x  <  2. 

(d)  Find  the  probability  that  x  <  2. 

8.2-5  The  PDF  of  a  Gaussian  variable  x  is  given  by 


PxW  = 


1  r-U-4)2/18 

CV 25F 


Determine  (a)  C;  (b)  P(x  >  2);  (c)  P(x  <  -1);  (d)  P(x  >  -2). 

8.2-6  The  PDF  of  an  amplitude  x  of  a  Gaussian  signal  x(/)  is  given  by 


Px(x)  = 

Gy/ln 

This  signal  is  applied  to  the  input  of  a  half-wave  rectifier  circuit  (Fig.  P8.2-6). 

(a)  Assuming  an  ideal  diode,  determine  Fy(y)  and  py(y)  of  the  output  signal  amplitude  y  = 
x  •  m(x).  Notice  that  the  probability  of  x=  0  is  not  zero. 

(b)  Assuming  a  nonideal  diode,  determine  Fy(y)  and  /?v(v)  of  the  output  signal  amplitude 
y  =  (x  —  o.  l)  •  w(x). 


-o 

yW 

-o 


8.2-7  For  an  RV  x  with  PDF 

Px(Jr)  =  C^e ~x2/32“M 

(a)  Find  C.  sketch  Px(x),  and  state  (with  reasons)  if  this  is  a  Gaussian  RV. 

(b)  Determine:  (i)  P(x  >  1).  (ij)  />(_  j  <  x  <  2). 

(c)  How  would  one  generate  RV  x  from  another  Gaussian  RV?  Show  block  diagram  and  explain- 
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8.2-8  In  the  example  on  threshold  detection  (Example  8.16).  it  was  assumed  that  the  digits  1  and  0 
were  transmitted  with  equal  probability.  If  Px(l)  and  Px( 0).  the  probabilities  of  transmitting  1 
and  0,  respectively,  are  not  equal,  show  that  the  optimum  threshold  is  not  0  but  is  a,  where 


a  = 


2A,,  /\(l) 


Hint:  Assume  that  the  optimum  threshold  is  a,  and  write  Pe  in  terms  of  the  Q  functions.  For  the 
optimum  case,  dPe/da  =  0.  Use  the  fact  that 

Q(x)  =  1 - JL  f  <r>2/2  dy 

y/2n  J-oo 

and 

dQM  = _ Le-^/2 

dx  y/ 2n 


8.2-9  The  joint  PDF  pxy  (jc,  y )  of  two  continuous  RVs  is  given  by 

Px y(*,  y)  =Axye~(x2)e~y2/2u(x)u{y) 

(a)  Find  A. 

(b)  Find px(x),  py(y ),  px\y(x\y ),  and/7y|x(y|x). 

(c)  Are  x  and  y  independent? 

8.2-10  The  joint  PDF  of  RVs  x  and  y  is  shown  in  Fig.  P8.2-10. 

(a)  Determine:  (i)  A;  (ii ) pxM;  (iii)py(y):  (iv)  Px\y(x\y);  (v)  PyixCvIx). 

(b)  Are  x  and  y  independent?  Explain. 


figure 

P.8.2-10 


8.2- 11  RVs  x  and  y  are  said  to  be  jointly  Gaussian  if  their  joint  PDF  is  given  by 

.  1  -(ax2+by2-'2£Xy)/2M 

p'>(X'y)'2nVM 

where  M  =ab-c1.  Show  thatpx  W.  Py(ylPx\y(Myh  and  pylx(y\x)  are  all  Gaussian  and  that 
x2  =b,  y2  =  a,  and  xy  =  c. 
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Figure  P.8.3-6 


Hint:  Use 


r 


e-pr+qxdx  = 


8.2- 12  Use  Prob.  8.2-1 1  to  show  that  if  two  jointly  Gaussian  RVs  are  uncorrelated,  they  are  independent. 

8.2- 13  The  joint  PDF  of  RVs  x  and  y  is  given  by 


pXy(x,  y )  = 


Determine  (a)  the  constant  k;  (b)  px(x)\  (c)  py(y);  (d)  px|yU».v);  (e)  Py\x(y\x).  Are  x  andy 
independent? 

8.3- 1  If  an  amplitude  x  of  a  Gaussian  signal  x(/)  has  a  mean  value  of  2  and  an  RMS  value  of 

determine  its  PDF. 

8.3- 2  Determine  the  mean,  the  mean  square,  and  the  variance  of  the  RV  x  with  PDF 

px(x)  =  C  ■  £?-2|jr~21 


8.3-3  Determine  the  mean  and  the  mean  square  value  of  x  whose  PDF  is 


8.3-4 


Let  X/  —  il,  i  —  1 . 8  be  binary  independent  random  variables  with  equal  probability.  A 

new  random  variable  is  constructed  from 


8 

y  =  X> 

i=i 

Determine  the  mean  and  the  mean  square  value  of  y. 

8.3- 5  Find  the  mean,  the  mean  square,  and  the  variance  of  the  RV  x  in  Prob.  8.2-9. 

8.3- 6  Find  the  mean,  the  mean  square,  and  the  variance  of  the  RV  x  in  Fig.  P8.3-6. 


P,(x) 


8.3-7  Find  the  mean,  the  mean  square,  and  the  variance  of  the  RV  y  in  Prob.  8.2-10. 
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8.3- 8  For  a  Gaussian  PDF p\(x)  =  (l/crx\/27r)?  xl^a* ,  show  that 

(1)(3)(5)  •  •  •  (n  -  Ucr'1  weven 
0  n  odd 

Hint :  See  appropriate  definite  integrals  in  any  standard  mathematical  table. 

8.3- 9  Find  the  mean  and  the  variance  of  a  Rayleigh  RV. 

8.3-10  A  random  signal  x  is  measured  to  have  mean  of  2  and  variance  of  3.  Without  knowing  its  PDF. 
estimate  the  probability  that  x  is  between  0  and  4. 

8.4- 1  Determine  the  correlation  between  random  variables  x  and  y  in  Prob.  8.2-9. 

8.4- 2  Determine  the  correlation  between  random  variables  x  and  y  in  Prob.  8.2-10. 

8.4- 3  Determine  the  correlation  between  random  variables  x  and  y  in  Prob.  8.2-1 1 . 

8.4- 4  Determine  the  correlation  between  random  variables  x  and  y  in  Prob.  8.2- 1 3. 

8.5- 1  Find  the  linear  mean  square  estimation  of  y  from  x  in  Prob.  8.2-10. 

8.5- 2  Find  the  linear  mean  square  estimation  of  y  from  x  in  Prob.  8.2-11 . 

8.5- 3  Find  the  linear  mean  square  estimation  of  y  from  x  in  Prob.  8.2-13. 

8.5- 4  Let  x/  =  ±1,  i  =  1 . 8  be  binary  independent  random  variables  with  equal  probability.  A 

new  random  variable  is  constructed  from 

8 

y  =  I> 

i=l 

(a)  Use  x/  =  ±1,  /  =  1, _ 4  to  determine  the  minimum  mean  square  error  linear  estimator 

of  y. 

(b)  Find  the  minimum  mean  square  error  in  part  (a). 

8.5- 5  Show  that  |pXy  I  <  1,  where  pxy  is  the  correlation  coefficient  [Eq.  (8.79)]  of  RVs  x  and  y. 

Hint :  For  any  real  number  a, 

[a(x  -  x)  —  (y  —  y)l2 

The  discriminant  of  this  quadratic  in  a  is  non  positive. 

8.5- 6  Show  that  if  two  RVs  x  and  y  are  related  by 

y  =  k\x  +  k2 

where  A' i  and  ki  are  arbitrary  constants,  the  correlation  coefficient  pxy  =  1  if  A]  is  positive,  and 
pxy  =  - 1  if  k\  is  negative. 

8.5- 7  Given  x  =  cos  ©  and  y  =  sin  0,  where  ©  is  an  RV  uniformly  distributed  in  the  range  (0.  2: r), 

show  that  x  and  y  are  uncorrelated  but  are  not  independent. 
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8.6-1  The  random  binary  signal  x(f),  shown  in  Fig.  P8.6-la,  can  take  on  only  two  values,  3  and  0,  with 
equal  probability.  A  symmetric  exponential  channel  noise  n(/)  shown  in  Fig.  P8.6-lb  is  added 
to  this  signal,  giving  the  received  signal  y (/).  The  PDF  of  the  noise  amplitude  n  is  exponential 
(see  Prob.  8.2-4)  with  a  zero  mean  and  a  variance  of  2.  Determine  and  sketch  the  PDF  of  the 
amplitude  y. 

Hint:  Use  ofEq.  (8.92)  yields py(n)  =  px(n)  *pn00- pn (jc)  =  exp(-2|jc|). 


Figure  P.8.6-1 


(a) 


(b) 


8.6- 2  Repeat  Prob.  8.6-1  if  the  amplitudes  3  and  0  of  x(/)  are  not  equiprobable  but  Px(3)  =  0.6  and 

Px(0)  =  0.4. 

8.6- 3  If  x(/)  and  y (/)  are  both  independent  binary  signals  each  taking  on  values  —1  and  1  only  with 


P x(l)  =  G  =  1  -  Px(-l) 
Py(l)=P  =  1  -/>y(-l) 


determine  Pz(zi)  where  z  =  x  +  y. 

8.6-4  If  z  =  x  +  y,  where  x  and  y  are  independent  Gaussian  RVs  with 


Px(x)  = 


e 


then  show  that  z  is  also  Gaussian  with 


Z  ~  x  +  y  and  aj  =  cr?  +  ory 


Hint:  Convolve  px (jc)  and  py  ( y ).  See  pair  22  in  Table  3.1. 

8.6  5  In  Example  8.24,  design  the  optimum  third-order  predictor  processor  for  speech  signals  and 
determine  the  SNR  improvement.  Values  of  various  correlation  coefficients  for  speech  signals 
are  given  in  Example  8.24. 

8.7-1  In  a  binary  communication  channel,  each  data  packet  has  2048  bits  Each  bit  can  be  in  error 

.independently  with  probability  of  10~3.  Find  the  approximate  probability  that  less  than  10%  of 
the  bits  are  in  error. 
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RANDOM  PROCESSES  AND 
SPECTRAL  ANALYSIS 


he  notion  of  a  random  process  is  a  natural  extension  of  the  random  variable  ( RV)  concept. 


Consider,  for  example,  the  temperature  x  of  a  certain  city  at  noon.  The  temperature  x  is 


JL  an  RV  and  takes  on  a  different  value  every  day.  To  get  the  complete  statistics  of  x,  we 
need  to  record  values  of  x  at  noon  over  many  days  (a  large  number  of  trials).  From  this  data, 
we  can  determine  /?*(.*),  the  PDF  of  the  RV  x  (the  temperature  at  noon). 

But  the  temperature  is  also  a  function  of  time.  At  1  p.m.,  for  example,  the  temperature 
may  have  an  entirely  different  distribution  from  that  of  the  temperature  at  noon.  Still,  the 
two  temperatures  may  be  related,  via  a  joint  probability  density  function.  Thus,  this  random 
temperature  x  is  a  function  of  time  and  can  be  expressed  as  x(/).  If  the  random  variable  is 
defined  for  a  time  interval  t  6  [ta ,  f/>],  then  x(r)  is  a  function  of  time  and  is  random  for  every 
instant/  €  [ ta ,  /^].  An  RV  that  is  a  function  of  time*  is  called  a  random,  or  stochastic,  process. 
Thus,  a  random  process  is  a  collection  of  an  infinite  number  of  RVs.  Communication  signals 
as  well  as  noises,  typically  random  and  varying  with  time,  are  well  characterized  by  random 
processes.  For  this  reason,  random  processes  are  the  subject  of  this  chapter,  which  prepares  us 
to  study  the  performance  analysis  of  different  communication  systems  in  later  chapters. 


9.1  FROM  RANDOM  VARIABLE  TO 
RANDOM  PROCESS 


To  specify  an  RV  x,  we  run  multiple  trials  of  the  experiment  and  from  the  outcomes  estimate 
px(x).  Similarly,  to  specify  the  random  process  x(f),  we  do  the  same  thing  for  each  time 
instant  t.  To  continue  with  our  example  of  the  random  process  x(r),  the  temperature  of  the 
city,  we  need  to  record  daily  temperatures  for  each  value  of  t.  This  can  be  done  by  recording 
temperatures  at  every  instant  of  the  day,  which  gives  a  waveform  .t(f,  &),  where  indicates 
the  day  for  which  the  record  was  taken.  We  need  to  repeat  this  procedure  every  day  for  a 
large  number  of  days.  The  collection  of  all  possible  waveforms  is  known  as  the  ensemble 
(corresponding  to  the  sample  space)  of  the  random  process  x(t).  A  waveform  in  this  collection 
is  a  sample  function  (rather  than  a  sample  point)  of  the  random  process  (Fig.  9.1).  Sample 


*  Actually,  to  qualify  as  a  random  process,  x  could  be  a  function  of  any  practical  variable,  such  as  distance.  In  fact,  a 
random  process  may  also  be  a  function  of  more  than  one  variable. 
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Figure  9.1 

Random  process 
for  representing 
the  temperature 
of  a  city. 


Figure  9.2 

Ensemble  with  a 
finite  number  of 
sample  functions. 


x(r  ] )  =  otj  x(r2)  =  *2 
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function  amplitudes  at  some  instant  t  —  t\  are  the  random  values  taken  by  the  RV  x(fi) 
various  trials. 

We  can  view  a  random  process  in  another  way.  In  the  case  of  an  RV,  the  outcome  of 
each  trial  of  the  experiment  is  a  number.  We  can  view  a  random  process  also  as  the  outcome 
of  an  experiment,  where  the  outcome  of  each  trial  is  a  waveform  (a  sample  function)  that  is 
a  function  ot  t.  The  number  of  waveforms  in  an  ensemble  may  be  finite  or  infinite.  In  the 
case  of  the  random  process  x(/)  (the  temperature  of  a  city),  the  ensemble  has  infinitely  many 
waveforms.  On  the  other  hand,  if  we  consider  the  output  of  a  binary  signal  generator  (over  the 
period  0  to  107"),  there  are  at  most  2 10  waveforms  in  this  ensemble  (Fig.  9.2). 

One  fine  point  that  needs  clarification  is  that  the  waveforms  (or  sample  functions)  in  the 
ensemble  are  not  random.  They  have  occurred  and  are  therefore  deterministic.  Randomness 
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in  this  situation  is  associated  not  with  the  waveform  but  with  the  uncertainty  regarding  which 
waveform  would  occur  in  a  given  trial.  This  is  completely  analogous  to  the  situation  of  an  RV. 
For  example,  in  the  experiment  of  tossing  a  coin  four  times  in  succession  (Example  8.4),  16 
possible  outcomes  exist,  all  of  which  are  known.  The  randomness  in  this  situation  is  associated 
not  with  the  outcomes  but  with  the  uncertainty  about  which  of  the  16  outcomes  will  occur  in  a 
given  trial.  Indeed,  the  random  process  is  basically  an  infinitely  long  vector  of  random  variables. 
Once  an  experiment  has  been  completed,  the  sampled  vector  is  deterministic.  However,  since 
each  element  in  the  vector  is  random,  the  experimental  outcome  is  also  random,  leading  to 
uncertainty  over  what  vector  (or  function)  will  be  generated  in  each  experiment. 

Characterization  of  a  Random  Process 

The  next  important  question  is  how  to  characterize  (describe)  a  random  process.  In  some  cases, 
we  may  be  able  to  describe  it  analytically.  Consider,  for  instance,  a  random  process  described 
by  \(t)  =  A  cos  ( coct  +  0),  where  0  is  an  RV  uniformly  distributed  over  the  range  (0,  2tt  ). 
This  analytical  expression  completely  describes  a  random  process  (and  its  ensemble).  Each 
sample  function  is  a  sinusoid  of  amplitude  A  and  frequency  coc.  But  the  phase  is  random  (see 
later:  Fig.  9.5).  It  is  equally  likely  to  take  any  value  in  the  range  (0,  2 tt).  Such  an  analytical 
description  requires  well-defined  models  such  that  the  random  process  is  characterized  by 
specific  parameters  that  are  random  variables. 

Unfortunately,  we  are  not  always  able  to  describe  a  random  process  analytically.  Without 
a  specific  model,  we  may  have  just  an  ensemble  that  was  obtained  experimentally.  From  this 
ensemble,  which  has  complete  information  about  the  random  process,  we  must  find  some 
quantitative  measure  that  will  specify  or  characterize  the  random  process.  In  this  case,  we 
consider  the  random  process  to  be  an  RV  x  that  is  a  function  of  time.  Thus,  a  random  process 
is  just  a  collection  of  an  infinite  number  of  RVs,  which  are  generally  dependent.  We  know  that 
complete  information  of  several  dependent  RVs  is  provided  by  the  joint  PDF  of  those  variables. 
Let  X/  represent  the  RV  x(f/)  generated  by  the  amplitudes  of  the  random  process  at  instant  t  =  //. 
Thus,  x  i  is  the  RV  generated  by  the  amplitudes  at  t  =  fj,  and  x2  is  the  RV  generated  by  the 
amplitudes  at  t  =  f2 ,  and  so  on,  as  shown  in  Fig.  9.1.  The  n  RVs  \  x2,  X3, . . . ,  x„  generated 

by  the  amplitudes  at  t  =  t\ ,  tj,  ^3,* . . *  tn ,  respectively,  are  dependent  in  general.  For  the  n 
samples,  they  are  fully  characterized  by  the  nth-order  joint  probability  density  function  (PDF) 
or  the  nth-order  joint  cumulative  distribution  function  (CDF) 


FX(JCI,  *2,...,  Xnm,tu  th  •  • 


tn)  =  P[x(fl)  <  JCi;x(f2)  <  *2; . .  •  ;  x(*n)  <Xn\ 


The  definition  of  the  joint  CDF  of  the  n  random  samples  leads  to  the  joint  PDF 

d" 


Px(x  1,  *2,...,  Xn\t  1,  fn)  - 


dx\  dX2  •  •  •  dx„ 


F x  (*t  1  *  X2 1  •  •  • »  Xfj ,  /] ,  ^2, .  .  .  f  tfi)  (9.1) 


This  discussion  provides  some  good  insight.  It  can  be  shown  that  the  random  process  is  com- 
pletely  described  by  the  nth-order  joint  PDF  (9.1)  for  all  n  (up  to  oo)  and  for  any  choice  of 
t\  h  t3  tH.  Determining  this  PDF  (of  infinite  order)  is  a  formidable  task.  Fortunately, 
we  shall  soon  see  that  in  the  analysis  of  random  signals  and  noises  in  conjunction  with  linear 
systems  we  are  often  content  with  the  specifications  of  the  first-  and  second-order  statistics. 

Higher  order  PDF  is  the  joint  PDF  of  the  random  process  at  multiple  time  instants.  Hence, 
we  can  always  derive  a  lower  order  PDF  from  a  higher  order  PDF  by  simple  integration. 

For  instance. 


pA* D  = 


P\  (.v  1 ,  X2.  t| ,  tl )  dx2 
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Figure  9.3 

Random  process 
for  representing 
a  channel  noise. 
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varying  random 
process. 


Hence,  when  the  nth-order  PDF  is  available,  there  is  no  need  to  separately  specify  PDFs  of 
order  lower  than  n. 

The  mean  x(f)  of  a  random  process  x(t)  can  be  determined  from  the  first-order  PDF  as 

/oo 

xpx(x\  t)dx  (9-2) 

-OO 

which  is  typically  a  deterministic  function  of  time  t. 


Why  Do  We  Need  Ensemble  Statistics? 

The  preceding  discussion  shows  that  to  specify  a  random  process,  we  need  ensemble  statistics. 
For  instance,  to  determine  the  PDF  /?x,(jc i),  we  need  to  find  the  values  of  all  the  sample 
functions  at  t  =  t \.  This  is  ensemble  statistics.  In  the  same  way,  acquiring  other  statistics  to 
specify  a  random  process  also  necessitates  some  kind  of  ensemble  statistics.  In  deterministic 
signals,  we  are  used  to  studying  the  data  of  a  waveform  (or  waveforms)  as  a  function  of  time. 
Hence,  the  idea  of  investigating  ensemble  statistics  makes  us  feel  a  bit  uncomfortable  at  first. 
Theoretically,  we  may  accept  it,  but  does  it  have  any  practical  significance?  How  is  this  concept 
useful  in  practice?  We  shall  now  answer  this  question. 

To  understand  the  necessity  ot  ensemble  statistics,  consider  the  problem  of  threshold 
detection  in  Example  8.16.  A  1  is  transmitted  by  p(t)  and  a  0  is  transmitted  by  -pit)  (p°lar 
signaling).  The  peak  pulse  amplitude  is  Ap.  When  1  is  transmitted,  the  received  sample  value 
is  Ap  +  n,  where  n  is  the  noise.  We  would  make  a  decision  error  if  the  noise  value  at  the 
sampling  instant  ts  is  less  than  —  Ap,  forcing  the  sum  of  signal  and  noise  to  fall  below  the 
threshold.  To  find  this  error  probability,  we  repeat  the  experiment  N  times  (N  -»  oo)  and  see 
how  many  times  the  noise  at  t  =  ts  is  less  than  -Ap  (Fig.  9.3).  This  information  is  precisely 
one  of  ensemble  statistics  of  the  noise  process  n(t)  at  instant  rv. 

The  importance  of  ensemble  statistics  is  clear  from  this  example.  When  we  are  dealing  with 
a  random  process  or  processes,  we  do  not  know  which  sample  function  will  occur  in  a  given 
trial.  Hence,  for  any  statistical  specification  and  characterization  of  the  random  process,  we 
need  to  average  over  the  entire  ensemble.  This  is  the  basic  physical  reason  for  the  appearance 
of  ensemble  statistics  in  random  processes. 

Autocorrelation  Function  of  a  Random  Process 

Foi  the  purpose  of  signal  analysis,  one  of  the  most  important  (statistical)  characteristics  of a 
random  process  is  its  autocorrelation  function,  which  leads  to  the  spectral  information  of  the 
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Figure  9.4 

Autocorrelation 
functions  for  a 
slowly  varying 
and  a  rapidly 
varying  random 
process. 
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random  process.  The  spectral  content  of  a  process  depends  on  the  rapidity  of  the  amplitude 
change  with  time.  This  can  be  measured  by  correlating  amplitudes  at  t\  and  t\  +  r.  On  average, 
the  random  process  x(f)  in  Fig.  9.4a  is  a  slowly  varying  process  compared  to  the  process  y (/)  in 
Fig.  9.4b.  For  x(f),  the  amplitudes  at  t\  and  t\  +  r  are  similar  (Fig.  9.4a),  that  is,  have  stronger 
correlation.  On  the  other  hand,  for  y(0,  the  amplitudes  at  t\  and  t\  +  r  have  little  resemblance 
(Fig.  9.4b),  that  is,  have  weaker  correlation.  Recall  that  correlation  is  a  measure  of  the  similarity 
of  two  RVs.  Hence,  we  can  use  correlation  to  measure  the  similarity  of  amplitudes  at  t\  and 
t2  =  /,  +  r.  If  the  RVs  x(/|)  and  x(/2)  are  denoted  by  xi  and  x2,  respectively,  then  for  a  real 
random  process,*  the  autocorrelation  function  Rx(t\,  ^2)  defined  as 


Rx(t\,  t2)  =  X(tl)x(t2)  =  X,X2 


(9.3a) 


This  is  the  correlation  of  RVs  x(fi)  and  x(/2),  indicating  the  similarity  between  RVs  x(/,)  and 
x(f2).  It  is  computed  by  multiplying  amplitudes  at  t\  and  f2  of  a  sample  function  and  then 
averaging  this  product  over  the  ensemble.  It  can  be  seen  that  for  a  small  r,  the  product  x\x2 
will  be  positive  for  most  sample  functions  of  x(f),  but  the  product  y,y2  will  be  equally  likely 
to  be  positive  or  negative.  Hence,  x&2  will  be  larger  than  y,y2.  Moreover,  x,  and  x2  will 
show  correlation  for  considerably  larger  values  of  r,  whereas  y,  and  y2  will  lose  correlation 
quickly  even  for  small  r,  as  shown  in  Fig.  9.4c.  Thus,  Rx«u  h),  the  autocorrelation  function 
of  x(/)  provides  valuable  information  about  the  frequency  content  of  the  process.  In  fact,  we 
shall  show  that  the  PSD  of  x(f)  is  the  Fourier  transform  of  its  autocorrelation  function,  given 


*  For  a  complex  random  process  x(l),  the  autocorrelation  function  is  defined  as  follows: 


Rx(t |,  t2)  =  x*(t|)x(r2) 
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by  (for  real  processes) 


Rx(t i,  t2)  =  XiX2 

roo  r oo 


/oo  rO 

-OO*/  — 0 


x2-  D  •  t2)dx\dx2 


(9.3b) 


Hence,  R x  (t\ ,  t2)  can  be  derived  from  the  joint  PDF  of  x  |  and  \2.  which  is  the  second-order  PDF. 


9.2  CLASSIFICATION  OF  RANDOM  PROCESSES 

We  now  clarify  the  definition  of  some  important  categories  of  random  processes. 

Stationary  and  Nonstationary  Random  Processes 

A  random  process  whose  statistical  characteristics  do  not  change  with  time  is  classified  as  a 
stationary  random  process.  For  a  stationary  process,  we  can  say  that  a  shift  of  time  origin 
will  be  impossible  to  detect;  the  process  will  appear  to  be  the  same.  Suppose  we  determine 
Px(x;  1 1 ),  then  shift  the  origin  by  to ,  and  again  determine  px(jc;  t\ ).  The  instant  t\  in  the  new 
frame  of  reference  is  ti  =  t\  +  to  in  the  old  frame  of  reference.  Hence,  the  PDFs  of  x  at  t\  and 
t2  =  t\  T-  to  must  be  the  same,  that  is ,px(x;  t\ )  and px(x\  ti)  must  be  identical  for  a  stationary 
random  process.  This  is  possible  only  if  px(x;  t)  is  independent  of  t.  Thus,  the  first-order 
density  of  a  stationary  random  process  can  be  expressed  as 

Px(x;  t)  =  px(x) 

Similarly,  for  a  stationary  random  process  the  autocorrelation  function  Rx(t\ ,  ti)  must  depend 
on  1 1  and  t2  only  through  the  difference  t2  — 1\ .  If  not,  we  could  determine  a  unique  time  origin. 
Hence,  for  a  real  stationary  process, 

tfxfo,  *2)  =  R\(t2  -  fl) 


Therefore, 


rtx(r)  =  x(/)x(/  +  r)  (9-4) 

For  a  stationary  process,  the  joint  PDF  tor.ri  and  x2  must  also  depend  only  on  t2  —  ti  •  Simile 
higher  order  PDFs  are  all  independent  of  the  choice  of  origin,  that  is, 

P\(x\ ,  x2, . . . ,  x„;  t\,  t2,...,  tn)  =  px(xi,  x2,...,  x„;  t\  —  t,  t2  —  t, . . . ,  t„  —  t) 

=  Px(xi ,  X2, . . . ,  JC„;  o,  t2  -  t\ . (9'5) 

The  random  process  x (/)  representing  the  temperature  of  a  city  is  an  example  of  a  nonsta 
tionary  random  process,  because  the  temperature  statistics  (mean  value,  e.g.)  depend  on  the 
time  of  day.  On  the  other  hand,  the  noise  process  in  Fig.  9.3  is  stationary  because  its  statistics 
(the  mean  and  the  mean  square  values,  e.g.)  do  not  change  with  time.  In  general,  it  is  not 
easy  to  determine  whether  a  process  is  stationary,  because  the  task  involves  investigation  « 
/ith-order  ( n  =  oo)  statistics.  In  practice,  we  can  ascertain  stationarity  if  there  is  no  change  in 
the  signal-generating  mechanism.  Such  is  the  case  for  the  noise  process  in  Fig.  9.3. 
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Wide-Sense  (or  Weakly)  Stationary  Processes 

A  process  that  is  not  stationary  in  the  strict  sense,  as  discussed  in  the  last  subsection,  may  yet 
have  a  mean  value  and  an  autocorrelation  function  that  are  independent  of  the  shift  of  time 
origin.  This  means 


\(t)  =  constant 


and 


t2)  =  Rx( t)  t  =  t2-t\ 


(9.6) 


Such  a  process  is  known  as  a  wide-sense  stationary,  or  weakly  stationary,  process.  Note 
that  stationarity  is  a  stronger  condition  than  wide-sense  stationarity.  Stationary  processes 
with  a  well-defined  autocorrelation  function  are  wide-sense  stationary,  but  the  converse  is 
not  necessarily  true,  except  for  Gaussian  random  processes. 

Just  as  no  sinusoidal  signal  exists  in  actual  practice,  no  truly  stationary  process  can  occur 
in  real  life.  All  processes  in  practice  are  nonstationary  because  they  must  begin  at  some  finite 
time  and  terminate  at  some  finite  time.  A  truly  stationary  process  would  start  at  /  =  —  oo  and  go 
on  forever.  Many  processes  can  be  considered  to  be  stationary  for  the  time  interval  of  interest, 
however,  and  the  stationarity  assumption  allows  a  manageable  mathematical  model.  The  use 
of  a  stationary  model  is  analogous  to  the  use  of  a  sinusoidal  model  in  deterministic  analysis. 


*9.1  Show  that  the  random  process 


x(f)  =  A  cos  (c oct  +  ©) 


where  0  is  an  RV  uniformly  distributed  in  the  range  (0,  270,  is  a  wide-sense  stationary  process. 

The  ensemble  (Fig.  9.5)  consists  of  sinusoids  of  constant  amplitude  A  and  constant  fre¬ 
quency  o)c ,  but  the  phase  ©  is  random.  For  any  sample  function,  the  phase  is  equally 
likely  to  have  any  value  in  the  range  (0,  2tt).  Because  ©  is  an  RV  uniformly  distributed 
over  the  range  (0,  2j r),  one  can  determine1  px(x ,  t)  and,  hence,  x(»,  as  in  Eq.  (9.2).  For 
this  particular  case,  however,  x(/)  can  be  determined  directly  as  a  function  of  random 
variable  0: 


x(f)  =  A  cos  (o)(  t  +  ©)  =  A  cos  (a)ct  +  0) 


Because  cos  (coct  +  0)  is  a  function  of  an  RV  0,  we  have  [see  Eq.  (8.61b)] 


Because  Pq(0)  =  \/2n  over  (0,  2n)  and  0  outside  this  range. 


Hence, 


x(f)  =  0 


(9.7a) 
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Figure  9.5 

An  ensemble  for 
the  random 
process 

A  cos  (a>cf  +  ©)• 


Thus,  the  ensemble  mean  of  sample  function  amplitudes  at  any  instant  t  is  zero. 

The  autocorrelation  function  Rx  (t\ ,  t2)  for  this  process  also  can  be  determined  directly 
from  Eq.  (9.3a), 


RAtU  t2)  =  A2  COS  (C0ct\  +  0)  COS  ((Vct2  +  ®) 

=  A2  cos  ( (oct\  +  @)  cos  (d)ct2  T  0) 

A2  r - - 1 

=  " 2  |cos  t^c(^2  “  *l)]  +  cos  \(Oc{t2  +  ri)  +  20]  J 

The  first  term  on  the  right-hand  side  contains  no  RV.  Hence,  cos[coc(t2  -  t\Y\  is  cosK 
(t2  -  1 1 )]  itself.  The  second  term  is  a  function  of  the  uniform  RV  0,  and  its  mean  is 

- l  r2jl 

cos  [(oc{t2  +  *i)  +  20]  =  —  /  cos[(Dc(t2  +  ti)  +  2O]d0  =  0 

2tz  J o 

Hence, 

R*(tu  t2)  =  y  COS  [(Dc (t2  -  t\)] 
or 

R\A)  =  y  COS  0)CT  T  =  t2  —  t\ 

From  Eqs.  (9. 1  a,b)  it  is  clear  that  x(/)  is  a  wide-sense  stationary  process. 


hrgodic  Wide-Sense  Stationary  Processes 

We  have  studied  the  mean  and  the  autocorrelation  function  of  a  random  process.  These  atf 
ensemble  averages.  For  example,  x(t)  is  the  ensemble  average  of  sample  function  ampli^eS 


Figure  9.6 

Classification  of 

random 

processes. 
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atr,and/?x(/|,  ti)  =  xi  X2  is  the  ensemble  average  of  the  product  of  sample  function  amplitudes 
x(/i)  and  x(/2). 

We  can  also  define  time  averages  for  each  sample  function.  For  example,  a  time  mean  x(t) 
of  a  sample  function  x(t)  is* 


x  (t)  dt 


(9.8a) 


Similarly,  the  time  autocorrelation  function  )  defined  in  Eq.  (3.83)  is 


(9.8b) 


For  ergodic  (wide-sense)  stationary  processes,  ensemble  averages  are  equal  to  the  time 
averages  of  any  sample  function.  Thus,  for  an  ergodic  process  x(f). 


(9.9a) 

(9.9b) 


x(r)  =  x(t) 

RA  r)  =  nx(T) 


These  are  the  two  averages  for  ergodic  wide-sense  stationary  processes.  For  the  broader  def¬ 
inition  of  an  ergodic  process,  all  possible  ensemble  averages  are  equal  to  the  corresponding 
time  averages  of  one  of  its  sample  functions.  Figure  9.6  illustrates  the  relation  among  different 
classes  of  (ergodic)  processes.  In  the  coverage  of  this  book,  our  focus  lies  in  the  class  of  ergodic 
wide-sense  stationary  processes. 

It  is  difficult  to  test  whether  a  process  is  or  is  not  ergodic  because  we  must  test  all  pos¬ 
sible  orders  of  time  and  ensemble  averages.  Nevertheless,  in  practice  many  of  the  stationary 
processes  are  ergodic  with  respect  to  at  least  low-order  statistics,  such  as  the  mean  and  the 

autocorrelation.  For  the  process  in  Example  9.1  (Fig.  9.5),  we  can  show  that  *(/)=  0  and 
7?x(r)  =  (A2 /2)  cos  a>cr  (see  Prob.  3.8-1).  Therefore,  this  process  is  ergodic  at  least  with 

respect  to  the  first-  and  second-order  averages. 

The  ergodicity  concept  can  be  explained  by  a  simple  example  of  traffic  lights  in  a  city. 
Suppose  the  city  is  well  planned,  with  all  its  streets  running  east  to  west  and  north  to  south 


only  and  with  traffic  lights  at  each  intersection.  Assume  that  each  light  stays  green  for  0.75 


second  in  the  E-W  direction  and  0.25  second  in  the  N-S  direction  and  that  switching  of  any 
light  is  independent  of  the  other  lights.  For  the  sake  of  simplicity,  we  ignore  the  amber  light. 


*  Here  a  sample  function  x(t,  f.  )  is  represented  by  x(t)  for  convenience. 
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If  we  consider  a  certain  motorist  approaching  any  traffic  light  randomly  from  the  E-W 
direction,  the  probability  that  the  person  will  have  a  green  light  is  0.75:  that  is,  on  the  average, 
75%  of  the  time  the  person  will  observe  a  green  light.  On  the  other  hand,  it  we  consider  a  large 
number  of  drivers  arriving  at  a  traffic  light  in  the  E-W  direction  at  some  instant  t,  then  75% 
of  the  drivers  will  have  a  green  light,  and  the  remaining  25%  will  have  a  red  light.  Thus,  the 
experience  of  a  single  driver  arriving  randomly  many  times  at  a  traffic  light  will  contain  the 
same  statistical  information  (sample  function  statistics)  as  that  of  a  large  number  of  drivers 
arriving  simultaneously  at  various  traffic  lights  (ensemble  statistics)  at  one  instant. 

The  ergodicity  notion  is  extremely  important  because  we  do  not  have  a  large  number  of 
sample  functions  available  in  practice  from  which  to  compute  ensemble  averages.  It  the  process 
is  known  to  be  ergodic,  then  we  need  only  one  sample  function  to  compute  ensemble  averages. 
As  mentioned  earlier,  many  of  the  stationary  processes  encountered  in  practice  are  ergodic 
with  respect  to  at  least  first-  and  second-order  averages.  As  we  shall  see  when  dealing  with 
stationary  processes  in  conjunction  with  linear  systems,  we  need  only  the  first-  and  second- 
order  averages.  This  means  that  in  most  cases  we  can  get  by  with  a  single  sample  function,  as 
is  often  the  case  in  practice. 


9.3  POWER  SPECTRAL  DENSITY 

An  electrical  engineer  instinctively  thinks  of  signals  and  linear  systems  in  terms  of  their  fre¬ 
quency  domain  descriptions.  Linear  systems  are  characterized  by  their  frequency  response  (the 
transfer  function),  and  signals  are  expressed  in  terms  of  the  relative  amplitudes  and  phases  of 
their  frequency  components  (the  Fourier  transform).  From  a  knowledge  of  the  input  spectrum 
and  transfer  function,  the  response  of  a  linear  system  to  a  given  signal  can  be  obtained  in 
terms  of  the  frequency  content  of  that  signal.  This  is  an  important  analytical  procedure  tor 
deterministic  signals.  We  may  wonder  if  similar  methods  may  be  found  for  random  processes. 
Ideally,  all  the  sample  functions  of  a  random  process  are  assumed  to  exist  over  the  entire  time 
interval  (—00,  00)  and  thus,  are  power  signals.*  We  therefore  inquire  about  the  existence  0 
a  power  spectral  density  (PSD).  Superficially,  the  concept  of  a  PSD  of  a  random  process  may 
appear  ridiculous  for  the  following  reasons.  In  the  first  place,  we  may  not  be  able  to  describe 
a  sample  function  analytically.  Second,  for  a  given  process,  every  sample  function  may  be 
different  from  another  one.  Hence,  even  if  a  PSD  does  exist  for  each  sample  function,  it  may 
be  different  for  different  sample  functions.  Fortunately,  both  problems  can  be  neatly  resolve  « 
and  it  is  possible  to  define  a  meaningful  PSD  for  a  stationary  (at  least  in  the  wide  sense)  random 
process.  For  nonstationary  processes,  the  PSD  may  not  exist.  ^ 

Whenever  randomness  is  involved,  our  inquiries  can  at  best  provide  answers  in  terms 
averages.  When  tossing  a  coin,  for  instance,  the  most  we  can  say  about  the  outcome  is 
on  the  average  we  will  obtain  heads  in  about  half  the  trials  and  tails  in  the  remaining 
of  the  trials.  For  random  signals  or  RVs,  we  do  not  have  enough  information  to  predict 
outcome  with  certainty,  and  we  must  accept  answers  in  terms  of  averages.  It  is  not  possi  ^ 
to  transcend  this  limit  of  knowledge  because  of  our  fundamental  ignorance  of  the  process, 
seems  reasonable  to  define  the  PSD  of  a  random  process  as  a  weighted  mean  of  the  Pb  s 
all  sample  functions.  This  is  the  only  sensible  solution,  since  we  do  not  know  exactly  whic 
the  sample  functions  may  occur  in  a  given  trial.  We  must  be  prepared  for  any  sample  function. 
Consider,  for  example,  the  problem  of  filtering  a  certain  random  process.  We  would  not  w 
to  design  a  filter  with  respect  to  any  one  particular  sample  function  because  any  ot  the  samp 


*  Stationary  processes,  because  their  statistics  do  not  change  with  time,  are  power  signals. 
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functions  in  the  ensemble  may  be  present  at  the  input.  A  sensible  approach  is  to  design  the  filter 
with  respect  to  the  mean  parameters  of  the  input  process.  In  designing  a  system  to  perform 
certain  operations,  one  must  design  it  with  respect  to  the  whole  ensemble.  We  are  therefore 
justified  in  defining  the  PSD  Sx(f )  of  a  random  process  \(t)  as  the  ensemble  average  of  the 
PSDs  of  all  sample  functions.  Thus  [see  Eq.  (3.79)], 


W/Hz 


(9.10a) 


where  Xj(f)  is  the  Fourier  transform  of  the  time-truncated  random  process 

xT(t)  =  x(f)  U(t/T) 

and  the  bar  atop  indicates  ensemble  average.  Note  that  ensemble  averaging  is  done  before  the 
limiting  operation.  We  shall  now  show  that  the  PSD  as  defined  in  Eq.  (9.10a)  is  the  Fourier 
transform  of  the  autocorrelation  function  Rx(r)  of  the  process  x(/);  that  is, 


R\(t)  <=>  Sx(f) 


(9.10b) 


This  can  be  proved  as  follows: 


(9.11) 


Thus,  for  real  x(/). 


\XT(f)\2  =  Xr(-/)Xr(0 


and 


[l  rT/2  rT/2 

-  /  /  xto)x(tz)e-fl*fta-Vdtidt2 

T  J-T/2J-T/2 


x(t\)\(t2)e~j27Tf('2  l^dt\dt2 


(9.12) 


Interchanging  the  operation  of  integration  and  ensemble  averaging,  we  get 


=  lim  -  [  [  ^  «2~ti)e  j27Tf('2  ")dt\dt2 


T^oo  T  J-T/2  J-T/2 


The  operation  of  ensemble  averaging  is  also  an  operation  of  integration.  Hence,  interchanging  integration  with 
K  .  .  .  •  _ _ Unnoinn  orrW  nt  infppration. 


ensemble  averaging  is  equivalent  to  interchanging  the  order  of  integration. 


522 


RANDOM  PROCESSES  AND  SPECTRAL  ANALYSIS 

Here  we  are  assuming  that  the  process  x(f)  is  at  least  wide-sense  stationary,  so  that  x(r,  )x(r2)  = 
R\(t2  —  fi).  For  convenience,  let 


Rx(.t2-ti)e-j2*f{,2-,l)  =  <p{t2-ti) 


(9.13) 


Then, 


(9.14) 


The  integral  on  the  right-hand  side  is  a  double  integral  over  the  range  (-7/ 2,  7/2)  for  each  ot 
the  variables  t\  and  f2.  The  square  region  of  integration  in  the  t\-tj  plane  is  shown  in  Fig.  9.7. 
The  integral  in  Eq.  (9.14)  is  a  volume  under  the  surface  <p(f2  -  / 1 )  over  the  square  region  in 
Fig.  9.7.  The  double  integral  in  Eq.  (9.14)  can  be  converted  to  a  single  integral  by  observing 
that  <p(t2  —  f i )  is  constant  along  any  line  r2  —  f|  =  r  (a  constant)  in  the  t\-t2  plane  (Fig.  9.7). 

Let  us  consider  two  such  lines,  r2  —  t\  =  r  and  f2  —  ti  =  r  +  Ar.  It  Ar  ->•  0,  then 
tp(t2  —  f|)  ~  <p( t)  over  the  shaded  region  whose  area  is  (7  —  r)  Ar.  Hence,  the  volume 
under  the  surface  <p(f2  —  fi)  over  the  shaded  region  is  <p(r)(T  —  r)  Ar.  If  r  were  negative, 
the  volume  would  be  «p(r)(7  +  r)  Ar.  Hence,  in  general,  the  volume  over  the  shaded  region 
is  (p(r)(T  —  |r  |)  Ar.  The  desired  volume  over  the  square  region  in  Fig.  9.7  is  the  sum  of  the 
volumes  over  the  shaded  strips  and  is  obtained  by  integrating  <p(r)(T  -  |r  |)  over  the  range  ot 
r,  which  is  (-7,  7 )  (see  Fig.  9.7).  Hence, 


Figure  9.7 

Derivation  of  the 
Wiener- 
Khintchine 
theorem. 


T 


t  +  At 


,  “  T 


'V  / 
/  w 


-T 
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provided  f_OQ  \x\(p(x)dx  is  bounded.  Substituting  Eq.  (9.13)  into  this  equation,  we  have 


00 


RAr)e~j2nfT  dr 


(9.15) 


-00 


provided  f^°OQ\x\Rx(r)e  }lTl^T  dr  is  bounded.  Thus,  the  PSD  of  a  wide-sense  stationary 
random  process  is  the  Fourier  transform  of  its  autocorrelation  function,* 


R\(t)  <=>  Sx(f) 


(9.16) 


This  is  the  well-known  Wiener-Khintchine  theorem,  first  seen  in  Chapter  3. 

From  the  discussion  thus  far,  the  autocorrelation  function  emerges  as  one  of  the  most 
significant  entities  in  the  spectral  analysis  of  a  random  process.  Earlier  we  showed  heuristically 
how  the  autocorrelation  function  is  connected  with  the  frequency  content  of  a  random  process. 

The  autocorrelation  function  Rx(r)  for  real  processes  is  an  even  function  of  r.  This  can 
be  proved  in  two  ways.  First,  because  \Xr(f)\2  =  \Xr(f)Xj(f)\  =  \Xr(f)Xr(—f)\  is  an  even 
function  of /,  Sx(f )  is  also  an  even  function  of /,  and  /?x (r ),  its  inverse  transform,  is  also 
an  even  function  of  r  (see  Prob.  3.1-1).  Alternately,  we  may  argue  that 


Rx(r)  =  x(f)x(f  +  r)  and  Rx(-r)  =  x(/)x(f  —  r) 


Letting  t  -  r  =  a,  we  have 


(9.17) 


Rx(-r)  =  x(<t)x(ct  +  r)  =  Rx(  r) 


The  PSD  Sx(f)  is  also  a  real  and  even  function  of/. 

The  mean  square  value  x2(f)  of  the  random  process  x(/)  is  Rx( 0), 


Rx(  0)  =  x(f)x(r)  =  x2(f)  =  x2 


(9.18) 


The  mean  square  value  x2  is  not  the  time  mean  square  of  a  sample  function  but  the  ensemble 
average  of  the  amplitude  squares  of  sample  functions  at  any  instant  t. 

The  Power  of  a  Random  Process 

The  power  Px  (average  power)  of  a  wide-sense  random  process  x(/)  is  its  mean  square  value  x2. 
From  Eq.  (9.16), 


/OO 

Sx(/V27f/r  df 

-00 


Hence,  from  Eq.  (9. 1 8), 


*  It  can  be  shown  that  Eq.  (9. 1 5)  holds  also  for  complex  random  processes,  for  which  we  define 
f?x(r)  =  x*(0x(r  +  r). 


(9.19a) 


524 


RANDOM  PROCESSES  AND  SPECTRAL  ANALYSIS 


Because  Sx(f)  is  an  even  function  off ,  we  have 

A  00 

Px  =  \2  =  2  Sx(f)df  (9.19b) 

Jo 

where/  is  the  frequency  in  Hertz.  This  is  the  same  relationship  as  that  derived  for  deterministic 
signals  in  Chapter  3  [Eq.  (3.82)].  The  power  Px  is  the  area  under  the  PSD.  Also,  Px  =  x2  is 
the  ensemble  mean  of  the  square  amplitudes  of  the  sample  functions  at  any  instant. 

It  is  helpful  to  note  here,  once  again,  that  the  PSD  may  not  exist  for  processes  that  are  not 
wide-sense  stationary.  Hence,  in  our  future  discussion,  unless  specifically  stated  otherwise, 
random  processes  will  be  assumed  to  be  at  least  wide-sense  stationary. 


Example  9.2 


Determine  the  autocorrelation  function  Rx{r)  and  the  power  Px  of  a  low-pass  random  process 
with  a  white  noise  PSD  Sx(f  )  =  / 2  (Fig.  9.8a). 


Figure  9.8 

Lowpass  white 
noise  PSD  and  its 
autocorrelation 
function. 


Sx(f ) 

m 

-B 

0 

B  /  — 

(a) 


We  have 


(9.20a) 


Hence,  from  Table  3.1  (pair  18), 

RAt)  =  A/7? sine  (27 zBt) 


(9.20b) 


This  is  shown  in  Fig.  9.8b.  Also, 


/»x  =  X2  =  Rx(0)  =ffB 


(9.20c) 
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Alternately, 


Sx(f)df 


=  A fB 


( 9.20d) 


Example  9.3  Determine  the  PSD  and  the  mean  square  value  of  a  random  process 

x(r)  =  A  cos  (a)ct  +  0)  (9.21a) 


where  0  is  an  RV  uniformly  distributed  over  (0,  In ). 


For  this  case  Rx(r)  is  already  determined  [Eq.  (9.7c)], 

A 2 

Rx(r)  =  —  cos  Ct)cT 


(9.21a) 


Hence, 


Sx(f)  =  jW+fc)  +  8(f~fc)]  (9.21b) 

PX=1?  =  RA  0)  =  y  (9.21c) 

Thus,  the  power,  or  the  mean  square  value,  of  the  process  x(t)  =  A  cos  ( coct  +  0)  is  A2/ 2. 
The  power  Px  can  also  be  obtained  by  integrating  Sx(f)  with  respect  to/. 


Example  9.4 


Amplitude  Modulation 

Determine  the  autocorrelation  function  and  the  PSD  of  the  DSB-SC-modulated  process 
m(/)  cos  (coct  +  0),  where  m(/)  is  a  wide-sense  stationary  random  process  and  0  is  an  RV 
uniformly  distributed  over  (0, 2 n )  and  independent  of  m(t). 


Let 


<p(t)  =  m(/)  cos  ( o)ct  +  ©) 


Then 


R^r)  =  m(f) cos  (co7l  +  0)  •  rn(t  +  r)cos[ojf(/  +  r)  +  0] 
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■■MJU 


Because  m(t)  and  0  are  independent,  we  can  write  [see  Eqs.  (8.64b)  and  (9.7c)] 

R^t)  =  m(f)m(f  +  r)  cos  (coct  4-  0)  cos  [a)c(t  +  r)  +  0] 

=  ^/?m(r)cos  (oct  (9.22a) 

Consequently,* 

S9{f)  =  '-[Sm{f  +fc)  +  Sm{f  -fc) ]  (9.22b) 

4 

From  Eq.  (9.22a)  it  follows  that 

vHt)  =  0)  =  ^m(O)  =  l-m2(t)  (9.22c) 

Hence,  the  power  of  the  DSB-SC-modulated  signal  is  half  the  power  of  the  modulating 
signal.  We  derived  the  same  result  earlier  [Eq.  (3.94)]  for  deterministic  signals. 


Figure  9.9 

Derivation  of  the 
autocorrelation 
function  and  PSD 
of  a  random 
binary  process. 


We  note  that,  without  the  random  phase  0,  a  DSB-SC  amplitude  modulated  signal 
m(/)  cos  (a)ct)  is  in  fact  not  wide-sense  stationary.  To  find  its  PSD,  we  can  resort  to  the  time 
autocorrelation  concept  of  Sec  3.8. 


Example  9.5  Random  Binary  Process 

In  this  example  we  shall  consider  a  random  binary  process  for  which  a  typical  sample 
function  is  shown  in  Fig.  9.9a.  The  signal  can  assume  only  two  states  (values),  1  or  -U 
with  equal  probability.  The  transition  from  one  state  to  another  can  take  place  only  at  node 
points,  which  occur  every  Th  seconds.  The  probability  of  a  transition  from  one  state  to  the 
other  is  0.5.  The  first  node  is  equally  likely  to  be  situated  at  any  instant  within  the  interval 
0  to  Tb  from  the  origin.  Analytically,  we  can  represent  x(f)  as 

x(0  =  ^  a„p(f  -  nTb  -  a) 

where  a  is  an  RV  uniformly  distributed  over  the  range  (0,  Tb)  and  p(t )  is  the  basic  pulse 
(in  this  case  n  [(r  —  Tb/2)/Tb]).  Note  that  a  is  the  distance  of  the  first  node  from  the  origin, 
and  it  varies  randomly  from  sample  function  to  sample  function.  In  addition,  a„  is  random, 
taking  values  1  or  —1  with  equal  probability.  The  amplitudes  at  t  represent  RV  xi* 
those  at  t  +  t  represent  RV  X2.  Note  that  xj  and  X2  are  discrete  and  each  can  assume  only 
two  values,  - 1  and  1 .  Hence, 

flx(t)  =  =  ^  x2 ) 

JT|  X2 

=  />x,x2(l,  U  +  Zx^C-l,  -1)  -^X,X2(-1,  1) -/>x,x2(1-  _1)  *9'23a> 


*  We  obtain  the  same  result  even  if  <p(t)  =  m(r)  sin  (cuct  +  0). 
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Figure  9.9 

Derivation  of  the 
autocorrelation 
function  and  PSD 
of  a  random 
binary  process. 


(a) 


b~ 

T 

/ 

n\ 

n2 

X1 

(b) 


By  symmetry,  the  first  two  terms  and  the  last  two  terms  on  the  right-hand  side  are 
equal.  Therefore, 

Px( r)  =  2[PX|X2(1,  1)  “  ,x20,  -1)]  (9.23b) 

From  Bayes’  rule,  we  have 

Px(r)  =  2PXl(l)[Px2|x1(l|l)-/>x2|xl(-iU)] 

=  PX2|X|(1|1)-PX2|X1(-1|1)  (9.23c) 

Moreover, 

/>x2|Xl(l|l)  =  l-PX2|x.(-l|l) 

Hence, 

Rx(t)  =  1  -  2P x2|xi (— 1|1) 

It  is  helpful  to  compute  Px( r)  for  small  values  of  r  first.  Let  us  consider  the  case  r  <  Tb, 
where  at  most,  one  node  is  in  the  interval  t  to  /  +  r .  In  this  case,  the  event  x2  =  - 1  given 
x  i  is  a  joint  event  A  fl  B,  where  the  event  A  is  “a  node  in  the  interval  (f,  t  +  r)  and 
B  is  “the  state  change  at  this  node.”  Because  A  and  B  are  independent  events, 

I 

px  |xi  (_i  1 1)  =  P(a  node  lies  in  Mo  t  4-  r)P( state  change) 

=  -P(a  node  lies  in  t  to  t  +  r) 
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Figure  9.9b  shows  adjacent  nodes  n  \  and  ni,  between  which  t  lies.  We  mark  off  the  interval 
r  from  the  node  «2-  If  t  lies  anywhere  in  this  interval  (sawtooth  line),  the  node  ni  lies 
within  t  and  t  +  r.  But  because  the  instant  l  is  chosen  arbitrarily  between  nodes  n\  and 
«2,  it  is  equally  likely  to  be  at  any  instant  over  the  Tb  seconds  between  n\  and  ni.  and  the 
probability  that  t  lies  in  the  designated  interval  is  simply  r/7/,.  Therefore. 

PW-HD^Qt)  (9-24) 

and 

RAr)  =y-~  r  <Tb  (9.25) 

Tb 

Because  R*(r)  is  an  even  function  of  r,  we  have 

*x(T)  =  l  — |t|  <Tb  (9.26) 

Tb 

Next,  consider  the  range  r  >  Tb.  In  this  case  at  least  one  node  lies  in  the  interval  t  to  t  +  t. 
Hence,  \\  and  X2  become  independent,  and 


/?x(r)  =  xix2  =  xi  x2  =  0  r  >  Th 

where,  by  inspection,  we  observe  that  \\  =  x2  =  0  (Fig.  9.9a).  This  result  can  also  be 
obtained  by  observing  that  for  |r|  >  7^,  xj  and  x2  are  independent,  and  it  is  equally  likely 
that  X2  =  1  or  —1  given  that  x\  =  1  (or  —1).  Hence,  all  four  probabilities  in  Eq.  (9.23a) 
are  equal  to  1  /4,  and 


Example  9.6 


Figure  9.10 

Random  PAM 
process. 


*x(r)  =  0  r  >Tb 


Therefore, 


and 


*x(r) 


l-M/7*  |r|<r* 

0  M  >  Tb 


(9.27a) 


Sx(f  )  =  7),  sine2  (njTh)  (9  27b) 

The  autocorrelation  lunction  and  the  PSD  of  this  process  are  shown  in  Fig.  9.9c  and  d. 
Observe  that  x2  =  Rx(0)  =  1,  as  expected. 


The  random  binary  process  described  in  Example  9.5  is  sometimes  known  as  the  telegraph 
signal.  This  process  also  coincides  with  the  polar  signaling  of  Sec.  7.2.2  when  the  pulse  shape 
is  a  rectangular  NRZ  pulse  (Fig.  7.2).  For  wide-sense  stationarity,  the  signal’s  initial  starting 
point  a  is  randomly  distributed. 

Let  us  now  consider  a  more  general  case  of  the  pulse  train  y(t),  discussed  in  Sec.  7.2 
(Fig.  7.4).  From  the  knowledge  of  the  PSD  of  this  train,  we  can  derive  the  PSD  of  on-off. 
polar,  bipolar,  duobinary,  split-phase,  and  many  more  important  digital  signals. 
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Example  9.6  Random  PAM  Pulse  Train 

A  basic  pulse  pit)  is  used  to  transmit  digital  data,  as  shown  in  Fig.  9. 1  Oa.  The  successive  pulses 
are  separated  by  7/,  seconds,  and  the  &th  pulse  is  a*/?(f),  where  a*  is  an  RV.  The  distance  a 
of  the  first  pulse  (corresponding  to  k  =  0)  from  the  origin  is  equally  likely  to  be  any  value 
in  the  range  (0,  7/,).  Find  the  autocorrelation  function  and  the  PSD  of  such  a  random  pulse 
train  y (r)  whose  sample  function  is  shown  in  Fig.  9.10b.  The  random  process  y it)  can  be 
described  as 


oo 

y(/)  =  ^  a kp(t  -  kTh  -  or) 
k=-oc 


Figure  9.10 

Random  PAM 
process. 


where  a ,  is  an  RV  uniformly  distributed  in  the  interval  (0,  Th).  Thus,  a  is  different  for  each 
sample  function.  Note  that _p(a)  =  1/7*.  over  the  interval  (0,  Th),  and  is  zero  everywhere 
else.*  It  can  be  shown  that  y(t)  =  (a k/Tb) is  a  constant.' 


pit) 

r 

0 

Tb  r— 

(a) 


y(t ) 

(Li 

^\71 

(b) 


We  have  the  expression 


Ry(r)  =  y(t)y(t  +  r) 


=  a kp(t-kTh-a)  ^  am/?(f  +  r  -  mTh  -  a) 

*=-oo  m=-°° 

00  00  _ 

=  -  kTb  -  "M*  +  T  -mTh -a) 

k=—oo  m=-oo 


*  If  a  =  0  the  process  can  be  expressed  as  y(r)  =  oo  a*P(/  "  kThl  ,n  th,s  case 

W7T  =  57  V00  n(t-  kTh)  is  not  constant,  but  is  periodic  with  period  Th.  Similarly,  we  can  show  that  the 

yto  “*/-*=- oo  i'  "...  ith  ,h  same  period  Th.  This  is  an  example  of  a  cyclostationary.  or  periodically 

autocorrelation  funf,on^^  a  shift  of  ,he  time  origin  by  integral  multiples  of  a 

stationary  process  (a .process  who*  stat ™ ^ ^  ^  wide.sense  stationary.  But  they  can  he  made 

wide^ense  stationary^w’ithfsHght  modification  by  adding  the  RV  «  in  the  expression  of  y,,).  as  in  this  example, 
^smg  exacfiy  the^ame  approach  to  be  shown  shortly  in  the  derivation  of  Eq.  (9.28)  we  can  show  that 

y{t)  =  (H/Tb)S%0p(t)dt. 
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Because  a*  and  a„,  are  independent  of  a , 


oo  oo 


Ry(  r)  =  XI  XI  a*a™  '  ^  -  -a)-p(t  +  r-  mTb  -  or) 

k=-OQ  m=-o o 

Both  £  and  m  are  integers.  Letting  m  =  /:  +  n.  this  expression  can  be  written 


Ry(t)=  X  X  ‘  p(t  ~  kTh  ~oi)-pO  +  T  ~[k  +  n]Tb  -  or) 


k=— oo  n=-oo 


The  first  term  under  the  double  sum  is  the  correlation  of  RVs  a*  and  a*+„  and  will  be 
denoted  by  TZtl.  The  second  term,  being  a  mean  with  respect  to  the  RV  a,  can  be  expressed 
as  an  integral.  Thus, 

~  ~  fTb 

Ry(r)  =  p(t  -  kTb  -  ot)p(t  +  t -[k  +  n]Tb  -  a)p(a)d<x 

n=-o o  k=- oo 

Recall  that  or  is  uniformly  distributed  over  the  interval  0  to  7V  Hence,  p(ct)  =  l/7i  over 
the  interval  (0,  7/,),  and  is  zero  otherwise.  Therefore, 


OO  00 


'  '  |  rib 

Ry(r)  =  X  W-X  /  p(t-kTb-a)p(t  +  T  ~[k+n]Tb-a)da 

n=^oo  kfoc  T»  Jo 
.  00  00 

-s  E^E/ 


n=-oo  k=—oo  * 


-(*+1)7* 


P(i8)p(^  +  T-/i7i)rf^ 


]  00  /’OO 

=  F  £  n''  P(P)p(P  +  T~nTb)dp 

1  h  J  —OO 


The  integral  on  the  right-hand  side  is  the  time  autocorrelation  function  of  the  pulse /KO 
with  the  argument  r  -  nT^.  Thus, 


where 


and 


fly(r)  =  —  X  *■  )MT  ~  nTb) 
1  h 


TZn  —  a^a^+w 


/oo 

pWp(H-r)* 

-oo 


(9.28) 


(9.29) 


(9.30) 


Example  9.7 


Figure  9. 1 1 

Basic  pulse  for  a 
random  binary 
polar  signal. 


9.3  Power  Spectral  Density  531 

As  seen  in  Eq.  (3.75),  if  p(t)  «=>  P(f ),  then  x/fpir)  <==>  \P(f)\2.  Therefore,  the  PSD  of 
y(/),  which  is  the  Fourier  transform  of  Ry( r),  is  given  by 

Sy(f)  = 


This  result  is  similar  to  that  found  in  Eq.  (7.1  lb).  The  only  difference  is  the  use  of  the 
ensemble  average  in  defining  7 Zn  in  this  chapter,  whereas  Rn  in  Chapter  7  is  the  time 
average. 


T  Y  ^n\P(f  )\2e~jn2nfrh 

Tb 


\P<f)\ 

Tb 


n=— 00 
2  00 


Y  n„e-jn2nfr» 


(9.31) 


Example  9.7  Find  the  PSD  Sy(f)  for  a  polar  binary  random  signal  where  1  is  transmitted  by  a  pulse  p(t) 
(Fig.  9.11)  whose  Fourier  transform  is  P(f),  and  0  is  transmitted  by  -p(t).  The  digits  1  and  0 
are  equally  likely,  and  one  digit  is  transmitted  every  Th  seconds.  Each  digit  is  independent  of 
the  other  digits. 


Figure  9.1 1 

Basic  pulse  for  a 
random  binary 
polar  signal. 


1 

Pit) 

[p 

0  1 

»  ' 

4 

4 

In  this  case,  a*  can  take  on  values  I  and  -1  with  probability  1/2  each.  Hence, 
a*  =  Y,  a*P(a*)  =  (l)Pa*(l)  +  (— l)Pa*(-l) 

*=t,-i 


TZo  =  a  1=  Y  a*r*( a*)  =  (l)2/>a*(l)  +  (-l)2/>a*(-l) 
*=l,-l 

=  |(i>2  +  |(-o2  =  1 

and  because  each  digit  is  independent  of  the  remaining  digits, 

Kn  =  a*a*+n  =  =  0  n  -  1 


Hence,  from  Eq.  (9.31), 


Sy(f)  = 


\P(f)\2 


Tb 
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We  already  found  this  result  in  Eq.  (7.13),  where  we  used  time  averaging  instead 
of  ensemble  averaging.  When  a  process  is  ergodic  of  second  order  (or  higher),  the 
ensemble  and  time  averages  yield  the  same  result.  Note  that  Example  9.5  is  a  special 
case  of  this  result,  where  p(t)  is  a  full-width  rectangular  pulse  ri(f/7/,)  with  P(f)  = 
Ti,  sine  (njTb),  and 


Example  9.8  Find  the  PSD  Sy(f)  for  on-off  and  bipolar  random  signals  which  use  a  basic  pulse  for  p(t),  as 


shown  in  Fig.  9.11.  The  digits  1  and  0  are  equally  likely,  and  digits  are  transmitted  every  Tb 
seconds.  Each  digit  is  independent  of  the  remaining  digits.  All  these  line  codes  are  described 
in  Sec.  7.2. 

In  each  case  we  shall  first  determine  TZo,  TZ\ ,  TZj,  ...,1Zn. 

(a)  On-off  signaling:  In  this  case,  a„  can  take  on  values  1  and  0  with  probability 

1  /2  each.  Hence, 


a*  =  (D/VD  +  (0)/vo)  =  -d)  +  -(0)  =  - 
72o  =  a2  =  (l)2/>fljl(i)  +  (0)2pai(0)  =  i(l)2  +  I(0)2  = *  l- 


and  because  each  digit  is  independent  of  the  remaining  digits, 


n  >  1 


Therefore,  from  Eq.  (9.31), 


(9.32a) 


(9.32b) 


Equation  (9.32b)  is  obtained  from  Eq.  (9.32a)  by  splitting  the  term  1  /2  corresponding  to 
7^o  into  two:  1  /4  outside  the  summation  and  1  /4  inside  the  summation  (corresponding  to 
n  —  0).  This  result  is  identical  to  Eq.  (7. 18b)  found  earlier  by  using  time  averages. 


9.3  Power  Spectral  Density 
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We  now  use  a  Poisson  summation  formula,* 


OO  ,  OC  /  \ 

— u  n —  —  N 


Substitution  of  this  result  into  Eq.  (9.32b)  yields 


(9.32c) 


Note  that  the  spectrum  Sy(f)  consists  of  both  a  discrete  and  a  continuous  part.  A  discrete 
component  of  clock  frequency  (/?/,  =  l/Th)  is  present  in  the  spectrum.  The  continuous 
component  of  the  spectrum  | P(f) \2/4Th  is  identical  (except  for  a  scaling  factor  1  /4)  to  the 
spectrum  of  the  polar  signal  in  Example  9.7.  This  is  a  logical  result  because  as  we  showed 
earlier  (Fig.  7.3),  an  on-off  signal  can  be  expressed  as  a  sum  of  a  polar  and  a  periodic- 
component.  The  polar  component  is  exactly  half  the  polar  signal  discussed  earlier.  Hence, 
the  PSD  of  this  component  is  one-fourth  of  the  PSD  of  the  polar  signal.  The  periodic 
component  is  of  clock  frequency  /?/>,  and  consists  ot  discrete  components  ot  frequency  /?/, 
and  its  harmonics. 

(b)  Bipolar  signaling:  in  this  case,  a*  can  take  on  values  0,  1,  and  -1  with 
probabilities  1/2,  1/4.  and  Irrespectively.  Hence, 


aj;  =  (0)Pa*  (0)  +  ( 1  )Pat  ( • )  +  (—  1  )Au  (-  1 ) 


=  ^<0)+i<l)  +  j(-l)  =  0 

TZ0  =  =  (0)~Pak( 0)  +  0)2^< akW  +  (“1) 


Also, 


TZ\  =  a*a*+i  =  akak+\Pnk*k+\  ( akak+\ ) 

k  *+ 1 


Because  and can 


terms,  of  which  only  four  terms  (corresponding  to  values  ±1  for  ak  and  a*+i)  are 
Thus, 

7 l\  =  (l)(l)Patai+|(l'  1)  +  (-lKD/’atat+iC-l.  •) 

+  ( 1 ) ( —  1  J^a* a*+J ( 1  “*)  +  (-•)(-l)f>aiat+|(-1’ 


*  The  impulse 

8(t-nTh)  <=> 
alternate  form 
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Because  of  the  bipolar  rule, 

and 

»  =  M-D/V.MH  -  1)  =  (5)  (5)  =  l 

Similarly,  we  find  Pa* a*+jO,  -1)  =  1/8.  Substitution  of  these  values  in  TZ\  yields 


For  n  >  2,  the  pulse  strengths  a*  and  a*+i  become  independent.  Hence, 

Tln  =  a* a k+n  =  a*  a^  =  (0)(0)  =  0  n>  2 

Substitution  of  these  values  in  Eq.  (9.31)  and  noting  that  7 ln  is  an  even  function  of  n, 
yields 

sy  if)  =  — ~ —  sin  (nJTb) 

Tb 

This  result  is  identical  to  Eq.  (7.21b),  found  earlier  by  using  time  averages. 


9.4  MULTIPLE  RANDOM  PROCESSES 

For  two  real  random  processes  x(t)  and  y (/),  we  define  the  cross-correlation  function 
Pxy(ti,  t2)  as 


^xy(ti,  *2)  =x(t\)y(t2)  (9.33a) 

The  two  processes  are  said  to  be  jointly  stationary  (in  the  wide  sense)  if  each  of  the 
processes  is  individually  wide-sense  stationary  and  if 

^xy (t\ ,  t2)  =  R\y(t2  —  t\) 

=  Rx  y(r)  (9.33b) 

Uncorrelated,  Orthogonal  (Incoherent),  and  Independent  Processes 

Two  processes  x(/)  and  y(t)  are  said  to  be  uncorrelated  if  their  cross-correlation  function  is 
equal  to  the  product  of  their  means;  that  is, 

Pxy(f)  =  x(t)y(t  +  t)  =  xy  (9.34) 

This  implies  that  RVs  x(r)  and  y(f  +  r)  are  uncorrelated  for  all  t  and  r. 


For  complex  random  processes,  the  cross-correlation  function  is  defined  as 


*xy(0,  <2)  =x*0i)y(t2) 
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Processes  x(r)  and  y(t)  are  said  to  be  incoherent,  or  orthogonal,  if 

tfxy(r)  =  0  (9.35) 

Incoherent,  or  orthogonal,  processes  are  uncorrelated  processes  with  x  and/or  y  =  0. 

Processes  \(t)  and  y (t)  are  independent  random  processes  if  the  random  variables  x(fj) 
and  y(f2)  are  independent  for  all  possible  choices  of  t\  and  ti, 

Cross-Power  Spectral  Density 

We  define  the  cross-power  spectral  density  Sxy  (f)  for  two  random  processes  x(f)  and  y(f)  as 


S\y{f)  = 


lim 

r^oo 


X^pYrif) 

T 


(9.36) 


where  Xj{f)  and  Y  rif)  are  the  Fourier  transforms  of  the  truncated  processes  \(t)  U(t/T)  and 
y(r)  n  (t/T),  respectively.  Proceeding  along  the  lines  of  the  derivation  of  Eq.  (9.16),  it  can  be 
shown  that* 


R\y(*)  <=>  Sxy(f)  (9.37a) 

It  can  be  seen  from  Eqs.  (9.33)  that  for  real  random  processes  x(/)  and  y(r), 


flxy(T)  =  *yx(-r)  (937b) 

Therefore, 

Sxy(f)  =  SyA-f)  (9.37c) 

9.5  TRANSMISSION  OF  RANDOM  PROCESSES 
THROUGH  LINEAR  SYSTEMS 

If  a  random  process  x(f)  is  applied  at  the  input  of  stable  linear  time-invariant  system  (Fig.  9. 1 2) 
with  transfer  function  H{f),  we  can  determine  the  autocorrelation  function  and  the  PSD  of  the 
output  process  y  (t).  We  now  show  that 

Ry(r)  =  h( r)  *  h(-r)  *  Rx( r)  (9.38) 


and 

Sy(f)  =  \H(f)\2Sxtf)  (9.39) 


To  prove  this,  we  observe  that 


h(a)\(t  —  a)  da 


*  Equation  (9.37a)  is  valid  for  complex  processes  as  well. 
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Figure  9.12 

Transmission  of  a 
random  process 
through  a  linear 
time-invariant 
system. 


x(f) 

H(f) 

y(0 

- ^ - 

hit) 

Figure  9.1 3 

Representation  of 
thermal  noise  in 
a  resistor. 


and 


/1 00 

h(a)x(t  +  r  —  or)  dot 

-00 


Hence,* 


/00  rOO 

h(a)x(t  -  a)  da  l  h(fi)x(t  +  r  - 

-00  J -00 

/OC  poo 

/  h(a)h(fi)x(t  -  a)x(t  +  t  -p)dadp 

-ooJ-oo 

/00  pOQ 

/  h(a)h(P)Rx(T+a-P)dadp 

-OOJ-OO 


P)dfi 


This  double  integral  is  precisely  the  double  convolution  /t(r  )*/t(-  r )  */?x  (r ).  Hence,  Eqs.  (9.38) 
and  (9.39)  follow. 


Example  9 .9  Random  thermal  motion  of  electrons  in  a  resistor/?  causes  a  random  voltage  across  its  terminals. 

This  voltage  n(f)  is  known  as  the  thermal  noise.  Its  PSD  Sn(f)  is  practically  flat  over  a  very 
large  band  (up  to  1000  GHz  at  room  temperature)  and  is  given  by1 

Sn(f)  =  2kTR  (9-40) 

where  k  is  the  Boltzmann  constant  (1.38  x  1 0~23)  and  T  is  the  ambient  temperature  in  kelvins. 
A  resistor  R  at  a  temperature  T  kelvin  can  be  represented  by  a  noiseless  resistor  R  in  series 
with  a  random  white  noise  voltage  source  (thermal  noise)  having  a  PSD  of  2kTR  (Fig.  9.13a). 
Observe  that  the  thermal  noise  power  over  a  band  A /  is  (2kTR)  2  A f  =  4kTRAf. 

Let  us  calculate  the  thermal  noise  voltage  (rms  value)  across  the  simple  RC  circuit  in 
Fig.  9.13b.  resistor  R  is  replaced  by  an  equivalent  noiseless  resistor  in  series  with 
the  thermal  voltage  source.  The  transfer  function  H(f)  relating  the  voltage  v„  at 
terminals  a-b  the  thermal  noise  voltage  is  given  by 

H(f)  =  _}/j2nfC  _  1 

R  +  \/j2nfC  1  +  j2nfRC 


*  In  (his  development,  we  interchange  the  operations  of  averaging  and  integrating.  Because  averaging  is  really 
an  operation  of  integration,  we  are  really  changing  the  order  of  integration,  and  we  assume  that  such  a  change 
is  permissible. 
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Figure  9.1 3 

Representation  of 
thermal  noise  in 
a  resistor. 


Noiseless 


Sn(/)  =  2k  TR 


Noiseless  . 
R 

Sn(f)i 


(a) 


(b) 


(c) 


a 

-o 


-o 

b 


If  So(f)  is  the  PSD  of  the  voltage  \Q ,  then  from  Eq.  (9.39)  we  have 

2 

So(f)=  - 2  kTR 


1  +  jlnfRC 
2kTR 

1  +  4n2f2R2C2 


The  mean  square  value  \2  is  given  by 


—  rc 

v2°=f 

j — c 


2kTR 


I  +  4  7i2f2R2C 


~2df 


kT 

C 


(9.41) 


Hence,  the  rms  thermal  noise  voltage  across  the  capacitor  is  y  A T  j C . 


Sum  of  Random  Processes 

If  two  stationary  processes  (at  least  in  the  wide  sense)  x(f)  and  y(t)  are  added  to  form  a  process 
z(r),  the  statistics  of  z(t)  can  be  determined  in  terms  of  those  of  x(r)  and  y(t).  If 

z(r)  =  x(f)  +  y(/)  (9.42a) 


then 

rz(  T)  =  z(r)z(/  +  r)  =  [x(r)  +  y(/)][x(f  +  T)  +  y(f  +  T)] 
=  /?x(r)  +  Ry(T)  +  Rxy(r)  +  Ryx(*) 


If  x(r)  and  y (t)  are  uncorrelated,  then  from  Eq.  (9.34), 

Rx  y(r)  =  Ryx(r )  =  xy 


(9.42b) 


and 


/?z(r)  =  /?x(r)  +  /?y(r)  +  2xy 


(9.43) 


Most  processes  of  interest  in  con.mnnic.tion  problems  have  zero  means.  If  processes  x«)  and 
y(t)  are  uncorrelated  with  either  x  or  y  -  0  (that  is.  if  x(r)  and  y(0  are  incoherent),  then 

/?z(t)  =  /?x(r)  +  Ry(r) 


(9.44a) 
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and 

SZ(f)=Sx(f)+Sy(f) 

It  also  follows  from  Eqs.  (9.44a)  and  (9. 19)  that 

z2  =  x2  +  y2 


(9.44b) 


(9.44c) 


Hence,  the  mean  square  of  a  sum  of  incoherent  (or  orthogonal)  processes  is  equal  to  the  sum 
of  the  mean  squares  of  these  processes. 


Example  9. 1  0  Two  independent  random  voltage  processes  xi  (/)  and  X2 (r)  are  applied  to  an  RC  network,  as 
shown  in  Fig.  9. 14.  It  is  given  that 


SM{f)  =  K 


Sx2<f)  = 


2a 

a2  +  (2  7if)2 


Determine  the  PSD  and  the  power  Py  of  the  output  random  process  y(t).  Assume  that  the 
resistors  in  the  circuit  contribute  negligible  thermal  noise  (i.e.,  assume  that  they  are  noiseless). 


Figure  9.14 

Noise 

calculations  in  a 
resistive  circuit. 


x2(0 


Because  the  network  is  linear,  the  output  voltage  y (t)  can  be  expressed  as 

y(0  =  yi(0  +  y2(r) 

where  yi  (f)  is  the  output  from  input  xj(r)  [assuming  x2(/)  =  0]  and  y2(r)  is  the  output 
from  input  x2(r)  [assuming  x i  (r)  =  0].  The  transfer  functions  relating  y (f)  to  xi(0 
x2(r)  are  H\  (f)  and  //2(f),  respectively,  given  by 

l(f)  3(3  -j2nf  +  1)  Hl{f)  =  2(3  -jlnf  +  1) 

Hence, 


syi(f)  =  \H\(f)\2SXl(f)  = 


K 

9[9(2tt/)2  +  1] 


Sy2(f)  =  \H2{f)\2SX2(f) 


_ a 

2[9(2  jzf)2  +  1  ][ce2  +  (27r/)2] 


and 
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Because  the  input  processes  xi  (r)  and  X2(/)  are  independent,  the  outputs  yi(f)  and  y2 (t) 
generated  by  them  will  also  be  independent.  Also,  the  PSDs  of  y i  (/)  and  y2(0  have  no 
impulses  at/  =  0,  implying  that  they  have  no  dc  components  [i.e.,  yi(f)  =  y2(0  =  0]. 
Hence,  y  i  ( t )  and  y2(0  are  incoherent,  and 


Sy(f)  =  Syi(f)  +  Sy2(f) 

2  K[a2  +  (2  nf)2]  +  9a 
~  18[9(2tt/)2  +  \][a2  +  (2nf)2] 


The  power  Py  (or  the  mean  square  value  y2)  can  be  determined  in  two  ways.  We  can  find 
Ry(z)  by  taking  the  inverse  transforms  of  Syi  if)  and  Sy2(f)  as 


*y<*>  =  ^HT|/3 


+ 


3a  -  e““|r| 
4(9a2  —  1) 


Ry,(r)  Rn(  x) 


and 


K 


Py=y2=/?y(0)=-  + 


3a  —  1 
4(9a2  —  1) 


Alternatively,  we  can  determine  y~  by  integrating  Sy(f )  with  respect  to/  [see  Eq.  (9. 19)|. 


9.6  APPLICATION:  OPTIMUM  FILTERING 
(WIENER-HOPF  FILTER) 

When  a  desired  signal  is  mixed  with  noise,  the  SNR  can  be  improved  by  passing  it  through  a 
filter  that  suppresses  frequency  components  where  the  signal  is  weak  but  the  noise  is  strong. 
The  SNR  improvement  in  this  case  can  be  explained  qualitatively  by  considering  a  case  of 
white  noise  mixed  with  a  signal  m(f)  whose  PSD  decreases  at  high  frequencies.  If  the  filter 
attenuates  higher  frequencies  more,  the  signal  will  be  reduced-in  fact,  distorted.  The  distortion 
component  rn.it)  may  be  considered  to  be  as  bad  as  added  noise.  Thus,  attenuation  of  higher 
freauencies  will  cause  additional  noise  (from  signal  distortion),  but,  in  compensation,  it  will 
reduce  the  channel  noise,  which  is  strong  at  high  frequencies.  Because  at  higher  frequences 
the  signal  has  a  small  power  content,  the  distortion  component  will  be  small  compared  to  the 
reduction  in  channel  noise,  and  the  total  distortion  may  be  smaller  than  before. 

Let  H  ,(f)  be  the  optimum  filter  (Fig.  9. 15a).  This  filter,  not  being  ideal,  will  cause  signa 
distortion.  The  distortion  signal  m «(f)  can  be  found  from  Fig.  9.15b.  The  distortion  signal 
power  No  appearing  at  the  output  is  given  by 


/OO  ~ 

Sm(f)\Hopt(f)-l\2df 

-OO 
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Figure  9.15 

Wiener-Hopf 
filter  operation. 


m(/)  +  n(/) 

H„p ,(/) 

m(f)  +  me(/)  +  nch(r) 
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m(r)  +  m  6(f) 
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Hop,(/)-  I 


m€(r) 


(b) 


where  Sm(f)  is  the  signal  PSD  at  the  input  of  the  receiving  filter.  The  channel  noise  power  /VCh 
appearing  at  the  filter  output  is  given  by 


A'ch 


J—C 


Sn{f)\Hopt(f)\-df 


where  Sn(f)  is  the  noise  PSD  appearing  at  the  input  of  the  receiving  filter.  The  distortion 
component  acts  as  a  noise.  Because  the  signal  and  the  channel  noise  are  incoherent,  the  total 
noise  N0  at  the  receiving  filter  output  is  the  sum  of  the  channel  noise  /Vch  and  the  distortion 
noise  No, 


N()  =  jVch  +  No 

/oo  r 

^  [|tfop,(/')|2Sn(n  +  \Hopt(f)-  1 12  ^(Z")]  df  (9.45a) 

Since  | A  +  B\~  =  (A  +  B)(A*  +  B *),  and  both  Sm(f)  and  Sn(f)  are  real,  we  can  rearrange 
Eq.  (9.45a)  as 


r 

N0  = 

J -c 


Sm{f) 


Stf) 


Sr(f)  + 


Sm(f)Sn(f) 

Sr(f) 


df 


(9.45b) 


where  Sr(f )  —  Sm(f)  +  Sn(f).  The  integrand  on  the  right-hand  side  of  Eq.  (9.45b)  is  non¬ 
negative.  Moreover,  it  is  a  sum  of  two  nonnegative  terms.  Hence,  to  minimize  No*  we  mus* 
minimize  each  term.  Because  the  second  term  Sm(f)Sn(f)/Sr(f)  is  independent  of  //opt(A 
only  the  first  term  can  be  minimized.  From  Eq.  (9.45b)  it  is  obvious  that  this  term  is  minimum 
at  zero  when 


Sm(f) 

~SAfj 

Sm{f) 


Sm{f)  +  Sn(f ) 

For  this  optimum  choice,  the  output  noise  power  N0  is  given  by 

Nq  =  Sn{f)Sn(f) 

J—c 


df 


-L 


Sr(f) 

-oo  Sm{f)  +  Sn(f) 


oo 

00 


df 


(9.46a) 


(9.46b) 
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The  optimum  filter  is  known  in  the  literature  as  the  Wiener-Hopf  filter.  Equation  (9.46a) 
shows  that  Hopt(f)  %  1  (no  attenuation)  when  Sm(f )  »  S„(f).  But  when  Sm(f)  <SC  S„(f ), 
the  filter  has  high  attenuation.  In  other  words,  the  optimum  filter  attenuates  heavily  the  band 
where  noise  is  relatively  stronger.  This  causes  some  signal  distortion,  but  at  the  same  time  the 
overall  SNR  is  improved  because  the  noise  is  attenuated  more  heavily. 

Comments  on  the  Optimum  Filter 

If  the  SNR  at  the  filter  input  is  reasonably  large — for  example,  Sm^)  >  100 Sn{f)  (SNR 
of  20  dB) — the  optimum  filter  [Eq.  (9.46a)]  in  this  case  is  practically  an  ideal  filter,  and  N„ 
[Eq.  (9.46b)]  is  given  by 


Hence  for  a  large  input  SNR,  optimization  yields  insignificant  improvement.  The  Wiener-Hopf 
filter  is  therefore  practical  only  when  the  input  SNR  is  small  (large-noise  case). 

Another  issue  is  the  realizability  ot  the  optimum  filter  in  Eq.  (9.46a).  Because  Sm(J )  and 
S„(f)  are  both  even  functions  of/,  the  optimum  filter  Hopt(f)  is  an  even  function  off.  Hence, 
the  unit  impulse  response  /iop,(0  is  an  even  function  of  t  (see  Prob.  3.1-1).  This  makes  h„pt(t) 
noncausal  and  the  filter  unrealizable.  As  noted  earlier,  such  a  filtercan  be  realized  approximately 
if  we  are  willing  to  tolerate  some  delay  in  the  output.  If  delay  cannot  be  tolerated,  the  derivation 
of  Hopt(f)  must  be  repeated  with  a  realizability  constraint.  Note  that  the  realizable  optimum 
filter  can  never  be  superior  to  the  unrealizable  optimum  filter  |Eq.  (9.46a)].  Thus,  the  filter  in 
Eq.  (9.46a)  gives  the  upper  bound  on  performance  (output  SNR).  Discussions  ol  realizable 
optimum  filters  can  be  readily  found  in  the  literature.1'  * 


Example  9.11a  random  process  m(t)  (the  signal)  is  mixed  with  a  white  channel  noise  n(r).  Given 


2a 


and  S„(f)  =  — 


Sm(2f)  a2  +  (2  rtf)2 


find  the  Wiener-Hopf  filter  to  maximize  the  SNR.  Find  the  resulting  output  noise  power  N„. 
From  Eq.  (9.46a), 


W°P,(0  "  4a+.V|a2  +  (2jr/)^] 

Ary  *  4 Of  j 

=  ^  =  77+«2 


"  M/2  +  W)21  X 


(9.47a) 


Hence, 


(9.47b) 


delayed  version  (Fig.  9. loo)  o*  mis  uuci,  mm  ,  opiv 
make  f0  >  3 If  and  eliminate  the  tail  for  /  <  0  (Fig.  9. 16c). 
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Figure  9.16 

Using  delay  to 
achieve  a  close 
realization  of  an 
unrealizable 
filter. 


The  output  noise  power  N0  is  [Eq.  (9.46b)] 

Nn=  r  20  jf  « 

Jo  P2  +  (2jt/)2  1  p  +  {4a /N) 


(9.48) 
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9.7  APPLICATION:  PERFORMANCE  ANALYSIS  OF 
BASEBAND  ANALOG  SYSTEMS 

We  now  apply  the  concept  of  power  spectral  density  (PSD)  to  analyze  the  performance  of 
baseband  analog  communication  systems.  In  analog  signals,  the  SNR  is  basic  in  specifying  the 
signal  quality.  For  voice  signals,  an  SNR  of  5  to  1 0  dB  at  the  receiver  implies  a  barely  intelligible 
signal.  Telephone-quality  signals  have  an  SNR  of  25  to  35  dB,  whereas  for  television,  an  SNR 
of  45  to  55  dB  is  required. 

Figure  9.17  shows  a  simple  communication  system  in  which  analog  signal  m(t)  is  trans¬ 
mitted  at  power  ST  through  a  channel  (representing  a  transmission  medium).  The  transmitted 
signal  is  corrupted  by  additive  channel  noise  during  transmission.  The  channel  also  attenuates 
(and  may  also  distort)  the  signal.  At  the  receiver  input,  we  have  a  signal  mixed  with  noise.  The 
signal  and  noise  powers  at  the  receiver  input  are  S,  and  N„  respectively. 

The  receiver  processes  (filters)  the  signal  to  yield  the  output  sa(t)  +  na(t).  n0(t )  came  from 
processing  n(t)  by  the  receiver  while  s0(t)  came  from  the  message  m(t).  The  signal  and  noise 
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Figure  9. 1 7 
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Channel  noise 


powers  at  the  receiver  output  are  S0  and  N(„  respectively.  In  analog  systems,  the  quality  of  the 
received  signal  is  determined  by  S0/N0,  the  output  SNR.  Hence,  we  shall  focus  our  attention 
on  this  figure  of  merit  under  either  a  fixed  transmission  power  St  or  for  a  given  S,. 

In  baseband  systems,  the  signal  is  transmitted  directly  without  any  modulation.  This  mode 
of  communication  is  suitable  over  a  pair  of  twisted  wires  or  coaxial  cables.  It  is  mainly  used 
in  short-haul  links.  For  a  baseband  system,  the  transmitter  and  the  receiver  are  ideal  baseband 
filters  (Fig.  9.18).  The  ideal  low-pass  transmitter  limits  the  input  signal  spectrum  to  a  given 
bandwidth,  whereas  the  low-pass  receiver  eliminates  the  out-of-band  noise  and  other  channel 
interference.  (More  elaborate  transmitter  and  receiver  filters  can  be  used,  as  shown  in  the  next 
section.) 

The  baseband  signal  m(r)  is  assumed  to  be  a  zero  mean,  wide-sense  stationary  random 
process  band-limited  to  B  Hz.  We  consider  the  case  of  ideal  low-pass  (or  baseband)  filters 
with  bandwidth  B  at  the  transmitter  and  the  receiver  (Fig.  9. 1 7).  The  channel  is  assumed  to  be 
distortionless.  The  power,  or  the  mean  square  value,  ot  m(/)  is  m~,  given  by 

Si  =  mI  =  2  fB Sm«f))df  (9.49) 

Jo 


For  this  case, 


S0  =  St 


(9.50a) 


and 


(9.50b) 


where  Sn(f )  is  the  PSD  of  the  channel  noise.  For  the  case  of  a  white  noise,  Sn(f)  =  M /2,  and 


N0  =  2jBjdf=AfB 


(9.50c) 


S0  _  Si 

IT,,-  MB 


(9.50d) 
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We  define  a  parameter  y  as 

(9.51) 


(9.52) 

The  parameter  y  is  directly  proportional  to  S,  and,  therefore,  directly  proportional  to  Sj .  Hence, 
a  given  St  (or  5,)  implies  a  given  y .  Equation  (9.52)  is  precisely  the  result  we  are  looking  for. 
It  gives  the  receiver  output  SNR  for  a  given  St  (or  S,). 

The  value  of  the  SNR  in  Eq.  (9.52)  will  serve  as  a  benchmark  against  which  the  output 
SNR  of  other  modulation  systems  will  be  measured  in  the  next  chapter. 


From  Eqs.  (9.50d)  and  (9.5 1 )  we  have 


Y  = 


So 


Si_ 

AfB 


9.8  APPLICATION:  OPTIMUM 

PREEMPHASIS-DEEMPHASIS  SYSTEMS 

It  is  possible  to  increase  the  output  SNR  by  deliberate  distortion  of  the  transmitted  signal  (pre¬ 
emphasis)  and  the  corresponding  compensation  (deemphasis)  at  the  receiver.  For  an  intuitive 
understanding  of  this  process,  consider  a  case  of  white  channel  noise  and  a  signal  m(f)  whose 
PSD  decreases  with  frequency.  In  this  case,  we  can  boost  the  high-frequency  components  of 
m(r)  at  the  transmitter  (preemphasis).  Because  the  signal  has  relatively  less  power  at  high 
frequencies,  this  preemphasis  will  require  only  a  small  increase  in  transmitted  power.*  At  the 
receiver,  the  high-frequency  components  are  attenuated  (or  deemphasized)  in  order  to  undo 
the  preemphasis  at  the  transmitter.  This  will  restore  the  useful  signal  to  its  original  form.  The 
channel  noise  receives  an  entirely  different  treatment.  Because  the  noise  is  added  after  the 
transmitter,  it  does  not  undergo  preemphasis.  At  the  receiver,  however,  it  does  undergo  deem¬ 
phasis  (i.e.,  attenuation  of  high-frequency  components).  Thus,  at  the  receiver  output,  the  signal 
power  is  restored  but  the  noise  power  is  reduced.  The  output  SNR  is  therefore  increased. 

In  this  section,  we  consider  a  baseband  system.  The  extension  of  preemphasis  and  deem¬ 
phasis  to  modulated  systems  will  be  carried  out  in  the  next  chapter.  A  baseband  system  with  a 
preemphasis  filter  Hp(f)  at  the  transmitter  and  the  corresponding  complementary  deemphasis 
filter  Hd{f)  at  the  receiver  is  shown  in  Fig.  9.18.  The  channel  transfer  function  is  Hc(f)<  and 
the  PSD  of  the  input  signal  m(f)  is  Sm(f).  We  shall  determine  the  optimum  preemphasis- 
deemphasis  (PDE)  filters  Hp(f)  and  Hd(f)  required  for  distortionless  transmission  of  the 
signal  m(f). 

For  distortionless  transmission, 

\Hp(f)Hc{f)Hd(f)\  =  G  (a  constant)  (9-53a) 

and 

Op(f)  +  0c(f)  +  Odif)  =  — 2.71  fid  (9.53b) 

We  want  to  maximize  the  output  SNR,  S()/NG ,  for  a  given  transmitted  power  St- 

*  Actually,  the  transmitted  power  is  maintained  constant  by  attenuating  the  preemphasized  signal  slightly. 
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Referring  to  Fig.  9.18,  we  have 


(9.54a) 


Because  Hp(f)Hc(f)Hd(f)  =  G  exp(-/27r/f</),  the  signal  power  S()  at  the  receiver  output  is 


(9.54b) 


The  noise  power  N()  at  the  receiver  output  is 


00 


Sn(f  )\Ht,(f)\2  df 


(9.54c) 


-00 


Thus, 


S0  _  G2fTxSm(f)df 
N0  f?xS„{f)\Hd[f)\2df 


(9.55) 


We  wish  to  maximize  this  ratio  subject  to  the  condition  in  Eq.  (9.54a)  with  St  as  a  given 
constant.  Applying  this  power  limitation  makes  the  design  ot  Hp(j )  a  well-posed  problem, 
foi*  otherwise  filters  with  larger  gains  will  always  be  better.  We  can  include  this  constraint  by 
multiplying  the  numerator  and  the  denominator  of  the  right-hand  side  of  Eq.  (9.55)  by  the 
left-hand  side  and  the  right-hand  side,  respectively,  of  Eq.  (9.54a).  This  gives 


G2ST  rxSm(f)4f 


S0  GStJ_00Z  m 

TT  =  roc  „  ,r,n  ,r  r^O 


(9.56) 


No  ~  rxSn(f)\Hd(f)pdf  /f^,  Sm(f)\Hp(f)\2  df 


The  numerator  of  the  right-hand  side  of  Eq.  (9.56)  is  fixed  and  unaffected  by  the  PDE  filters. 
Hence,  to  maximize  S0/N0,  we  need  only  minimize  the  denominator  of  the  right-hand  side  of 
Eq.  (9.56).  To  do  this,  we  use  the  Cauchy-Schwarz  inequality  (Appendix  B), 


2 


/  [Sm(f)Sn(f)}'/2\Hp(f)Hj(f)\df 


(9.57) 


> 


\J-00 


The  equality  holds  if  and  only  it 


Sm{f)\Hp{f)\2  =  K2Sn(f)\Hj(f)\2 


(9.58) 


where  K  is  an  arbitrary  constant.  Thus  to  maximize  S0/N„,  Eq.  (9.58)  must  be  satisfied 


Substitution  of  Eq.  (9.53a)  into  Eq.  (9.58)  yields 


(9.59a) 


2  G  JSjJJ/S^fj 
IW^Iopt  -  K  \Hc(f)\ 


(9.59b) 
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The  constant  K  is  found  by  substituting  Eq.  (9.59a)  into  the  power  constraint  of  Eq.  (9.54a)  as 


K  = 


St 


G  Ov/W)W)/|/W)l]# 

Substitution  of  this  value  of  K  into  Eqs.  (9.59a,  b)  yields 


\Hp(f)\L  = 


\Hd(f)\2op{  = 


St  y/ Sn{f)/Sm(f) 


\Hc(f)\  /^0[V5m(/')5n(/')/l«<(/:)l]^/ 
G2  no[^m(/)5n(r)/|//.  (/')|]^/ 
ST\Hc(f)\  y/W)/S^f) 


(9.59c) 


(9.60a) 

(9.60b) 


The  output  SNR  under  optimum  conditions  is  given  by  Eq.  (9.56)  with  its  denominator  replaced 
with  the  right-hand  side  of  Eq.  (9.57).  Finally,  substituting  \Hp(f)Hj(f)\  =  G/\Hc(f)\  leads  to 


=  ^ 
01,1  (f)Sn  (0/l«. mUf)' 

Equations  (9.60a)  and  (9.60b)  give  the  magnitudes  of  the  optimum  filters  Hp(f )  and  Hd(f)- 
The  phase  functions  must  be  chosen  to  satisfy  the  condition  of  distortionless  transmission 
[Eq.  (9.53b)]. 

Observe  that  the  preemphasis  filter  in  Eq.  (9.59a)  boosts  frequency  components  where 
the  signal  is  weak  and  suppresses  frequency  components  where  the  signal  is  strong.  The 
deemphasis  filter  in  Eq.  (9.59b)  does  exactly  the  opposite.  Thus,  the  signal  is  unchanged  but 
the  noise  is  reduced. 


Example  9.12  Consider  the  case  of  or  =  1400nr,  and 


Sm{f  ) 


c 

(2rr/)2  +  a2 

0 


I/I  <  4000 
I/I  >  4000 


The  channel  noise  is  white  with  PSD 


(9.61a) 


Sn(f)  =  y  (9-61b> 

The  channel  is  assumed  to  be  ideal  [Hc(f)  =  1  and  G=l]  over  the  band  of  interest 
(0-4000  Hz). 


Without  preemphasis-deemphasis,  we  have 

(•4000 


=  2  / 

Jo 


=  10_4C 


4000 

4000 


Sm(f)df 
C 


(2nf)2  +  a2 


df 


a  =  1400:r 
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Also,  because  G  =  1,  the  transmitted  power  Sr  =  SOJ 

S„  =  ST  =  10-4c 

and  the  noise  power  without  preemphasis-deemphasis  is 


N„  =  AT B  =  4000./V 


Therefore, 


So  o  C 

—  =  2.5  x  10-8-7 

No  AT 

The  optimum  transmitting  and  receiving  filters  are  given  by  [Eqs.  (9.60a  and  b)| 


(9.62) 


2  _  10~4y/(2 ;r/)2  +a2  _  X-lifiy/jlnf)2  +  or 

'Hptfn  =  (i ~  104 

104  1-00  (l /v/(2rr/)2+a2)  df  _  0,778  x  IQ4 


I/I  <  4000 

(9.63a) 


\Hd(f)\2  = 


y/(2  ^/)2  +  «2 


\/  (2?r/)2  +  a2 


I/I  <  4000 


(9.63b) 


The  output  SNR  using  optimum  preemphasis  and  deemphasis  in  found  from  Eq.  (9.60c)  as 

(10-4C)2 


= 

V  No  )  opt 


opt  (Afc/ 2)  [/5JSo  [l/\/  4rr2/2  +  ( I4007T)2]  df] 


=  3.3  x  lO-8-^ 

N 


(9.64) 


Comparison  of  Eq.  (9.62)  with  Eq.  (9.64)  shows  that  preemphasis-deemphasis  has 
increased  the  output  SNR  by  a  factor  of  1 .32. 


9.9  BANDPASS  RANDOM  PROCESSES 

If  the  PSD  of  a  random  process  is  confined  to  a  certain  passband  (Fig.  9.19),  the  process  is 
a  bandpass  random  process.  Bandpass  random  processes  can  be  used  effectively  to  model 
modulated  communication  signals  and  bandpass  noises.  Just  as  a  bandpass  s.gnal  can  be 
represented  in  terms  of  quadrature  components  [see  Eq.  (3.40)],  we  can  express  a  bandpass 
random  process  x(f)  in  terms  of  quadrature  components  as  follows: 


x(r)  =  xc(f)  cos  0)ct  -I-  \s(t)  sin  cvct 


(9.65) 
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Figure  9.19 
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bandpass 

random  — 

W) 

process. 

~fc  o 

UA  -1 
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Figure  9.20 

(a)  Equivalent 
circuit  of  an 
ideal  bandpass 
filter,  (b)  Ideal 
low-pass  filter 
frequency 
response. 

(c)  Ideal 
bandpass  filter 
frequency 
response. 


2  cos  (<t)ct  +  6) 


cos  ( u)ct  +  0) 


Ideal 

\  > 

low-pass 

M0  ^ 

\ _ 

'  «, 

filter 

H0 (/) 

i 

W) 


(b) 


H(f) 

1 

- - 2  B - - 

1 

~fc 

fc 

/— 

(c) 


In  this  representation,  \c  (t)  is  known  as  the  in-phase  component  and  x5(0  is  known  as  the 
quadrature  component  of  the  bandpass  random  process. 

This  can  be  proved  by  considering  the  system  in  Fig.  9.20a,  where  Hq(J)  is  an  ideal 
low-pass  filter  (Fig.  9.20b)  with  unit  impulse  response  /?0(r).  First  we  show  that  the  system 
in  Fig.  9.20a  is  an  ideal  bandpass  filter  with  the  transfer  function  H(f )  shown  in  Fig.  9.20c. 
This  can  be  conveniently  done  by  computing  the  response  h(t)  to  the  unit  impulse  input  $(*)• 
Because  the  system  contains  time-varying  multipliers,  however,  we  must  also  test  whether 
it  is  a  time-varying  or  a  time-invariant  system.  It  is  therefore  appropriate  to  consider  the 
system  response  to  an  input  <$(/  -  a).  This  is  an  impulse  at  t  =  a.  Using  the  fact  (see 
Eq.  (2.18b)]  that  /  (r)  8(t  —  a)  =/(a)5(f  —  a),  we  can  express  the  signals  at  various  point 
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as  follows: 

Signal  at 

a\  : 

cos  (coca  +  0)  S(t  —  a) 

a2  : 

sin  (coca  +  0)  8(t  —  a) 

b\  : 

cos  ( coca  +  0)ho(t  —  a) 

b2  * 

sin  ( a)ca  +  0)ho(t  —  a) 

c\  : 

cos  (coca  +  0)  cos  {coct  +  0)ho(t  -  a) 

C2  : 

sin  (coca  +  0)  sin  (coct  +  0)ho(t  —  a) 

d  : 

ho(t  —  a)  [cos  (cuca  +  0)  cos  (coct  +  0)  -F  sin  (coca  +  0)  sin  ( a>ct  +  6)\ 

=  2ho(t  -  a)  cos  [a)c(t  -  a)] 

Thus,  the  system  response  to  the  input  S(t  —  a)  is  2ho(t  —  a) cos  [a>c(f  —  a)].  Clearly,  this 
means  that  the  underlying  system  is  linear  time  invariant,  with  impulse  response 

h(t)  =  2 ho(t)  cos  (oct 


and  transfer  function 


H(f)  =  H0(f  +fe )  +  Wo (f  ~fc) 

The  transfer  function  H(f)  (Fig.  9.20c)  represents  an  ideal  bandpass  filter. 

If  we  apply  the  bandpass  process  x(t)  (Fig.  9.19)  to  the  input  of  this  system,  the  out¬ 
put  y(r)  at  d  will  remain  the  same  as  x(f).  Hence,  the  output  PSD  will  be  the  same  as  the 
input  PSD 

\H(f)\2Sx(f)  =  Sx(f) 

If  the  processes  at  points  b\  and  b2  (low-pass  filter  outputs)  are  denoted  by  xc(f)  and  x,(/), 
respectively,  then  the  output  x(/)  can  be  written  as 

x(r)  =  xc(/)  cos  (ioct  +  0)  +  xs(t)  sin  ( coct  +  0)  =  y(t)  (9.66) 

where  xc(t)  and  xs(t)  are  low-pass  random  processes  band-limited  to  B  Hz  (because  they  are 
the  outputs  of  low-pass  filters  of  bandwidth  B).  Because  Eq.  (9.66)  is  valid  for  any  value  of  0, 
by  substituting  0  =  0,  we  get  the  desired  representation  in  Eq.  (9.65)  with  y(/)  =  x(r). 

To  characterize  xc(0  and  xs(t),  consider  once  again  Fig.  9.20a  with  the  input  x(r).  Let 
6  be  an  RV  uniformly  distributed  over  the  range  (0,  2tt),  that  is,  for  a  sample  function,  0  is 
equally  likely  to  take  on  any  value  in  the  range  (0,  In).  In  this  case  x(t)  is  represented  as  in 
Eq  (9  66)  We  observe  that  xc(t)  is  obtained  by  multiplying  x(f)  by  2  cos  (a>ct  +  9),  and  then 
passing  the  result  through  a  low-pass  filter.  The  PSD  of  2x(/)cos  (u>ct  +  6)  is  [see  Eq.  (9.22b)| 

4  x  7[Sx(f  +/c)  +  —fc)] 

This  PSD  is  S  if)  shifted  up  and  down  by  fc,  as  shown  in  Fig.  9.21a.  When  this  is  passed 
through  a  low-pass  filter,  the  resulting  PSD  of  xf(t)  is  as  shown  in  Fig.  9.21b.  It  is  clear  that 


SXc{f)  = 


Sx(f  +fc)  +  Sx(f  -fc),  [ f\<B 
0,  \f\  >  B 


(9.67a) 
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mmrntmm 


Figure  9.21 

Derivation  of 
PSDs  of 
quadrature 
components  of  a 
bandpass 
random  process. 


-BO  B 

(a) 


Figure  9.22 

(a)  PSD  of  a 
bandpass  white 
noise  process. 

(b)  PSD  of  the 
quadrature 
components  of 
the  process. 


\(f)  or  S  *,(/) 


We  can  obtain  SXs(f)  in  the  same  way.  As  far  as  the  PSD  is  concerned,  multiplication  by 
cos  (a)ct  +  6)  or  sin  ( coct  +  0)  makes  no  difference,*  and  we  get 


SXl(f)  =  SXs(f)  = 


Sx(f+fc)  +  Sx(f-fc), 

0, 


\f\<B 

\f\>B 


(9.67b) 


From  Figs.  9. 1 9  and  9.2 1  b,  we  make  the  interesting  observation  that  the  areas  under  the  PSDs 
Sx(f),  SX({f),  and  SXs(f)  are  equal.  Hence,  it  follows  that 


x2(t)  =  x2(t)  =  x2(t)  (9.67c) 

Thus,  the  mean  square  values  (or  powers)  of  xc(t)  and  xs(f)  are  identical  to  that  of  x(t). 

These  results  are  derived  by  assuming  0  to  be  an  RV.  For  the  representation  in  Eq.  (9.65), 
0  =  0,  and  Eqs.  (9.67b,  c)  may  not  be  true.  Fortunately,  Eqs.  (9.67b,  c)  hold  even  for  the  case 
of  0  =  0.  The  proof  is  rather  long  and  cumbersome  and  will  not  be  given  here.'-3  It  can  also 
be  shown  that3 


xc(l)x,(0  =  Kx,x.t(0)  =  0  (968) 

That  is,  the  amplitudes  xc.  and  xv  at  any  given  instant  are  uncorrelated.  Moreover,  if  Sx(f) 1S 
symmetrical  about  fc  (as  well  as  -fc),  then 


7?xcXj(r)  —  0 


(9.69) 


Example  9.1 3 


The  PSD  of  a  bandpass  white  noise  n(t)  is  JV/2  (Fig.  9.22a).  Represent  this  process  in  terms 
of  quadrature  components.  Derive  Snc(f)  and  and  verify  that  n2.  =  nf  =  n2. 


I  We  have  the  expression 

n(r)  =  n c(t)  cos  cjct  +  nsff)  sin  coct 


(9.70) 


*  As  noted  in  connection  with  Eq.  (9.22a),  we  obtain  the  same  result  even  if  <pt  =  mU)  sin  (a>ct  +  &)■ 
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Figure  9.22 

(a)  PSD  of  a 
bandpass  white 
noise  process. 

(b)  PSD  of  the 
quadrature 
components  of 
the  process. 


Sn(f) 

U 

—  — 

2 

• 

1 

-fc 

fc  f 

Sn  (/)orSn  (/) 
M 


-B  B 

(b) 


where 


Snc(f)  =  Sns(f)  = 


\Sn{f+fc)  +  Sn(f-fc)  l f\<B 


|0  \f\>B 

It  follows  from  this  equation  and  from  Fig.  9.22  that 


Also, 


Sn,(f)  =  S»  *<f)  = 


—  rf'+B  Af 
n2  =  2  /  —  df  =  2AfB 

Jfc-B  2 

From  Fig.  9.22b  it  follows  that 

n2  =  n2  =  2  [\df  =  2MB 
Jo 

Hence, 


AT  \f\<B 
0  \f\>B 


n2  =  n2  =  n2  =  2 AfB 


(9.71) 


(9.72a) 


(9.72b) 


(9.72c) 


Nonuniqueness  of  the  Quadrature  Representation 

No  unique  center  frequency  exists  for  a  bandpass  signal.  For  the  spectrum  in  Fig.  9.23a, 
for  example  we  may  consider  the  spectrum  to  have  a  bandwidth  2 B  centered  at  fc.  The  same 
spectrum  can  be  considered  to  have  a  bandwidth  2 B'  centered  at/, ,  as  also  shown  in  Fig.  9.23a. 
The  quadrature  representation  [Eq.  (9.65)]  is  also  possible  for  center  frequency/,: 

x(r)  =  xn  (/)  cos  u>\  t  +  x.V|  (/)  sin  a>\l 
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Figure  9.23 

Nonunique 
nature  of 
quadrature 
component 
representation  of 
a  bandpass 
process. 


Figure  9.24 

A  possible  form 
of  quadrature 
component 
representation  of 
noise  in  SSB. 


where 


S*C](f)=SXst(f)  = 


1° 


l f\<B' 

I/I  >  B' 


(9.73) 


This  is  shown  in  Fig.  9.23b.  Thus,  the  quadrature  representation  of  a  bandpass  process 
is  not  unique.  An  infinite  number  of  possible  choices  exist  for  the  center  frequency,  and 

corresponding  to  each  center  frequency  is  a  distinct  quadrature  representation. 


Example  9.14A  bandpass  white  noise  PSD  of  an  SSB  channel  (lower  sideband)  is  shown  in  Fig.  9.24a. 

Represent  this  signal  in  terms  of  quadrature  components  with  the  carrier  frequency  fc. 

The  true  center  frequency  of  this  PSD  is  not/c,  but  we  can  still  use  fc  as  the  center 
frequency,  as  discussed  earlier, 

n(f)  =  nc(f)  cos  a)ct  +  n?(f)  sin  coct  (9.74) 

The  PSD  Snc(f)  or  Sns(f)  obtained  by  shifting  Sn(f )  up  and  down  by  fc  [see  Eq.  (9.73)]  is 
shown  in  Fig.  9.24b, 


Snc(f)  =  Sns(f) 


T  \f\±B 
0  l/|  >  B 


From  Fig.  9.24a  it  follows  that 


n2  =  MB 

Similarly,  from  Fig.  9.24b  we  have 

n2  =  n2  =  MB 


(9.75) 


(9.76a) 


(9.76b) 
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Figure  9.24 

A  possible  form 
of  quadrature 
component 
representation  of 
noise  in  SSB. 
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Hence, 


n2  =  n2  =  n2  =  MB 


(9.76c) 


Bandpass  “White”  Gaussian  Random  Process 

Thus  far  we  have  avoided  defining  a  Gaussian  random  process.  The  Gaussian  random  process 
is  perhaps  the  single  most  important  random  process  in  the  area  of  communication.  It  requires 
a  rather  careful  and  unhurried  discussion.  Fortunately,  we  do  not  need  to  know  much  about 
the  Gaussian  process  at  this  point;  to  avoid  unnecessary  digression,  therefore,  its  detailed 
discussion  is  postponed  until  Chapter  1 1 .  All  we  need  to  know  here  is  that  an  RV  x(r)  formed 
by  sample  function  amplitudes  at  instant  I  of  a  Gaussian  process  is  Gaussian,  with  a  PDF  of 
the  form  of  Eq.  (8.39). 

AGaussian  random  process  with  a  uniform  PSD  is  called  a  white  Gaussian  random  process. 
A  bandpass  “white”  Gaussian  process  is  actually  a  misnomer.  However,  it  is  a  popular  notion 
to  represent  a  random  process  n (/)  with  uniform  PSD  Af/2  centered  at  fc  and  with  a  bandwidth 
2 B  (Fig.  9.22a).  Utilizing  the  quadrature  representation,  it  can  be  expressed  as 

n(r)  =  nc(f)  cos  a >ct  +  n5(f)  sin  wct  (9.77) 


where,  from  Eq.  (9.71),  we  have 


S^ff)  =  Sa,(f )  = 


o 


I/I  <  B 
\f\>B 


Also,  from  Eq.  (9.72c), 


n2  =  n?  =  n2  =  2 MB 

The  bandpass  signal  can  also  be  expressed  in  polar  form  [see  Eq.  (3.40)]: 


(9.78) 


n(t)  =  E(f)  cos  ((oct  +  ©) 


(9.79a) 
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where  the  random  envelope  and  random  phase  are  defined  by 


E(f)  =  +  nf(/)  (9.79b) 

0(0  = -tan"1  (9.79c) 

n  At) 

The  RVs  nc(0  and  ns(f)  are  uncorrelated  [see  Eq.  (9.68)]  Gaussian  RVs  with  zero  means  and 
variance  2A fB  [Eq.  (9.78)].  Hence,  their  PDFs  are  identical: 


Av(«)  =Pns(  a) 


(9.80a) 


where 


ct2  =  2A fB  (9-BOb) 

It  has  been  shown  in  Prob.  8.2-12  that  if  two  jointly  Gaussian  RVs  are  uncorrelated,  they  are 
independent.  In  such  a  case,  as  shown  in  Example  8.17,  E(r)  has  a  Rayleigh  density 

Pe(E)  =  ^e~E2/2a2u(E),  ct2  =  2 MB  (9-81) 

and  0  in  Eq.  (9.79a)  is  uniformly  distributed  over  (0,  2tt). 

Sinusoidal  Signal  in  Noise 

Another  case  of  interest  is  a  sinusoid  plus  a  narrowband  Gaussian  noise.  If  A  cos  (o)ct  + 
is  a  sinusoid  mixed  with  n(f),  a  Gaussian  bandpass  noise  centered  at  fc,  then  the  sum  y(0  1S 
given  by 


y(0  =  A  cos  ( o)ct  +  ({>)  +  n(0 

By  using  Eq.  (9.66)  to  represent  the  bandpass  noise,  we  have 

y(0  =  [A  +  nc (/)]  cos  ( (oct  +  (p)  -I-  n5(0  sin  ( (oct  +  tp) 

=  E(0  cos  [a)ct  +  ©(f)  +  <p] 

where  E(f)  is  the  envelope  [E(f)  >  0]  and  0(f)  is  the  angle  shown  in  Fig.  9.25, 

E(0  =  J[A  +  nc(f)]2-|-n2(f) 


0(f)  =  -  tan-1 


n  s(t) 


A  +  nc(f) 


(9.82a) 

(9.82b) 


(9.83a) 

(9.83b) 


Figure  9.25 

Phasor 

representation  of 
a  sinusoid  and  a 
narrowband 
Gaussian  noise. 
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Both  n c(t)  and  n s(t)  are  Gaussian,  with  variance  a2.  For  white  Gaussian  noise,  a2  =  ZA fB 
[Eq.  (9.80b)].  Arguing  in  a  manner  analogous  to  that  used  in  deriving  Eq.  (8.57),  and  observing 
that 


n*  +  n2  =  E2-A2-  24  n, 

=  E2  -  2A(A  +  nc)  +  A2 
=  E2  —  2AE  cos  0(f)  +  A2 


we  have 


Pe&(E,  0)  = 


E  ~(E2-2AEcos  0+A2)/2o2 
2no* 


(9.84) 


where  a2  is  the  variance  of  n(  (or  nv)  and  is  equal  to  2 SfB  for  white  noise.  From  Eq.  (9.84) 
we  have 

Pe(E)=  f  PEe(E,e)dO 
J—jt 

=  JLe-(E2+A2)/l*2  |^_L  Jn  e(AE/a2)cos0de^  (9.85) 

The  bracketed  term  on  the  right-hand  side  of  Eq.  (9.85)  defines  I()(AE/a2),  where  /0  is  the 
modified  zero-order  Bessel  function  of  the  first  kind.  Thus, 


E 

Pe(E)  =  — 2 
a * 


=  *e-(E2+A2)'2 °2Io 


(9.86a) 


This  is  known  as  the  Rice  density,  or  Ricean  density.  For  a  large  sinusoidal  signal  (A  »  cr), 
it  can  be  shown  that4 


Io 


^E/a2 


and 


Pe(E) 


4 


2  nAo2 


-{E-A)2/2o2 


(9.86b) 


Because  A  »  cr ,  E~A,  and  pz(E )  in  Eq.  (9.86b)  is  very  nearly  a  Gaussian  density  with  mean 
A  and  variance  cr, 


Pe(E)  2 


1  e-(E-A)2/2o 2 

os/2n 


(9.86c) 


Figure  9.26  shows  the  PDF  of  the  normalized  RV  E/o .  Note  that  for  A/a  =0,  we  obtain  the 

RdyFromdteheSjoint  PDF  pbb{E,0),  we  can  also  obtain  p&(6),  the  PDF  of  the  phase  0,  by 
integrating  the  joint  PDF  with  respect  to  £, 


poo 

p@(Q)  =  /  Pe©(£>  0)  dE 

Jo 
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Figure  9.26 

Ricean  PDF. 


Although  the  integration  is  straightforward,  there  are  a  number  of  involved  steps,  and  for  this 
reason  it  will  not  be  repeated  here.  The  final  result  is 


Pe(0)  =  ^e~A2/2a2  { 1  +  £ V^Fcos  ^  [l  -  G  (^-^) 


(9.86d) 
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PROBLEMS 

9.1- 1  (a)  Sketch  the  ensemble  of  the  random  process 

x(f)  =  a  cos  (coct  +  ©) 

where  coc  and  0  are  constants  and  a  is  an  RV  uniformly  distributed  in  the  range  (0,  A). 

(b)  Just  by  observing  the  ensemble,  determine  whether  this  is  a  stationary  or  a  nonstationary 
process.  Give  your  reasons. 

9.1- 2  Repeat  part  (a)  ot  Prob.  9.1-1  if  a  and  coc  are  constants  but  0  is  an  RV  uniformly  distributed 

the  range  (0,  In). 

9.1- 3  Find  the  mean  and  the  variance  of  the  random  signal  x(r)  in  Prob.  9.1-2. 
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9.1- 4  (a)  Sketch  the  ensemble  of  the  random  process 

\(t)  =  ar2  +  b 

where  b  is  a  constant  and  a  is  an  RV  uniformly  distributed  in  the  range  (—2,  2). 

(b)  Just  by  observing  the  ensemble,  state  whether  this  is  a  stationary  or  a  nonstationary  process. 

9.1- 5  Determine  x(7)  and  Rx(t\ ,  ft)  f°r  the  random  process  in  Prob.  9.1-1,  and  determine  whether 

this  is  a  wide-sense  stationary  process. 

9.1- 6  Repeat  Prob.  9.1-5  for  the  process  x(f)  in  Prob.  9.1-2. 

9.1- 7  Repeat  Prob.  9.1-5  for  the  process  x(f)  in  Prob.  9.1-4. 

9.1- 8  Given  a  random  process  x(r)  =  kf,  where  k  is  an  RV  uniformly  distributed  in  the  range  (- 1,  1). 

(a)  Sketch  the  ensemble  of  this  process. 

(b)  Determine  x(f). 

(c)  Determine  R\(t\,  ^)* 

(d)  Is  the  process  wide-sense  stationary? 

(e)  Is  the  process  ergodic? 

9. 1  -9  Repeat  Prob.  9.1-8  for  the  random  process 

x(/)  =  a  cos  (coct  +  0) 

where  coc  is  a  constant  and  a  and  0  are  independent  RVs  uniformly  distributed  in  the  ranges 
(-1,  1 )  and  (0,  2^).  respectively. 

9.1-10  Find  the  average  power  Px  (that  is,  its  mean  square  value  x2(f)]  of  the  random  process  in 
Prob.  9.1-9. 

9.2- 1  Show  that  for  a  wide-sense  stationary  process  x(t). 

(a)  RAO)  >  I*x(t)I  rytO 

Hint :  (x,  ±x2)2  =  +  $  ± ^  >  0.  Let  x ,  =  x(/, )  and  x2  =  x(r2). 

(b)  lim  R\(t)  =  x2 

r-*oo 

Hint:  As  r  — ►  oo,  X)  and  x2  tend  to  become  independent. 

9.2- 2  State  whether  each  of  the  following  functions,  can  be  a  valid  autocorrelation  function  of  a 

stationary  real  random  process. 

(a) ?L  (d)  mt)+^r4 

(b)  J  -  (e)  sin  (tuor) 

(c)  e~Tu( r)  (0  cos  (mot) 

9.2- 3  State  whether  each  of  the  following  functions,  can  be  a  valid  PSD  of  a  real  random  process. 
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(a) 

(2  nf)2 

(e) 

(2nf)*+2 

(b) 

(0 

(2jt/)2-2 

(c) 

(2  nfj1 

(g) 

W+2 

(d) 

S(f  fo)+  (2*/)2+2 

S(f  +/o)  -  *<f  -/o) 

cos  2n{f  +/o)  +jsin  2n(f  -  fo ) 

e~(2xf)2 


9.2-4  Show  that  if  the  PSD  of  a  random  process  x(r)  is  band-limited,  and  if 


1  n  =  0 

0  n  =  ±1,  ±2,  ±3,... 

then  the  minimum  bandwidth  process  x(/)  that  can  exhibit  this  autocorrelation  function  must  be 
a  white  band-limited  process;  that  is,  Sx(0  =  k  U(f  /2W). 

Hint:  Use  the  sampling  theorem  to  reconstruct  R\(r). 

9.2- 5  For  random  processes  in  Prob.  9.2-4,  define  a  class  of  raised-cosine  PSDs  when  the  signal 

bandwidth  is  limited  to  2B/(\  +  r). 

9.2- 6  For  the  random  binary  process  in  Example  9.5  (Fig.  9.9a),  determine  /?x(r)  and  Sx(f)  if  the 

probability  of  transition  (from  1  to  - 1  or  vice  versa)  at  each  node  is  p  instead  of  0.5. 

9.2- 7  A  wide-sense  stationary  white  process  m(t)  band-limited  to  B  Hz  is  sampled  at  the  Nyquist  rate. 

Each  sample  is  transmitted  by  a  basic  pulse  p(t)  multiplied  by  the  sample  value.  This  is  a  PAM 
signal.  Show  that  the  PSD  of  the  PAM  signal  is  2BRm(0)\P(f)\2. 

Hint:  Use  Eq.  (9.31).  Show  that  Nyquist  samples  a*  and  a*+„  (n  >  1)  are  uncorrelated. 

9.2- 8  A  duobinary  line  code  proposed  by  Lender  is  a  ternary  scheme  similar  to  bipolar  but  requires 

only  half  the  bandwidth  of  the  latter.  In  this  code,  0  is  transmitted  by  no  pulse,  and  1  is  transmitted 
by  pulse  p(t)  or  -p(t)  using  the  following  rule:  A  1  is  encoded  by  the  same  pulse  as  that  used  to 
encode  the  preceding  1  if  the  two  Is  are  separated  by  an  even  number  of  Os.  It  is  encoded  by  the 
negative  of  the  pulse  used  to  encode  the  preceding  1  if  the  two  Is  are  separated  by  an  odd  number 
of  Os.  Random  binary  digits  are  transmitted  every  T ),  seconds.  Assuming  P(0)  =  P(l)  =  ®  ^ 
show  that 

Sy(f)  =  cos2  (7zJTb) 

Tb 

Find  Sy^)  if  p(t),  the  basic  pulse  used,  is  a  half-width  rectangular  pulse  n(2f/7/7). 

9.2- 9  Determine  Sy(f)  for  polar  signaling  if  P(l)  =  Q  and  P( 0)  =  1  -  Q. 

9.2-10  An  impulse  noise  x(r)  can  be  modeled  by  a  sequence  of  unit  impulses  located  at  random  instants 
(Fig.  P9.2-10).  There  are  on  average  a  impulses  per  second,  and  the  location  of  any  impulse  is 
independent  of  the  locations  of  other  impulses.  Show  that  Rx( r)  =  a  8(r)  +  a2. 


Figure 

P.9.2-13 


Figure 

P.9.2-10 


t 


a 
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Figure 

P.9.2-13 


9.2- 11  Find  the  autocorrelation  function  of  the  impulse  noise  in  Prob.  9.2-10  if  the  impulses  are  equally 

likely  to  have  magnitude  ±1,  ±3. 

9.2- 12  Find  the  autocorrelation  function  of  the  output  signal  when  the  impulse  noise  in  Prob.  9.2- 1 1  is 

the  input  to  a  linear  time-invariant  system  with  impulse  response  h{t). 

9.2- 13  A  sample  function  of  a  random  process  x(f)  is  shown  in  Fig.  P9.2-13.  The  signal  x(/)  changes 

abruptly  in  amplitude  at  random  instants.  There  are  an  average  of  ft  amplitude  changes  (or 
shifts)  per  second.  The  probability  that  there  will  be  no  amplitude  shift  in  r  seconds  is  given  by 
p0(r )  =  e~PT .  The  amplitude  after  a  shift  is  independent  of  the  amplitude  before  the  shift.  The 
amplitudes  are  randomly  distributed,  with  a  PDF p\(x).  Show  that 


This  process  represents  a  model  for  thermal  noise.1 

9.3-1  Show  that  for  jointly  wide-sense  stationary,  real,  random  processes  x(t)  and  y(/), 

|Kxy(r)l  <  l/?x(0)/?yC0)]l/2 


Hint:  For  any  real  number  a ,  (ax  -  y)2  >  0. 

9.3-2  Two  random  processes  x(r)  and  y (/)  are 

x(f)  =  A  cos  (a)Qt  +  <p)  and  y (t)  =  B  sin  (nwot  +  n<p  +  ifr) 


where  n  =  integer  ^  0  and  A,  B ,  and  a>o  are  constants,  and  <p  is  an  RV  uniformly  distributed 

in  the  range  (0.  2 n).  Show  that  the  two  processes  are  incoherent. 

9.3-3  If  x(/)  and  y (t)  are  two  incoherent  random  processes,  and  two  new  processes  u ( / )  and  v (f)  are 
formed  as  follows 


u(/)  =  2x(r)  -  y(/)  v(r)  =  x(/)  +  3y(/) 


findtfu(r).  ^v(r).  /fUv(r),  and  Rvu(r)  in  terms  of  Rx(r)  and /?y(r). 


9  3-4  A  sample  signal  is  a  periodic  random  process  x(/)  shown  in  Fig.  P9.3-4.  The  initial  delay  b  where 
the  first  pulse  begins  is  an  RV  uniformly  distributed  in  the  range  (0,  T0). 

(a)  Show  that  the  sample  signal  can  be  written  as 


00 


x(r)  =  C0  +  C„ cos  \nwo(t  -  b)  +  0n\ 
n=  1 


by  first  finding  its  trigonometric  Fourier  series  when  b  =  0. 


mmm 
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(b)  Show  that 

1  00  2  ji 

RAr)  =  Cq  +  -  J]C^cos  moot  wo  = 


g 


9.4- 1  Consider  Example  9. 10  again.  The  two  random  noise  processes  have  the  following  PSD: 

5x1  =  or2  +  (J2nf)2  5x2  =  K 

Find  the  PSD  and  the  power  of  the  output  random  process  y(r). 

9.4- 2  Show  that  R\y(r ),  the  cross-correlation  function  of  the  input  process  x(t )  and  the  output  process 

y(r)  in  Fig.  9.12,  is 


^xy (t)  — /i(r)  * /?x(r)  and  Sxy(f)  —  H (f)S\(f) 

Hence,  show  that  for  the  thermal  noise  n(r)  and  the  output  v()(t)  in  Fig.  9.13  (Example  9.9), 


Sn\„(f)  — 


2kTR 


1  +  j2nfRC 


and  Rmo(r)=^e  r/RC  u(t) 


9.4-3  A  simple  RC  circuit  has  two  resistors  /?|  and  R2  in  parallel  (Fig.  P9.4-3a).  Calculate  the  rms 
value  of  the  thermal  noise  voltage  \Q  across  the  capacitor  in  two  ways: 


(a)  Consider  resistors  R  j  and  R2  as  two  separate  resistors,  with  respective  thermal  noise  voltages 
of  PSD  2kTR\  and  IkTRi  (Fig.  P9.4-3b).  Note  that  the  two  sources  are  independent. 

(b)  Consider  the  parallel  combination  of  /?  j  and  Ri  as  a  single  resistor  of  value  R 1 R2I (^1  ' 

with  its  thermal  noise  voltage  source  of  PSD  2kTR\R2/(R\  +R2)  (Fig.  P9.4-3c).  Comment. 


Figure  P.9.4-4 


9.4-4  A  shot  noise  is  similar  to  impulse  noise  described  in  Prob.  9.2-10  except  that  instead  of  random 
impulses,  we  have  pulses  of  finite  width.  If  we  replace  each  impulse  in  Fig.  P9.2-10  by  a  pulse 
h(t)  whose  width  is  large  compared  to  1/a,  so  that  there  is  a  considerable  overlapping  of  pulses- 
we  get  shot  noise.  The  result  of  pulse  overlapping  is  that  the  signal  looks  like  a  continuous 
random  signal,  as  shown  in  Fig.  P9.4-4. 
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(a)  Derive  the  autocorrelation  function  and  the  PSD  of  such  a  random  process. 

Hint:  Shot  noise  results  from  passing  impulse  noise  through  a  suitable  filter.  First  derive 
the  PSD  of  the  shot  noise  and  then  obtain  the  autocorrelation  function  from  the  PSD.  The 
answers  will  be  in  terms  of  or,  /*(/),  or  H  if). 

(b)  The  shot  noise  in  transistors  can  be  modeled  by 


h(t)  =  le~,ITu(t) 


where  q  is  the  charge  on  an  electron  and  T  is  the  electron  transit  time.  Determine  and  sketch  the 
autocorrelation  function  and  the  PSD  of  the  transistor  shot  noise. 


ure  P.9.4-4 


9.6-1  A  signal  process  m (t)  is  mixed  with  a  channel  noise  n(/).  The  respective  PSDs  are 


20 


and 


(a)  Find  the  optimum  Wiener-Hopf  filter. 

(b)  Find  the  noise  power  at  the  input  and  the  output  of  the  filter. 

(c)  What  is  the  SNR  improvement  achieved  by  using  this  filter? 


9.6-2  Repeat  Prob.  9.6- 1  if 


Smif)  4  +  (2jr/)2 


4 


9.6-3  Repeat  Prob.  9.6-1  if 


20 


9.7-1  Consider  a  message  signal  with  PSD 


in  which  a  = 


1400jt.  The  channel  noise  is  white  with  PSD 


,  -V 

Sn(f)  =  y 
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(a)  Find  the  optimum  preemphasis  and  deemphasis  filters. 

(b)  Find  the  SNR  improvement  achieved  by  the  optimum  filters  in  part  (a). 


9.7-2  Consider  a  message  signal  with  PSD 


Figure  P.9.8-3 


The  channel  noise  is  white  with  PSD 


(a)  Find  the  optimum  preemphasis  and  deemphasis  filters. 

(b)  Find  the  SNR  improvement  achieved  by  the  optimum  filters  in  part  (a). 


9.7-3  A  message  signal  m(/)  with 


2 


DSB-SC  modulates  a  carrier  of  100  kHz.  Assume  an  ideal  channel  with  Hc(f)  =  10  3  an£* 
the  channel  noise  PSD  Sn(f )  =  2  x  10“^.  The  transmitted  power  is  required  to  be  1  kW,  and 
G  =  10"2. 

(a)  Determine  transfer  functions  of  optimum  preemphasis  and  deemphasis  filters. 

(b)  Determine  the  output  signal  power,  the  noise  power,  and  the  output  SNR. 

(c)  Determine  y  at  the  demodulator  input. 

9.7- 4  Repeat  Prob.  9.7-3  for  the  SSB  (USB)  case. 

9.8- 1  A  white  noise  process  of  PSD  JV. /2  is  transmitted  through  a  bandpass  filter  H(f)  (Fig.  P9.8-D- 

Represent  the  filter  output  n(r)  in  terms  of  quadrature  components,  and  determine  ' ' 

nc>  when  the  center  frequency  used  in  this  representation  is  100  kHz  (i ,c.,fc  — 

100  x  103). 


Figure  P.9.8-1 


Wf)\2 


2 

1 


9.8-2  Repeat  Prob.  9.8-1  if  the  center  frequency  fc  used  in  the  representation  is  not  a  true  center 
frequency.  Consider  three  cases:  (a )fc  =  105  kHz:  (b)/c  =  95  kHz;  (c)/c  =  120  kHz. 
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Figure  P.9.8-3 


9.8-3  A  random  process  x(f)  with  the  PSD  shown  in  Fig.  P9.8-3a  is  passed  through  a  bandpass  filter 
(Fig.  P9.8-3b).  Determine  the  PSDs  and  mean  square  values  of  the  quadrature  components  of 
the  output  process.  Assume  the  center  frequency  in  the  representation  to  be  0.5  MHz. 


H(f) 

_  1 

- 1 

t 

1 

n 

100  kHz 

i 

0 

0.5  MHz  /— ► 

(b) 
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Figure  10.1 

Communication 
system  model. 


Figure  10.2 

DSB-SC  system. 


10.2  AM 


In  this  chapter,  to  facilitate  the  comparison  of  various  systems,  we  analyze  the  performance 
of  modulated  analog  communication  systems  in  the  presence  of  noise. 

10.1  ANALYTICAL  FIGURE  OF  MERIT 

Figure  10.1  is  a  schematic  of  a  communication  system  that  captures  the  essence  of  various 
analog  modulations.  The  transmitter  modulates  the  message  m(t )  and  transmits  the  modulated 
signal  at  power  Sf  over  a  channel  (or  transmission  medium).  The  transmitted  signal  is  corrupted 
by  an  additive  channel  noise.  Just  as  in  our  baseband  analysis  of  Chapter  9,  the  channel  may 
attenuate  and  distort  the  signal.  At  the  receiver  end,  the  input  signal  and  noise  powers  are  5, 
and  Ni,  respectively.  The  job  of  the  receiver  is  to  demodulate  the  modulated  signal,  under  noise 
and  interferences,  to  generate  the  message  m(t)  as  the  desired  signal  output. 

The  signal  and  noise  powers  at  the  receiver  output  are  S„  and  N(),  respectively.  For  analog 
message  signal  m(t),  the  quality  of  the  received  signal  is  determined  by  S0/N0,  the  output  SNR' 
Hence,  we  shall  focus  our  attention  on  SNR  as  a  key  performance  metric.  However,  S0/Nc 
can  be  increased  as  much  as  desired  simply  by  increasing  the  transmitted  power  ST.  Thus,  it 
would  be  unfair  to  compare  two  receivers  when  one  has  the  benefit  of  higher  input  receiver 
power.  To  make  a  fair  performance  comparison,  receivers  should  be  operating  under  the  same 
environment.  Hence,  the  value  of  S„/N0  for  a  given  transmitted  power  is  an  appropriate  figure 
of  merit  in  an  analog  communication  system.  In  practice,  the  maximum  value  of  ST  is  l«mited 
by  considerations  such  as  transmitter  cost,  channel  capability,  and  interference  with  other 
channels.  Often,  it  is  more  convenient  to  deal  with  the  received  power  5,  rather  than  the 
transmitted  power  ST.  From  Fig.  10.1,  it  is  apparent  that  5,  is  proportional  to  ST.  Hence,  the 
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Figure  10.1 

Communication 
system  model. 


Figure  10.2 

DSB-SC  system. 


Channel  noise 


Transmitter  Channel 


Receiver  Demodulator 


value  of  S()/N()  for  a  given  5/  will  serve  equally  well  as  our  figure  of  merit  in  performance 
analysis. 


10.2  AMPLITUDE-MODULATED  SYSTEMS 

We  shall  analyze  DSB-SC,  SSB-SC,  and  AM  systems  separately. 

DSB-SC 

A  basic  DSB-SC  system  is  shown  in  Fig.  10.2.*  The  modulated  signal  is  a  bandpass  signal 
centered  at/(  with  a  bandwidth  2 B.  The  channel  noise  is  assumed  to  be  additive.  The  channel 
and  the  filters  in  Fig.  10.2  are  assumed  to  be  ideal. 

Let  Si  and  S„  represent  the  useful  signal  powers  at  the  input  and  the  output  of  the  demod¬ 
ulator,  and  let  N0  represent  the  noise  power  at  the  demodulator  output.  The  signal  at  the 
demodulator  input  is 


s/2m(f)cos  coct  4-  n,(r) 

where  the  additive  white  noise  n(t)  passes  through  the  bandpass  filter  to  generate  a  bandpass 
noise 


n,(r)  =  bandpass  filtering  {«(/)} 

=  nc(t)  cos  o)ct  4-  ns(t)  sin  coct 


at  the  demodulator.  Its  spectrum  is  centered  at  wc  and  has  a  bandwidth  2 B  Hz.  The  input  signal 
power  Sj  is  the  power  of  the  modulated  signal  s/2  m it)  cos  (o(  t .  From  Ecj.  (9.22c), 


Si  =  [V2m(f)cos  a)ct]2  =  (s/2)2[m(r) cos  (oct\2  =  m 2(t)  =  m2  (10.1) 


*  The  use  of  an  input  bandpass  filter  in  the  receiver  may  appear  redundant  because  the  out-of-band  noise 
components  will  be  suppressed  by  the  final  baseband  filter.  In  practice,  an  input  filter  is  usetul  because  by  removing 
the  out-of-band  noise,  it  reduces  the  probability  of  nonlinear  distortion  from  overload  effects. 

+  The  modulated  signal  also  has  a  random  phase  0,  which  is  uniformly  distributed  in  the  range  (0.  2n).  This  random 
phase  [which  is  independent  of  m(»)l  does  not  affect  the  final  results  and.  hence,  is  ignored  in  this  discussion. 
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The  reader  may  now  appreciate  our  use  of  \/2cos  coct  (rather  than  cos  &><■/)  in  the  modulator 
(Fig.  10.2).  This  was  done  to  facilitate  comparison  by  making  the  received  power  equal  to  that 
in  the  baseband  system.  We  shall  use  a  similar  artifice  in  our  analysis  ot  the  SSB  system. 

To  determine  the  output  powers  S0  and  Na,  we  note  that  the  signal  at  the  demodulator 
input  is 


y,(f)  =  V2m(/)cos  coct  +  n(t ) 

Because  n(t)  is  a  bandpass  signal  centered  at  toc,  we  can  express  it  in  terms  of  quadrature 
components,  as  in  Eq.  (9.70).  This  gives 

y ,•(/)  =  [V2  m(r)  +  nc(r) j  cos  u>ct  +  n5(r)  sin  a>ct 


When  this  signal  is  multiplied  by  s/2cos  coct  (synchronous  demodulation)  and  then  low-pass 
filtered,  the  bandpass  terms  m(/)  cos  2 coct  and  m ( t )  sin  2 a>ct  are  suppressed.  The  resulting 
demodulator  output  y a(t)  is 


y  o(t)  =  m(r)  -l-  —=n  At) 

v2 


Hence, 


S0  =  m2  =  S, 
N„  =^T) 


For  white  noise  with  power  density  A/72,  we  have  [Eq.  (9.72b)] 


(10.2a) 

(10.2b) 


nj?(f)  =  n2(f)  =  2MB 


and 


Nn=MB  (,a3) 

Hence,  from  Eqs.  ( 10.2a)  and  (10.3)  we  have 

—  =  _^L  =  v  (10.4) 

N0  MB  Y 

Comparison  of  Eqs.  (10.4)  and  (9.52)  shows  that  for  a  fixed  transmitted  power  (which  also 
implies  a  fixed  signal  power  at  the  demodulator  input),  the  SNR  at  the  demodulator  output  >s 
the  same  for  the  baseband  and  the  DSB-SC  systems.  Moreover,  quadrature  multiplexing  w 
DSB-SC  can  render  its  bandwidth  requirement  identical  to  that  of  baseband  systems.  Thus, 
theoretically,  baseband  and  DSB-SC  systems  have  identical  performance  and  capabilities. 

SSB-SC 

The  fundamentals  of  SSB-SC  amplitude  modulation  has  been  presented  in  Section  4.4. 1 
out  considering  the  effect  of  noise.  We  now  investigate  the  effect  of  channel  noise  on  t 


Figure  10.3 

SSB-SC  system. 
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Transmitter  Channel  Receiver 


quality  of  demodulation.  An  SSB-SC  system  is  shown  in  Fig.  10.3.  The  SSB  signal  <pSSB(t ) 
can  be  expressed  as  [see  Eq.  (4.20c)] 


<pSSB  ( t )  =  m(f)  cos  a)ct  +  mh(t)  sin  coct  (10.5) 

The  spectrum  of  <pSSB(t)  is  shown  in  Fig.  4.13d  after  the  DSB-SC  signal  is  filtered  by  the  (band¬ 
pass)  SSB  filter.  This  signal  can  be  obtained  (Fig.  10.3)  by  multiplying  m(r)  with  2  cos  u>ct  and 
then  suppressing  the  unwanted  sideband.  The  power  of  the  modulated  signal  2  m(r)  cos  (oct 
equals  2  in2  [four  times  the  power  of  m(/)cos  wct].  Suppression  of  one  sideband  halves  the 
power.  Hence  Si,  the  power  of  <fiSSB (t),  is 

S,=^  00.6) 

If  we  express  the  channel  bandpass  noise  in  terms  of  quadrature  components  as  in  Eq.  (9.74) 
of  Example  9. 14,  we  find  the  signal  at  the  detector  input,  y,(t): 

y ,.(/)  =  [m(f)  -I-  nt(f)]  cos  0)cl  +  [m*(r)  +  n.5(/)]  sin  u)ct 

At  the  synchronous  demodulation  receiver,  this  signal  is  multiplied  by  2  cos  (oct  and  then 
low-pass-filtered  to  yield  the  demodulator  output 

y„(/)  =  m(t)  +  n(.(f) 


Hence, 

S0  =  m2  =  S, 

C0.7) 

We  have  already  found  for  ihe  SSB  channel  noise  (lower  sideband)  in  Eq.  (9.76b)  as 

/v„  =  n2  =  NB 


Thus, 


N0  NfB 


(10.8) 


This  shows  that  baseband.  DSB-SC.  and  SSB-SC  systems  perform  identically  in  terms  of 
resource  utilization.  All  of  them  yield  the  same  output  SNR  for  g.ven  transmuted  power  and 
transmission  bandwidth. 


*  Although  this  is  LSB,  the  discussion  is 


valid  for  USB  as  well. 


568  PERFORMANCE  ANALYSIS  OF  MODULATED  COMMUNICATION  SYSTEMS  UNDER  NOISE 


Example  10.1  In  a  DSB-SC  system,  the  carrier  frequency  is/r  =  500  kHz,  and  the  modulating  signal  m(f)  has 

_ _  ...  .  ..  a  i  t  t-i _ A  io  tronctnittpM  r\\/pr  q  Hictortinnlpcc 


a  uniform  PSD  band-limited  to  4  kHz.  The  modulated  signal  is  transmitted  over  a  distortionless 
channel  with  a  noise  PSD  Sn(f)  =  1/(4jt  2f2  +  a2),  where  a  =  106tt.  The  useful  signal  power 
at  the  receiver  input  is  1  /zW.  The  received  signal  is  bandpass  filtered,  multiplied  by  2  cos  a>ct, 
and  then  low-pass-filtered  to  obtain  the  output  s„(t)  +  na(t).  Determine  the  output  SNR. 

If  the  received  signal  is  km(t)  cos  coct,  the  demodulator  input  is  [A:m(f)  +  n<(f)]  cos  o>ct  + 
ns(f)  sin  a >ct.  When  this  is  multiplied  by  2  cos  coct  and  low-pass-filtered,  the  output  is 


So(f)  +  n„(r)  =  km(t)  +  nf(r) 


(10.9) 


Hence, 


S0  —  k2  m2  and  N„  =  n(2 

But  the  power  of  the  received  signal  Am(r)cos  coct  is  1  gW.  Hence, 


and 


S0  =  k2 m2  =  2  x  10“6 


To  compute  n2,  we  use  Eq.  (9.67c): 


where  n2  is  the  power  of  the  incoming  bandpass  noise  of  bandwidth  8  kHz  centered  at 
500  kHz;  that  is. 


504,000 


2nf  (2jr  )504,oo° 


a  (2tt  >496,000 


=  8.25  x  10-10  =  N0 


Therefore, 


So  2  X  10“6 


=  2.42  x  10-1 


N0  8.25  x  10-'° 
=  33.83  dB 


AM 


AM  signals  can  be  demodulated  synchronously  or  by  envelope  detection.  The  former  appr°aL 
is  of  theoretical  interest  only.  It  is  useful,  however,  for  comparing  the  noise  performance  o 
the  envelope  detector.  For  this  reason,  we  shall  consider  both  methods. 
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Coherent  (Synchronous)  AM  Demodulation:  Coherent  AM  detection  is  identical  to 
DSB-SC  in  every  respect  except  for  the  additional  and  redundant  carrier  term.  If  the  received 
signal  y/l [A  +  m(f)]  cos  ioct  is  multiplied  by  v^cos  coct ,  the  demodulator  output  is  m(r). 
Hence, 


S0  —  m- 

The  output  noise  will  be  exactly  the  same  as  that  in  DSB-SC  [Eq.  (10.3)]: 

N0  =  n2  =  Mb 


The  received  signal  is  y/l[A  +  m(f)]  cos  wct.  Hence,  the  total  received  signal  power  is 


Si  =  (V2)2 


[A  +  m(/)]2 


=  [A  +  m(r)]2 


=  A2  +  m2(/)  +  2A  m(r) 


Because  m(/)  is  assumed  to  have  zero  mean, 

Sj  =A2  +  m2(r) 


and 


So  m2 

N„  ~  AfB 

i^2  Sj 
A2  -I-  m2  B 
m2 

=  — — ==Y 
A2  +  m2 


If  m(,)max  =  trip,  then  A  >  mp.  For  the  maximum  SNR,  A  =  mp,  and 

(k\  =  _”Ly 

V  Ho  /  max  W2  +  ni- 

1 

= - == - Y 

(m2/m2  +  1) 


(10.10) 


(10.11a) 


Because  (wt2/ m2)  >  1, 


(10.11b) 


,  u  n  that  the  SNR  in  AM  is  worse  than  that  in  DSB-SC  and  SSB-SC  (by  at  least 
3  SE  rX  lu.  6  dB  in  practice,  depending  on  the  Nation  index  and  ihe  signal 


570  PERFORMANCE  ANALYSIS  OF  MODULATED  COMMUNICATION  SYSTEMS  UNDER  NOISE 

waveform).  This  performance  loss  differs  for  different  message  signals.  The  reason  is  the 
relative  amount  of  power  consumed  by  the  carrier  component  in  the  total  AM  signal.  For 
example,  when  m(f)  is  sinusoidal,  tnjjm-  =  2,  and  AM  requires  three  times  as  much  power 
(4.77  dB)  as  that  needed  for  DSB-SC  or  SSB-SC. 

In  many  communication  systems  the  transmitter  is  limited  by  peak  power  rather  than 
average  power  transmitted.  In  such  a  case,  AM  fares  even  worse.  It  can  be  shown  (Prob.  1 0.2-5) 
that  in  tone  modulation,  for  a  fixed  peak  transmission  power,  the  output  SNR  of  AM  is  6  dB 
below  that  of  DSB-SC  and  9  dB  below  that  of  SSB-SC.  These  results  are  valid  under  conditions 
most  favorable  to  AM,  that  is,  with  modulation  index  ft  =  l.  For  /n  <  1 ,  AM  would  be  even 
worse  than  this.  For  this  reason,  volume  compression  and  peak  limiting  are  generally  used  in 
AM  transmission  for  the  sake  of  having  full  modulation  most  of  the  time. 

AM  Envelope  Detection:  Assuming  the  received  signal  to  be  [A  -F  m(f)]  cos  coct,  the 
demodulator  input  is 


y,(r)  =  [A  +  m(f)]  cos  coct  +  n(r) 

We  use  the  quadrature  component  representation  for  n(r)  to  write 

y i(t)  =  [A  +  m(r)  -F  nc(/)]  cos  o)ct  +  ns(f)  sin  a)ct  (10.12a) 

The  desired  signal  at  the  demodulator  input  is  [A  +  m(f)]  cos  coct.  Hence,  the  signal  powerS, 
is  [based  on  analysis  similar  to  what  was  used  to  derive  Eq.  (9.22c)] 

c  [A  -F  m(f)]2  A2  +  m2 

' =  2  =  2 

To  find  the  envelope  of  y ,•(/),  we  rewrite  Eq.  ( 10. 12a)  in  polar  form  as 

y,(f)  =  E,(r)  cos  [o)ct  +  ©,(r)]  (10.12b) 

where  the  envelope  E ,-(r)  is  [see  Eq.  (3.42a)] 

E,(0  =  y/[A  +  m  (r)  +  nf(r)]2  +  n2(r)  (10.12c) 

The  envelope  detector  output  is  E,(r)  [Eq.  (10.12c)].  We  shall  consider  two  extreme  cases, 
small  noise  and  large  noise. 

1.  Small-Noise  Case:  If  [A  +  m(r)]  »  n(f)  for  almost  all  /,  then  [A  +  m(/)l  »  MO  and 
n.v(f)  for  almost  all  t*  In  this  case  E,(r)  in  Eq.  (10.12c)  can  be  approximated  by 

E,(0  —  A- F  m(/)  +  nf(r) 

The  dc  component  A  of  the  envelope  detector  output  E,  is  blocked  by  a  capacitor,  yielding 
m(r)  as  the  useful  signal  and  nc(r)  as  the  noise.  Hence, 

S0  =  m2 


*  Here  we  use  the  term  "almost  all  t"  because  nf(f)  and  n5(r)  are  both  Gaussian  (amplitude  range  -oo  to  oo),  and  'n 
some  instances  nf  (t)  or  nv(f)  or  both  will  exceed  A  -I-  m(r).  no  matter  how  large  A  +  m(f)  is.  For  large  signals, 
however,  this  occurs  only  over  relatively  short  time  intervals. 
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and  from  Eq.  (9.72b), 


Na  =  n2(f)  =  1NB 


and 


S0  m 


N0  2 MB 


m2  5, 
A2  +  m2  -N  B 


m 


(10.13) 


which  is  identical  to  the  result  for  AM  with  synchronous  demodulation  [Eq.  ( 1 0. 1 0)].  Therefore 
for  AM,  when  the  noise  is  small  in  comparison  to  the  signal,  the  performance  of  the  envelope 
detector  is  identical  to  that  of  the  synchronous  detector. 

2.  Large-Noise  Case:  In  this  case  n(r)  »  [A+m(f)].  Hence,  nc(t)  and  n,(r)  »  [A+m(r)J 
for  almost  all  t.  Under  this  condition  Eq.  (10.12c)  becomes 


E/(r)  2  y/n2(t)  +  n2(r)  +  2nr(r)[A  +  m(f)] 


where  En(U  and  0„(f),  the  envelope  and  the  phase  of  the  noise  n(r),  are  [see  Eqs.  (3.42)  and 
Fig.  10.4a,  later,  in  Example  10.2]. 


En(r)  =  yfr&t)  +  n?(f) 


(10.14a) 


(10.14b) 


nc(r)  =En(t)cos  @n(0 
ns(r)  =  En(r) sin  ©n(0 


(10.14c) 
( 1 0. 1 4d ) 


Because  E.(»)  »  A  +  m(r).  &«>  may  1*  W™"  as 


=  En(0  +  [A  +  m(f)]  cos  0„(f) 


(10.15) 
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Figure  10.4 

Performance  of 
AM  (synchronous 
detection  and 
envelope 
detection). 

(a) 


Synchronous  detection  (AM) 


A  glance  at  Eq.  ( 10. 15)  shows  that  the  output  contains  no  term  proportional  to  rn(t).  The  signal 
m(/)cos  ©n(0  represents  m(r)  multiplied  by  a  time-varying  function  (actually  a  noise  sig¬ 
nal)  cos  0n(/)  and,  hence,  is  of  no  use  in  recovering  m (/).  In  all  previous  cases,  the  output 
signal  contained  a  term  of  the  form  a  •  m (r),  where  a  was  constant.  Furthermore,  the  output 
noise  was  additive  (even  for  envelope  detection  with  small  noise).  In  Eq.  (10.15),  the  noise 
is  multiplicative.  In  this  situation  the  useful  signal  is  badly  mutilated.  This  is  the  threshold 
phenomenon,  where  the  signal  quality  at  the  output  undergoes  disproportionately  rapid  deten- 
oration  when  the  input  noise  increases  beyond  a  certain  level  (i.e.,  when  y  drops  below  a  certain 
value.) 

Calculation  of  the  SNR  for  the  intermediate  (moderate  noise)  case  is  quite  complex.1  Here 
we  shall  state  the  final  results  only: 


5  _ 

t—  —  0.916A2m2  y2 

N0 

Figure  10.4b.  which  plots  S(,/N0  as  a  function  of  y  for  AM  with  synchronous  detection 
and  AM  with  envelope  detection,  clearly  illustrates  the  threshold  effect.  The  threshold  occurs 
when  y  is  on  the  order  of  10  or  less.  For  an  AM  signal  of  reasonable  quality,  y  should  be  on  the 
order  ol  1000  (30  dB),  and  the  threshold  is  rarely  a  limiting  condition  in  practical  cases.  Thr 
result  does  not  mean,  however,  that  AM  is  an  inferior  system.  In  practice,  DSB-SC,  SSB-SC  ■ 
and  QAM  modulations  all  require  coherent  detection  or  more  complex  receivers.  On  the  other 
hand,  the  simple  noncoherent  detection  of  AM  signals  can  be  a  huge  advantage  when  recei 
cost  is  a  key  concern. 
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Example  1  0.2  Find  ythresin  the  value  of  y  at  the  threshold,  in  tone-modulated  AM  with  n  —  1  it  the  onset  of 
the  threshold  is  when  En  >  A  with  probability  0.01.  E„  being  the  noise  envelope. 


Because  En  =  y/n2  4-  n2,  where  both  n<  and  nv  are  Gaussian  with  variance  o~ .  according 
to  Eq.  (8.58a)  En  has  a  Rayleigh  PDF  with  variance  <t2,  and 


^(En  >A) 


=  =  0.01 


Hence,  at  the  onset  of  the  threshold. 


A2 

— -  =  4.605 

-CTn 


The  variance  <x2  of  the  bandpass  noise  of  PSD  M /2  and  the  bandwidth  2  B  is  2MB.  Hence, 
at  the  onset  of  the  threshold. 


A2 

4 MB 


4.605 


For  tone  modulation. 


m(r)  =  nA  cos  (u)mt  +  <t>) 

=  A  cos (coml  +  (f>)  fi—\ 


and 


Hence, 


A2  +  m2  A~  +  0-5,4 ~  _  3A~ 

2  =  2  “  4 


/thresh  — 


=  3A1  =  3(4.605)  =  13.8 
MB  4 MB 


(or  12.4  dB) 


Optimum  Preemphasis-Deemphasis  in  A\1  Systems 

Because  the  channel  noise  in  a  DSB  system  is  in  the  band/c  ±  B.  the  PDE  should  be  carried 
!ut  in  this  band.  This  means  PDE  filters  Hp(f)  and  H4(f)  are  bandpass  filters  located  as  shown 

p-  ins  The  ontimization  is  localized  to  the  subsystem  shown  in  the  dashed  box.  This 
in  Fig  10.5.  P  jn  p.o  9  ,8  Hence  Hp{f)  and  Hj(f)  can  be  obtained  from 

JSS  ei  (9  60a.  b».  However,  because  .he  signal  PSD  a,  the  input  of  the  subsystem 
Chapter  y,  tq.  I  5  (f  _/)],  we  should  use  this  PSD  in  place  of  Sm(f)  in 

F'g  aSmentapplies  .o  SSB  and  VSB  systems.  In  these  cases.  HJf,  and 

^  (O  can  be  computed  from  Eq.  (9.60a.  b)  b,  replacing  5.(0  with  the  appropriate  SSB  o, 

VSB  PSD. 


wmmm 
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Figure  10.5 

Optimum  PDE  in 
an  amplitude- 
modulated 
system. 


Figure  10.7 

Phasor 

representation  of 
signals  in  an 
angle-modulated 
system. 


Figure  10.6 

Angle-modulated 

system. 


n(f) 


Transmitter  Channel 


Receiver 


10.3  ANGLE-MODULATED  SYSTEMS 

Figure  10.6  is  a  block  diagram  of  an  angle-modulated  system.  The  angle-modulated  (or 
exponentially  modulated)  carrier  <pEM(t)  can  be  written  as 

<Pem(0  =  a  cos[<tfcT+  lK*)l  (10.16a) 

where 

=  kpm(t)  for  PM  (10.16b) 

=  kf  f  m (a)  da  for  FM  (10.16c) 

J—OO 

and  m(r)  is  the  message  signal.  The  channel  noise  n(f)  at  the  demodulator  input  is  a  bandpass 
noise  with  PSD  Sn{f)  and  bandwidth  2(A f  +  B).  The  noise  n(f)  can  be  expressed  in  terms  of 
quadrature  components  as 

n(r)  =  n c(t)  cos  coct  +  n s(t)  sin  coct  (10.17a) 

where  nc(r)  and  n s(t)  are  low-pass  signals  of  bandwidth  A f  +  B.  The  bandpass  noise  n(f) 
may  also  be  expressed  in  terms  of  the  envelope  En(r)  and  phase  @„(r)  as  [see  Fig.  10.4a  and 
Eqs.  (10.14)] 

n(r)  =  En(r)cos  [wct  +  ©„(0]  (10.17b) 

Angle  modulation  (and  particularly  wideband  angle  modulation)  is  a  nonlinear  type  of  mod¬ 
ulation.  Hence,  superposition  does  not  apply.  In  AM,  the  signal  output  can  be  calculated  by 
assuming  the  channel  noise  to  be  zero,  and  the  noise  output  can  be  calculated  by  assum 
ing  the  modulating  signal  to  be  zero.  This  is  a  consequence  of  linearity.  Hence  in  amplitude 
modulations,  the  signal  and  noise  do  not  form  intermodulation  components.  Unfortunately- 
exponential  (or  angle)  modulation  is  nonlinear,  and  we  cannot  use  superposition  to  calculate 
the  output,  as  can  be  done  in  AM.  We  shall  show  that  because  of  special  circumstances,  how 
ever,  even  in  angle  modulation  the  noise  output  can  be  calculated  by  assuming  the  modulatiUc 
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Figure  10.7 

Phasor 

representation  of 
signals  in  an 
angle-modulated 
system. 


signal  to  be  zero.  To  prove  this  we  shall  first  consider  the  case  of  PM  and  then  extend  those 
results  to  FM. 

Phase  Modulation 

Because  narrowband  modulation  is  approximately  linear,  we  need  to  consider  only  wide¬ 
band  modulation.  The  crux  of  the  argument  is  that  for  wideband  modulation,  the  signal  m(f) 
changes  very  slowly  relative  to  the  noise  n(r).  The  modulating  signal  bandwidth  is  B.  and 
the  noise  bandwidth  is  2(A/  +  B),  with  A/  »  B.  Hence,  the  phase  and  frequency  varia¬ 
tions  of  the  modulated  carrier  are  much  slower  than  the  variations  of  n (/).  The  modulated 
carrier  appears  to  have  constant  frequency  and  phase  over  several  cycles,  and,  hence,  the 
carrier  appears  to  be  unmodulated.  We  may  therefore  calculate  the  output  noise  by  assuming 
m(/)  to  be  zero  (or  a  constant).  This  is  a  qualitative  justification  for  the  linearity  argument. 
A  quantitative  justification  is  given  in  the  following  development. 

To  calculate  the  signal  and  noise  powers  at  the  output,  we  shall  first  construct  a  phasor 
diagram  of  the  signal  y,(f)  at  the  demodulator  input,  as  shown  in  Fig.  10.7, 

y  ,-(t)  =  A  cos  [o)ct  +  +  n(/) 

=  A  cos  [o)ct  +  iKO]  +  En(') cos  +  0n(O) 

=  R(/)  cosUM  +  f(t)  +  AiHO]  (10.18) 


where 

x//(t)  =  kpm(t)  forPM  (10.19) 


For  the  small-noise  case,  where  E.(0  «  A  for  -almost  all  f,”  Af(t)  «  n/2  for  “almost  all 
t”  and 

a  *(o  ~  sin  [0.W  -  n 01  <  10-2°) 


The  demodulator  detects  the  phase  of  the  input  y,(/).  Hence,  the  demodulator  output  is 


y  a(t)  =  W)  +  W) 

=  *pm(O  +  pp«n[0„(O-^(O] 


(10.21a) 

(10.21b) 


,  .  a, i,i t\  involves  the  signal  iH/)  due  to  the  nonlinear  nature  of  angle 

vLs  much  ,„„e  s,ow,y  ,han  B„(,)  ^ 
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noise),  we  can  approximate  \fr(t)  by  a  constant  if , 

E  ( t ) 

At/r(f)  —  ML—  sin  [0„W  -  if ] 

A 

_  E„(/)  0  ^ cos  ^ - !]i_)  cos  ©n(r)  sin  \f/ 

A  A 

MO  ,  Mf)  . 

= - cos  x/f - sin  xfr 

A  A 

Also,  because  n<(/)  and  n,(f)  are  uncorrelated  for  white  noise, 

cos2  xlr  sin2  \lf  „ 

s*+V)  =  -^r-Sns(f)  +  —2 -snc{f) 


M2 

A2 


[because  Sn,  (f)  =  S„M)\ 


(10.22) 


For  a  white  channel  noise  with  PSD  M  /2  [Eq.  (9.71 )],  the  resulting  demodulation  noise  is 


^  (O  — 


4  \f\<Af  +  B 
A 

0  otherwise 


(10.23a) 


Note  that  if  we  had  assumed  \ j/(t)  =  0  (zero  message  signal)  in  Eq.  (10.22),  we  would  have 
obtained  exactly  the  same  result.* 

The  demodulated  noise  bandwidth  is  A /  +  B.  But  because  the  useful  signal  bandwidth  is 
only  B ,  the  demodulator  output  is  passed  through  a  low-pass  filter  of  bandwidth  B  to  remove 
the  out-of-band  noise.  Hence,  the  PSD  of  the  low-pass-filtered  output  noise  is 


Sn0(f)  = 


71  lfl<* 
0  l f\>B 


(10.23b) 


leading  to  the  total  noise  power  of 

kr  ~D/M\  2 MB 

No  =  2b{m)  =  ^~ 

From  Eq.  (10.21b)  we  already  have  the  signal  component  power 


Sn  =  kj  m2 


(10.24a) 


(10.24b) 


Thus, 


?£.  -  tAk  \2  m2 

N0~{Akp)  2MB 


(10.24c) 


These  results  are  valid  for  small  noise,  and  they  apply  to  both  wideband  and  narrowband  PM 
We  also  have 


Y  = 


Si  A2/2  A2 


MB  MB  2MB 


1 

*  This  follows  from  the  fact  that  E„(t)sin  C-)n(f)  =  nj(f).  Hence.  A </<(/)  =  ns(t)/A,  and  S^(f)  -  Sns(f)/*~' 


figure  10.8 

system  as  a 
special  case  of 
PM  system. 
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and 


—  =  klm1  y  (10.25) 

N0  ’’ 

Also,  for  PM  [Eq.  (5.17a)], 

A f  —  kp/hp/2jT  trip  =  [  rh  ( / )  ]  max 

Hence, 

^  =  (2ttA/)2(^x  (10.26) 

N0  \mp  / 

Note  that  the  bandwidth  of  the  angle-modulated  waveform  is  about  2A /  (for  the  wideband 
case).  Thus,  the  output  SNR  increases  with  the  square  of  the  transmission  bandwidth:  that  is, 
the  output  SNR  increases  by  6  dB  for  each  doubling  of  the  transmission  bandwidth.  Remember, 
however,  that  this  result  is  valid  only  when  the  noise  power  is  much  weaker  than  the  signal 
power.  Hence,  the  output  SNR  cannot  be  increased  indefinitely  by  increasing  the  transmission 
bandwidth  because  this  also  increases  the  noise  power,  and  at  some  stage  the  small-noise 
assumption  used  in  the  analysis  is  violated.  When  the  noise  power  becomes  comparable  to  the 
carrier  power,  the  threshold  appears  as  explained  later,  and  a  further  increase  in  bandwidth 
actually  reduces  the  output  SNR  instead  of  increasing  it. 

Let  us  apply  Eq.  (10.26)  to  tone  modulation,  where  m(f)  =  or  cos  a>mt.  For  this  case 

m^  =  a2/2,  and  mp  =  |m(0lmax  =  aa)m  =  2cntfm.  Hence, 


Note  that  the  results  of  Eqs.  (10.24c),  (10.25),  (10.26).  and  (10.27)  are  valid  for  both  NBPM 
and  WBPM. 


Frequency  Modulation  ...  .  ,, 

Frequency  modulation  may  be  considered  as  a  special  case  of  phase  modulation,  where  the 
7,  .  .  ,  -  /-r  mf„u„  (Fie  10  8).  At  the  receiver,  we  can  demodulate  FM  with 

TpM  demodulator  followed  by  a  differentiator,  as  shown  in  Fig^  10.8.  The  PM  demodulator 
output  is  kf  fLx  m  (a)  da.  The  subsequent  differentiator  yields  the  output  kfm(t),  so  that 


S„  =  kl  m2 


(10.28) 


figure  10.8 

system  as  a 
fecial  case  of 
PM  system. 
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Figure  10.9 

PSD  of  output 
noise  in  FM 
receiver. 


Figure  10.10 

Performance  of 
FM  system. 


The  phase  demodulator  output  noise  will  be  identical  to  that  calculated  earlier,  with  PSD  Af /A2 
for  white  channel  noise.  In  FM  receiver,  this  noise  is  passed  through  an  ideal  differentiator 
whose  transfer  function  is  jlnf .  Hence,  the  PSD  Sno(f)  of  the  output  noise  is  \j2nf\~  times 
the  PSD  in  Eq.  (10.23b), 


Sn0(f) 


j^(27T f)2  \f\<B 
lO  \f\>B 


(10.29) 


The  PSD  of  the  output  noise  is  parabolic  (Fig.  10.9),  and  the  output  noise  power  is 


N0 


-rj(2nf)2df 


SttIVB3 


Hence,  the  output  SNR  is 


(10.30) 


(10.31) 


Because  A f  =  kfmp/2n , 


(10.32a) 


(10.32b) 


Recall  that  the  transmission  bandwidth  is  about  2A /.  Hence,  for  each  doubling  of  the  band¬ 
width,  the  output  SNR  increases  by  6  dB.  Just  as  in  the  case  of  PM,  the  output  SNR  does  not 
increase  indefinitely  because  threshold  appears  as  the  increased  bandwidth  makes  the  channe 
noise  power  comparable  to  the  carrier  power. 
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Figure  10.10 

Performance  of 
FM  system. 


For  tone  modulation,  m 2 /ml  =  0.5  and 


-  =  -P2y 

No  T 


(10.33) 


The  output  SNR  S0/N0  (in  decibels)  in  Eq.  (10.33)  is  plotted  in  Fig.  10.10  as  a  function  of  y 
(in  decibels)  for  various  values  of  fi.  The  dotted  portion  of  the  curves  indicates  the  threshold 
region  (to  be  discussedlater  in  this  section).  Although  the  curves  in  Fig.  10.10  are  valid  for 
tone  modulation  only  (m  2/m2p  =  0.5).  they  can  be  usedfor  any  other  modulating  source  signal 

m(f)  simply  by  shifting  them  vertically  by  a  factor  (mV^)/0.5  =  2m2/^. 

From  Eqs  ( 10  27)  and  (10.33),  we  observe  that  for  tone  modulation  FM  is  superior  to  PM 
bv  a  factor  of  3  This  does  not  mean  that  FM  is  superior  to  PM  for  other  modulating  signals 
as  well.  In  fact,  we  shall  see  that  PM  is  superior  to  FM  for  most  of  the  practical  signals.  From 
Eqs.  ( 10.26)  and  ( 10.32a),  it  can  be  seen  that 


(S„/N0) pm  _  (27rg)2w/- 
(S0/N0)m 


3  mj 


(10.34) 


U  -f  nrr  .  3a,2  pM  is  superior  to  FM.  If  the  PSD  of  m(r)  is  concentrated  at  lower 

frequencies,  low-frequency  components  predominate  in  m(r),  and  A,  is  small.  This  favors  PM. 

Therefore  in  general.  PM  is  superior  lo  FM  when  Sm(f)  .s  conceniraled  a,  lower  frequences 
Ind  FM  is  superior  to  PM  when  Sm(f)  is  conceniraled  at  higher  frequencies.  This  explains 
why  FM  is  superior  lo  PM  for  tone  modulation,  where  all  the  signal  power  is  concentrated  at 
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the  highest  frequency  in  the  band.  But  for  most  of  the  practical  signals,  the  signal  power  is 
concentrated  at  lower  frequencies,  and  PM  proves  superior  to  FM. 


Example  10.3  For  a  zero  mean  Gaussian  random  process  m (t)  as  the  baseband  signal,  and  assuming  white 


Gaussian  channel  noise,  determine  the  output  SNR  for  FM. 

For  a  Gaussian  m (r),  mp  =  oo.  But  because  the  probability  that  amplitude  m  lies  beyond 
3<rm  (|m|  >  3am)  is  about  0.0027,  one  may  consider  mp  to  be  3crm.*  Hence, 


From  Eq.  (10.32b)  it  follows  that 


Narrowband  Modulation:  The  equations  derived  thus  far  are  valid  for  both  narrowband 


and  wideband  modulation.  We  observed  in  Chapter  4  that  narrowband  exponential  modulation 
(NBEM)  is  approximately  linear  and  is  very  similar  to  AM.  In  fact,  the  output  SNRs  for  NBEM 


and  AM  are  similar.  To  see  this,  consider  the  cases  of  NBPM  [Eq.  (5. 1 1 )]  and  AM  [Eq.  (4.8a)], 


^>AM(0  =  A  cos  +  rn(r)  cos  coct 
^nbpm  0)  — A  cos  co^t  y4^m(f)  sin  co^t 


=  A  cos  coct  -  mi  (r)  sin  coct 


where  m\(t)  =  Akpm(t).  Both  <pAM  and  <£>nbpm  contain  a  carrier  and  a  DSB  term.  In  <PNB pm 
the  carrier  and  the  DSB  component  are  out  of  phase  by  n/2  rad,  whereas  in  <pAM  they  are  m 
phase.  But  the  phase  difference  of  n/2  rad  has  no  effect  on  the  power.  Thus,  m(D  in  <PAN i  1S 
analogous  to  irq  ( t )  in  <pNBPM . 

Now  let  us  compare  the  output  SNR  for  AM  and  NBPM.  For  AM  [Eq.  ( 10.13)], 


whereas  for  NBPM  with  n\\(t)  =  Akpm(t),  Eq.  (10.25)  can  be  expressed  as 


Note  that  for  NBPM,  we  require  that  \kpm(t)\  <£  1 ,  that  is,  mi  (t)/A  <£  1 .  Hence, 


A2  ^  A2  +  m? 


This  is  known  as  3o  loading.  In  the  literature,  4 a  loading  [mp  =  4<rm)]  is  also  used. 
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and 


No) 


V \  ml 


pm  A2  +  m? 


which  is  of  the  same  form  as  (S(>/N(>) am-  Hence,  NBPM  is  very  similar  to  AM.  Under  the 
best  possible  conditions,  however,  AM  outperforms  NBPM  because  for  AM,  we  need  satisfy 
only  the  condition  [A  +  m(r)]  >  0  [Eq.  (4.9a)),  which  implies  [-m(r)]max  <  A.  Thus  for 
tone  modulation,  the  modulation  index  for  AM  can  be  set  close  to  unity.  For  NBPM.  however, 
the  narrowband  condition  would  be  equivalent  to  requiring  //  «  1.  Hence,  although  AM  and 
NBPM  have  identical  performance  for  a  given  value  of  n ,  AM  has  the  edge  over  NBPM  from 
the  SNR  viewpoint. 

It  is  interesting  to  look  for  the  dividing  point  (in  terms  of  A /)  that  separates  narrowband 
and  wideband  FM.  We  may  consider  the  dividing  line  to  be  that  value  of  A f  for  which  the 
output  SNR  for  FM  given  in  Eq.  (10.32b)  is  equal  to  the  maximum  output  SNR  for  AM.  The 
maximum  SNR  for  AM  occurs  when  n  =  1 ,  or  when  A  =  mp.  Hence,  equating  Eq.  ( 1 0.32b) 
with  Eq.  ( 1 0. 1 3)  [with  A  =  mp ], 


or 


3  |_1  +  (m 2/m2) 


(10.35) 


Because  tn2/mj,  <  1  tor  practical  signals,  and 


or 


P  <  0.6 


(10.36a) 


This  gives  the  dividing  point  of 


A/  =  0.6  B 


(10.36b) 


Mean  Square  Bandwidth  and  Estimation  of 
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A  normalized  PSD,  by  definition,  has  unit  area.  Hence,  to  normalize  a  PSD.  we  divide  it  by  its 
area  so  that  the  resulting  function  has  a  unit  area. 

For  a  baseband  signal  m (f),  we  can  use  this  concept,  to  obtain  the  normalized  PSD 
Smtf)/  f -OQ  Sm(f)  df.  Because  the  normalized  PSD  is  symmetrical  about  the  vertical  axis 

( f  =  0),  it  has  a  zero  mean  (in  the  sense  of  the  PDF),  and  its  variance  Bfn  is 


Bl 


i  r°° 

/  f2Sm(f)df  Hz 

m2  J- oo 


(10.37a) 

(10.37b) 


Example  10.4  For  a  low-pass  signal  with  PSD  Sm(f)  =  Tl(f /2B)9  show  that  B2,  =B2/ 3. 

Because  Sm(f)  =  1  for  \f\  <  B ,  and  0  for  \f\  >  we  have  [Eqs.  (10.37)] 

W_f-sf2df  _B> 

B m  -B  ..  3 


I—B  df 


Example  1  0.5  For  a  Gaussian  PSD  Sm(f)  =  ke  O-nf^/la1  show  that 


From  Eqs.  (10.37), 

W_kf-O0f2‘-*n2f2/*’2df  a 2 
m  k  !Zoe~4n2f2/2a2  df 


We  shall  now  investigate  the  dependence  of  the  FM  wave  PSD  on  instantaneous  frequent) 
In  wideband  modulation,  an  instantaneous  frequency  f,  in  the  range  (/*,/+  40  &ives  rxsc[° 
power  spectral  components  (components  of  the  PSD)  in  the  range  (f,f  +  df)-  The  P°^e 
contribution  of  these  components  is  proportional  to  the  relative  time  that  f,  remains  in  *  b 
range.  If  pfi{f)  is  the  PDF  of  instantaneous  frequency  f„  then  pu(f)df  is  the  probability 
of  observing  f,  in  the  range  (f,f+  df).  This  probability  is  proportional  to  the  time  that  / 
remains  in  the  range  (f ,  / + df )  and,  hence,  is  proportional  to  the  power  contributed  by  spectr 


For  to  exist,  Sm(f)  must  approach  zero  at  a  rate  faster  than  1/(27t/)2  for  large  values  of /. 
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components  in  the  range  (f ,  /  -f  df).  If  Sfm(0  is  the  PSD  of  an  FM  wave,  then  the  preceding 
argument  implies*  that 


Sm(f)  =  k-p{i(f)  />  0  (10.38) 

where  k  is  a  constant  of  proportionality. 

To  find  the  mean  square  bandwidth  of  Sfm(0>  we  observe  from  Eq.  (10.38)  that  pa(f)  is 
precisely  Sfm(/*)  normalized  to  unit  area.  Hence,  the  mean  square  bandwidth  of  Sfm(0  is  the 
variance  of  f/,  because 


f,  =  ^  =  ^-[(oc  +  kfm(t)] 
2n  2  7i 

1 


f,  =  — [a>c  +  fc/m(f)] 


=  fc  [because  m(f)  =  0] 


and 


=  (f,  -fc)2 


In  other  words, 


For  phase  modulation, 


and 


bLm  _  _L  m2 

FM  4jr2  f 


f i  =  <Oc  +  kpm(t)] 

In  2.7t 


Because  the  PSD  of  m(f)  is  (2nf)2Sm(f)  =  4n2  f2Sm(f \ 

i  r°° 


(10.39) 


*  Because />,,(/)  =0fo7/  <  0,  Sm(f)  in  Eq.  (10.38)  is  the  unilateral  PSD.  This  means  Smtf)  =  0  for /  <  0  and 
is  twice  'h^alebr^des  J Eq  ( 10.38)  over  (0.  oo),  it  can  be  shown  that  k=A2/ 2. 


By  integrating  1 
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From  Eq.  (10.37), 

<*l  =  m2  Bm 

that  is, 

From  Eqs.  (10.39)  and  (10.40)  we  observe  that  the  bandwidth  of  the  FM  wave  is  independent  ot 
the  modulating  signal  spectrum,  whereas  the  bandwidth  of  the  PM  wave  is  strongly  influenced 
by  the  modulating  signal  spectrum. 

The  output  SNR  in  Eqs.  (10.25)  and  ( 10.3 1 )  can  now  be  expressed  in  terms  of  mean  square 
bandwidths.  For  PM, 

(10.41) 

No  Bi 

m 

and  for  FM, 

v  (10.42) 

N0  B2  K 

Quantities  fipM  and  SpM  are  the  variances  of  the  normalized  PSDs  of  the  PM  and  FM  waves, 
respectively.  As  seen  earlier,  the  actual  transmission  bandwidth  will  be  several  times  the  stan¬ 
dard  deviation  or  -  For  example,  when  the  modulated  signal  PSD  has  a 

Gaussian  form,*  99.74%  of  the  total  power  resides  within  3a  of  the  carrier  frequency.  Hence, 

the  bandwidth  in  this  case  may  be  taken  as  6a,  or  6  for  PM  anc*  6  (#fm)  *°r 

FM.  From  Eqs.  (10.41)  and  (10.42),  it  follows  that  in  PM  as  well  as  in  FM  the  output  SNR 
improves  by  6  dB  for  each  doubling  of  the  transmission  bandwidth.  It  should  be  stressed  that 
these  results  are  valid  only  for  small  noise. 


Which  Is  Superior:  PM  or  FM?  From  Eqs.  (10.41)  and  (10.42),  we  have 

(Sq/Nq) pm  __  /  B ~  \  /^pm\  (10.43) 

(V^)fm  ”  \3Bj^/ 

It  will  be  instructive  to  compare  the  performance  of  PM  and  FM  for  the  same  transmission 
bandwidth  (the  same  mean  square  bandwidth),  that  is,  for  Z?jsM  =  #pM.  This  gives 


(S0/N0) pm  _  B 2  (l  0.44a) 

(S0/ N0)  fm 


Thus,  if 


B  > 


3  B2 


(1 0.44b) 


*  It  tan  be  shown2  that  for  wideband  angle  modulation,  the  PSD  of  the  modulated  carrier  is  Gaussian  when  the 
modulating  random  process  is  Gaussian. 
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PM  is  superior  to  FM.  Otherwise,  FM  is  superior  to  PM.  We  showed  in  Example  10.4  that 
when  the  PSD  of  m(r)  is  constant  (over  a  band  B  Hz),  3 B2^  =  B2.  Hence.  PM  and  FM  perform 
equally  well  in  that  case.  If  the  spectrum  falls  off  with  frequency,  as  it  does  for  all  real  sig¬ 
nals,  B ^  is  less  than  B2/ 3,  and  PM  is  superior  to  FM.  If,  on  the  other  hand,  the  spectrum  is 
weighted  heavily  at  higher  frequencies,  B is  greater  than  B2/ 3,  and  FM  is  superior  to  PM. 
This  is  exactly  what  happens  for  tone  modulation,  where  the  spectrum  is  concentrated  at  the 
highest  frequency  B ,  with  no  power  at  lower  frequencies,  making  B2^  equal  to  B2  This  is  why 
for  tone  modulation  FM  is  superior  to  PM  by  a  factor  of  3.  Tone  modulation  proves  to  be 
grossly  misleading  in  the  SNR  analysis  of  angle  modulation.  For  practical  signals,  including 
audio,  the  spectrum  falls  off  with  frequency,  and  PM  is  superior  to  FM.  Actually,  so-called 
FM  broadcast  is  not  pure  FM  but  is  FM  modified  by  preemphasis-deemphasis,  as  discussed 
in  Sec.  5.5. 

We  have  derived  two  different  criteria  [Eqs.  (10.34)  and  (10.44a)]  for  comparing 
PM  and  FM  performance.  In  Eq.  (10.34),  the  output  SNRs  are  compared  for  a  given 
transmission  bandwidth,  whereas  in  Eq.  (10.44a).  the  output  SNRs  are  compared  for  a 
given  mean  square  transmission  bandwidth.  Consequently,  we  may  get  slightly  different 
answers  by  using  these  two  criteria  (see  Probs.  10.3-3  and  10.3-9).  Although  the  criterion 
in  Eq.  (10.34)  is  preferable  to  that  in  Eq.  (10.44a),  it  is  impossible  in  practice  to  deter¬ 
mine  the  parameters  required  in  Eq.  (10.34),  and  the  only  way  to  compare  may  be  through 
Eq.  (10.44a). 


Example  1  0.6  If  a  baseband  signal  m(r)  has  a  Gaussian  PSD,  show  that  PM  is  superior  to  FM  by  a  factor  of 
3  (4.77  dB)  when  the  bandwidth  is  taken  as  3<r  rad/s,  where  a  is  the  standard  deviation  of  the 

normalized  PSD  of  m(/). 

Sm(r)  is  Gaussian  of  the  form  ke-(2nf)ll2a' ■  The  radian  bandwidth  W  =  2nB  =  3<r,  and 
B  =  3ct/27T.  as  seen  in  Example  10.5, 

W=^  and  B=^~  (10.45) 

m  4n2  2rr 

Hence,  from  (10.44),  it  follows  that  PM  is  superior  to  FM  by  a  factor  of  3. 


Threshold  in  Angle  Modulation  .  ,  .  n  ,. 

When  the  noise  power  at  the  demodulator  input  is  comparable  to  the  earner  (signal)  power,  the 
threshold  phenomenon  (discussed  earlier  for  AM  systems)  appears.  In  FM  this  effect  is  much 
more  pronounced  than  in  AM.  Let  us  discuss  qualitatively  how  the  threshold  effect  appears. 
We  refer  back  to  Fig.  10.7  and  observe  that  the  phasor  E„  rotates  from  the  terminal  of  the 
phasor  A  When  En  «  A,  the  angle  A is  quite  small,  and  because  ©„(/)  is  random  with 
uniform  distribution  in  the  range  (0,  2*),  A *(f)  assumes  positive  as  well  as  negative  values 
(Fig  10  11a),  which  are  usually  much  smaller  than  2n.  When  En  is  large  (on  the  order  of  A 
or  greater)  however,  the  resultant  phasor  is  much  more  likely  to  rotate  around  the  origin.  and 
aITs  more  likely  to  go  through  changes  of  2*  (Fig.  10.1 1c)  in  a  relatively  short  time,  because 
the  noise  varies  much  faster  than  the  modulating  s.gnal. 


A 
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Figure  10.1 1 

The  nature  of 
output  noise  in 
FM  receiver  for 
small  (a,  c)  and 
large  (b,  d) 
channel  noise. 


Figure  10.12 

Threshold 
phenomena 
in  FM. 


The  noise  at  the  FM  demodulator  output  is  given  by  Ai j/(t).*  This  is  shown  in  Fig.  10- 
for  large  and  small  noise.  For  large  noise,  we  observe  the  appearance  of  spikes  (of  area 
which  give  rise  to  a  crackling  sound.  When  the  noise  is  small  (En  <$CA),  the  PSD  ot  the  output 
noise  A  \j/(t)  is  parabolic,  and  most  of  its  power  is  in  the  frequencies  greater  than  B  an  ^ 
therefore  filtered  out  by  the  baseband  filter  at  the  output.  For  the  large-noise  case,  on 
other  hand,  we  have  the  presence  of  spikes,  which  are  like  impulses.  Consequently,  they  ha'e 
considerable  power  at  lower  frequencies.  Hence,  a  spike  will  contribute  much  more  noise  a 
output.  For  this  reason,  when  En  approaches  the  order  of  A,  the  output  noise  starts  increasing 
disproportionately  (Fig.  10.12).  This  is  precisely  the  threshold  phenomenon. 

It  has  been  shown3  that  the  noise  power  caused  by  the  spikes  Ns,  is 


Nx  = 


8 


(10.46a) 


Example 


*  Recall  that  FM  demodulation  requires  differentiation  of  the  angle  because  the  instantaneous  frequency  is 
proportional  to  the  derivative  of  the  angle. 


1 0.3  Angle-Modulated  Systems  587 

and  the  total  noise  power  Nt ,  is  the  sum  of  N0  in  Eq.  (10.30)  and  Ns  in  Eq.  (10.46a).  The 
output  SNR  is 


So  _ _ 302y(m2/m2) _ 

N0  +  Ns  ~  1  +  (2v/3Bfm/B)  YQ  [V(2B/Bfm)/] 
_ _ 302y(m2/m2) _ 

~  1  +  4-\/3(0  +  \)yQWy/(P  +  1)J 


(10.46b) 


The  onset  of  the  threshold  is  when  the  carrier  power  is  10  times  the  channel  noise  power.  The 
carrier  power  is  A2/ 2,  and  for  white  noise  with  a  PSD  A/*/2,  the  noise  power  is  A^fm.  where 
the  bandwidth  of  an  FM  carrier,  is 


Z?fm  =  2(A/  4-  B)  =  2B(P  -f  1) 


Hence,  the  threshold  occurs  when 


r  1  Al 

ZVBtf  +  l)=  —  — 


or 


20iP  +  l)  =  WE 

By  definition,  the  right-hand  side  of  this  equation  is  y  at  threshold.  Thus,  ythresh.  the  value  of 
y  at  the  onset  of  threshold,  is 


/thresh  —  20(^  +  1 ) 


(10.47) 


Example  10.7  a  Gaussian  m(r)  with  4a  loading  (i.e.,  mp  =  4<rm)  uses  p  =  4  to  frequency-modulate  a  carrier. 

The  output  SNR  is  found  to  be  20.5  dB.  Determine  whether  the  system  is  in  threshold. 

For  ft  =  4,  is 

/thresh  =  20(0  +  1)  =  20(5)  =  100 


For  this  value  of  /  [Eq.  (10.32b)], 


(^)  =  3(,6,no°)(l)  = 


300 


Because  20.5  dB  is  a  ratio  of  160,  the  SNR  is  below  300,  and  the  system  is  in  threshold. 


Threshold  Extension  in  Angle  Modulation 

The  threshold  problem  is  rather  serious  in  angle  modulatton.  The  ab.l.ty  to  communicate  even 
at  low  power  levels  simply  by  increasing  the  transmission  bandwidth  is  the  raison  d  etre  of 
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Figure  10.13 

Frequency 
compression 
feedback  (FCF). 


angle-modulated  systems,  and  threshold  deprives  angle  modulation  of  its  very  essence.  For  this 
reason,  much  attention  has  been  paid  to  the  problem  of  pushing  the  threshold  level  further  down 
and  extending  the  useful  operating  range.  The  two  principal  methods  used  for  this  purpose  rely 
on  frequency  compression  feedback  (FCF)  and  the  phase-locked  loop  (PLL).  We  shall  look 
briefly  into  these  techniques. 

Frequency  Compression  Feedback  (FCF):  The  basic  circuit  is  shown  in  Fig.  10  l-1 
Let  us  consider  a  frequency-modulated  wave  A  cos  [ioct  +  i/(t)\  at  the  input.  The  quiescent 
frequency  of  the  VCO  is  a>c  -  w0.  The  instantaneous  frequency  co,  of  the  VCO  is  given  by 

(Dj  =  (coc  —  u>o)  +  ae(t) 


and  the  VCO  output  e„(f)  is  given  by*  (assuming  initial  instant  t  =  0), 

e„(f)  =  2cos  [(<yc  —  (o0)t  +  a  J  e(r)r/rj  (10.48) 


The  bandpass  filter  H„(f)  is  centered  at /„.  The  multiplier  output  em(f)  has  a  spectrum centere 
at  the  sum  and  difference  frequencies.  The  sum  frequencies  are  centered  at  2/<-  —f<>  an(* art" 
suppressed  by  the  bandpass  filter  H0{f),  which  allows  only  the  difference  frequencies  centere 
at  f0  to  pass.  The  output  zp(l)  of  the  bandpass  filter  is 


ep(t)  =  A  cos  +  r//(t)  —  a  J  e(r)drj 


(10.49) 


The  limiter-discriminator  frequency  demodulates  the  signal  ep(t)  by  differentiating 
Therefore,  the  output  e(/)  is  given  by 


its  angle- 


e(f)  =  ^  ^(Kf)  ~aJ  e(T)^TJ 


=  x(/(t)-  ae(r) 


Hence, 


e(f)  = 


'fr(t) 


1  +a 


Substitution  of  this  value  of  e(/)  into  Eq.  (10.49)  yields 

T  "1 

tp(t)  =  A  cos  co„t  +  - - 

L  1  +«J 


(10.50) 


'  The  amplitude  of  e„(/)  is  immaterial  in  our  discussion.  For  convenience,  it  is  considered  to  be  2. 
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This  interesting  result  shows  that  the  signal  ep(t)  is  another  frequency-modulated  signal  with 
carrier  frequency  co()  and  is  similar  to  the  incoming  carrier  A  cos  [ coct  +  ].  The  difference 

is  that  the  angle  is  x//(t)/ (1  +  a)  instead  of  \ {/(t).  This  implies  a  reduction  in  the  frequency 
deviation  and,  consequently,  a  reduction  in  the  bandwidth  of  the  modulated  signal  by  a  factor* 
of  (1  +a).  Hence,  fora  wideband  case,  the  bandwidth  of  H()(f)  need  only  be  about  2  A/ /(I  Tor). 
The  second  conclusion  is  that  when  ep(t)  is  applied  to  a  limiter-discriminator,  the  output  e(f) 
is  \j/(t)/( 1  +  a).  Hence,  the  FCF  demodulator  indeed  frequency-demodulates  the  incoming 
frequency-modulated  carrier. 

Let  us  now  see  what  happens  when  the  input  signal  is  a  frequency-modulated  carrier  plus 
bandpass  channel  noise.  The  signal  plus  noise  can  be  expressed  as  R(/)  cos  [coct  +  4r(f)  + 
A\l/(t)]  [Eq.  (10.18)].  The  amplitude  variations  R (/)  are  eventually  eliminated  in  the  limiter- 
discriminator  (see  Sec.  5.3),  and 


Cn(f  ) 


=  A  cos  | \oct  + 


^r(r)  -f  A \l/(t)' 
1  +  a 


(10.51) 


Similarly, 


e(f)  =  4*  A }jr(t)] 

1  +  a 


where  ^  is  the  useful  signal  and  At/r  is  the  noise.  The  output  of  the  FCF  demodulator  is  identical 
to  that  of  the  conventional  demodulator  except  for  the  multiplicative  factor  I/C  1  +  a).  Hence, 
the  output  SNR  of  the  FCF  demodulator  is  identical  to  that  of  the  conventional  demodulator. 
The  advantage  is  gained,  however,  in  extending  the  threshold.  This  can  be  seen  from  the  tact 
that  the  FM  wave  passes  unmolested  through  the  filter  H„{f)  [Eq.  ( 1 0.5 1 )].  The  channel  noise, 
however  which  had  hitherto  a  bandwidth  of  about  2  A/,  has  to  pass  through  H„(f)  having 
I  /  ( 1+  a)  times  its  former  bandwidth.  Hence,  the  ratio  of  carrier  power  to  noise  at  the  input  of 
the  limiter-discriminator  is  enhanced  by  a  factor  of  1  +  a.  As  seen  earlier,  this  does  not  affect 
the  SNR  but  it  does  extend  the  threshold  because  of  the  relative  enhancement  of  the  carrier 
power  in  relation  to  noise  power.  It  can  be  seen  that  the  carrier  power  can  now  afford  to  be 
reduced  by  a  factor  of  1  +  a  before  the  onset  of  the  threshold.  In  other  words,  the  threshold 
is  extended  roughly  by  10  log  a  dB.  Because  of  practical  problems,  however,  this  benefit  is 
not  completely  realized.  In  practical  FCF  demodulators,  threshold  extension  of  about  5  to  7 

dB  can  be  realized. 


Phase-I  ocked  Loop  (PLL):  The  functioning  of  the  PLL  was  discussed  in  Chapter  4.  For 
the  small-noise  case,  the  PLL  performance  is  identical  to  that  of  a  conventional  demodulator 
(iList  as  in  FCF)  For  large  noise,  however,  the  PLL  extends  the  threshold  just  as  the  FCF 
does,  by  reducing  the  filter  bandwidth.  Detailed  analysis  is  beyond  our  scope.  A  practical  PLL 
extends  the  threshold  region  by  about  3  to  6  dB. 


Ontimum  Preemphasis-Deemphasis  in  Angle  Modulation 

Ba^ed  on  the  analysis  of  baseband  preemphasis-deemphasis  (PDE)  filters  in  Chapter  9.  we 
fhall  consider  the  case  of  baseband  PDE  in  angle  modulation,  that  is,  preemphasis  before 
uLion  and  deemphasis  after  demodulation  (Fig.  10.14a).  The  channel  ,s  assumed  ,o  be 

ideal,  that  is,  Hc(f )  = 


*  To  avoid  distortion  of  the  modulating 


signal,  the  bandwidth  (A/  -I-  B)/(  1  +  or)  must  be  greater  than  B. 
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Figure  10.14 

(a)  Optimum  PDE 
in  FM  system. 

(b)  Equivalent 
of  system  in 
part  (a). 


(b) 


The  problem  here  is  very  different  from  the  baseband  communication  case.  In  the  base¬ 
band  system,  the  preemphasis  filter  changes  the  transmitted  power  but  not  its  bandwidth,  and 
optimization  was  performed  under  the  constraint  of  a  fixed  transmitted  power  (because  band¬ 
width  remained  fixed,  it  was  of  no  concern).  In  FM,  on  the  other  hand,  the  transmitted  power 
is  always  fixed  (A2/ 2),  but  the  transmission  bandwidth  is  modified  by  the  preemphasis  filter. 
But  optimization  must  be  performed  under  the  constraint  of  a  fixed  transmission  bandwidth. 
Which  bandwidth  should  be  used?  The  conventional  (2A/)  or  the  mean  square?  A  look  at 
Eq.  (10.32a)  shows  that  the  SNR  in  terms  of  conventional  bandwidth  involves  the  constant 
mp.  When  m(/)  is  passed  through  Hp(f ),  mp  will  change,  and  no  simple  relationship  exists 
between  m (f),  Hp(f ),  and  the  new  mp.  The  problem  is  mathematically  intractable.  No  such 
difficulty  appears  when  we  use  Eq.  (10.42),  which  uses  the  mean  square  bandwidth.  Hence, 
we  shall  optimize  the  ratio  S0/N0  with  the  constraint  of  a  fixed  mean  square  bandwidth  of 
transmission.  Fortunately,  this  constraint  turns  out  to  be  similar  to  that  used  in  the  baseband 
system.  This  can  be  seen  from  Eq.  (10.39): 


1  , 2  2 


B2.  =  - k 2  m2 

dfm  ■  ->*•' 


Thus  to  maintain  fixed,  m2,  the  power  of  the  modulating  signal,  must  be  the  same  with 
or  without  preemphasis.  This  is  exactly  the  constraint  of  the  baseband  system  [Eq.  (9.54a) 
(Chapter  9)].  If  the  input  to  the  system  in  the  dashed  box  (Fig.  10.14a)  is  x(t),  its  output 
is  kfx(t)  plus  the  parabolic  noise  n'(/)  with  the  PSD  in  Eq.  (10.29).  Hence,  the  system  in 
Fig.  10.14b  is  the  equivalent  of  the  system  in  Fig.  10.14a.  Our  problem  is  the  optimization  o 
the  output  SNR  with  the  constraint  that  the  output  of  Hp(f)  has  a  fixed  mean  square  value. 
This  problem  is  identical  to  the  optimization  of  the  baseband  system  with  G  =  Hc(f)  =  */• 
Hence,  from  Eqs.  (9.60)  with  Sn(f)  =  fif  (2nf)2/A2  and  St  =  m2,  we  have 


\HP(f)\2op{  = 
\Hj(f)\2opt  = 


2 nf  m2 

s/Sjf)  fZo  J(27Tf)2Sm(f)  If 

s/S^H  JZ o  W)25m(/~)  df 

2j xf  ■  m2 


(10.52a) 


(10.52b) 


(10.52c) 


10.4  PU 
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Because  y  =  A1  /2J\fB, 


(10.52d) 


Comparison  of  Eq.  (10.52d)  with  Eq.  (10.42)  shows  that  PDE  in  FM  improves  the  SNR 
by  the  factor  inside  the  square  brackets  on  the  right-hand  side  of  Eq.  (10.52d).  Optimum 
preemphasis  and  deemphasis  are  not  used  in  commercial  FM  broadcasting  tor  historical  and 
practical  reasons.  The  relatively  simple  suboptimum  scheme,  discussed  in  Sec.  5.5,  is  used 
instead. 


10.4  PULSE-MODULATED  SYSTEMS 

Among  pulse-modulated  systems  (PAM.  PWM.  PPM.  and  PCM),  only  PCM  is  of  practical 
importance.  The  other  systems  are  rarely  used  in  practice  for  communication  purposes.  For 
this  reason  we  shall  discuss  only  PCM  in  detail.  It  can  be  shown  that  the  performance  of  PAM 
is  similar  to  that  of  AM-SC  systems  (i.e.,  the  output  SNR  is  equal  to  y).  The  PWM  and  PPM 
systems  are  capable  of  exchanging  the  transmission  bandwidth  with  output  SNR.  as  in  angle- 
modulated  systems.  In  PWM.  the  output  SNR  is  proportional  to  the  transmission  bandwidth 
Bj- This  performance  is  clearly  inferior  to  that  of  angle-modulated  systems,  where  the  output 
SNR  increases  as  Bf.  In  PPM  systems  under  optimum  conditions,  the  output  SNR  increases 
as  Bf  but  is  still  inferior  to  FM  by  a  factor  of  6.  For  in-depth  treatment  of  PAM,  PWM.  and 

PPM^  the  reader  is  referred  to  Panter1  or  Rowe.5 

Another  pulse  modulation  system  that  deserves  mention  is  the  delta  modulation  (DM) 
system  discussed  in  Chapter  6.  For  speech  signals,  this  system’s  performance  is  comparable  to 
that  of  PCM  for  a  bandwidth  expansion  ratio  BT/B  of  7  to  8.  For  BT/B  >  8,  PCM  is  superior 
to  DM,  and  for  BT/B  <  8,  DM  is  superior  to  PCM. 


Pulse  Code  Modulation  ...  ,  .  ,,  f 

In  PCM  a  baseband  signal  m(0  band-limited  to  B  Hz  and  with  amplitudes  in  the  range  of 
_m  to  m  is  sampled  at  a  rate  of  2 B  samples  per  second.  The  sample  amplitudes  are  quantized 
into  L  levels,  which  are  uniformly  separated  by  2 mp/L.  Each  quantized  sample  is  encoded  into 
,.  binarv  dieits  (2"  =  L).  The  binary  signal  is  transmitted  over  a  channel.  The  receiver  detects 
the  binary  signal  and  reconstructs  quantized  samples  (decoding).  The  quantized  samples  are 
then  Dassed  through  a  low-pass  filter  to  obtain  the  desired  signal  m(/). 

Xhere  are  two  sources  of  error  in  PCM:  (1)  quantization  or  “rounding  off  error,  and 
(2)  detection  error.  The  latter  is  caused  by  error  in  the  detection  of  the  binary  signal  at  the 

receiver.  ed  to  be  a  wide-sense  stationary  random  process.  The  random 

'  hi  laT  )  formed  by  sample  function  amplitudes  at  /  =  kTs,  will  be  denoted  by  m* .  The 
variable  m(kTs),  y  auantjZed  to  a  value  m*,  which  is  encoded  and  transmitted 

of  ^  digits  may  ha  detected  erroneously 

M  the  reviver,  and  the  reconstructed  sample  will  be  m,  instead  of  m,  If  qt  and  e,  are  the 
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quantization  and  detection  errors,  respectively,  then 


q*  =  mt  -  m* 

€k  =  rru  -  rr»A- 


(10.53) 


and 

m*  -  m*  =  qk  +  ek  (10.54) 

Hence,  the  total  error  m*  —  m*  at  the  receiver  is  q*  +  e*.  The  receiver  reconstructs  the  signal 
m(r)  from  samples  rn*  according  to  the  interpolation  formula  in  Eq.  (6. 10), 

m(r)  =  ^  sine  (2nBt  —  kn) 
k 

=  ^[m*  -  (q*  +  e*)]  sine  (2nBt  -  kn) 

it 

=  ^  m*  sine  (2 nBt  —  kir)  —  ^^(qk  +  e*)  sine  (2tt Bt  —  kn) 
k  k 

=  m(f)  —  e(r)  (10.55a) 

where 

e(r)  =  ^(q*  +  *k)  sine  (2  nBt  —  kn)  (10.55b) 

k 

The  receiver  therefore  receives  the  signal  m(f)  —  e(f )  instead  of  m  ( / ) .  The  error  signal  e(/)  is  a 
random  process  with  the  klh  sample  q*  +  Because  the  process  is  wide-sense  stationary,  the 
mean  square  value  of  the  process  is  the  same  as  the  mean  square  value  at  any  instant.  Because 
q*  +  e*  is  the  value  of  e(/)  at  t  =  kTs , 

e2(>)  =  (q*  +  €k)2 

Even  though  quantization  error  is  not  truly  independent  of  the  signal  under  quantization,  tor  a 
sufficiently  large  number  of  quantization  levels,  the  independent  approximation  is  satisfactorily 
accurate.6  Now,  because  q*  and  €k  are  independent  with  zero  mean  RVs  (Examples  8.20, 8.21. 
and  8.22), 

e2(r)  =  q  l  +  ej 

We  have  already  derived  qj  and  in  Examples  8.20  and  8.2 1  [Eqs.  (8.68b)  and  (8.70)1-  Hence. 

^77  =  I  ("W  ,  Amlpe^2~  1)  (10  56a) 

3  V  L  )  3  L2 

where  Pe  is  the  detection  error  probability.  For  binary  coding,  each  sample  is  encoded  into 
binary  digits.  Hence,  2"  =  L,  and 

A-[l+4Pe(2*'-])] 


e2(r)  = 


(10.56b) 
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As  seen  from  Eq.  (10.55a),  the  output  m (t)  —  e(/)  contains  the  signal  m(/)  and  noise  e(f). 
Hence, 


Sa  =  m2  N0  =  e2(r) 


and 


So  =  3(22n)  /V\ 

~  1  +  4Pe(22n  —  1)  \m2/ 


(10.57) 


The  error  probability  Pe  depends  on  Ap,  the  peak  pulse  amplitude,  and  the  channel  noise  power 
<* *l  [Eq.  (8.42)],* 


Pe  =  Q 


It  will  be  shown  in  Sec.  11.1  that  p  =  Ap/an  can  be  maximized  (i.e.,  Pe  can  be  minimized)  by 
passing  the  incoming  digital  signal  through  an  optimum  filter  (known  as  the  matched  filter).  It 
will  be  shown  that  for  polar  signaling  [Eqs.  ( 1 1 . 1  la)  and  ( 1 1 . 1 3)], 


V)  = 

an  /  max 


and 


(Pe)  min  =  Q 


(10.58) 


where  E„  is  the  energy  of  the  received  binary  pulse  and  the  channel  noise  is  assumed  to  be 
white  with  PSD  N /2.  Because  there  are  n  binary  pulses  per  sample  and  2 B  samples  per  second, 
there  are  a  total  of  2 Bn  pulses  per  second.  Hence,  the  received  signal  power  S,  =  2 BnEp,  and 


Pe  =Q 


and 


So 

No 


3(22") 


1  -f-  4(22”  -  1) Q{VyJ 


s(?) 


(10.59) 


(10.60) 


Figure  10.15  plots  the  output  SNR  as  a  function  of  y  foiMone  modulation  (m -/m2p  -  0.5). 

xJ-  1  .  mn  also  be  used  for  a  general  case,  with  m 2 /nr  =  A.,  where  the  SNR  is 
va5p=  in  Fig.  10.15.  Hence,  we  simply  shift  .he  curves  in  Fig.  10.15  hy 

10  log  (2A)  dB. 


i  •  i  „o  Rinolar  (or  pseudolemary )  signaling  requires  about  3  dB  more  power  than  polar  to 

*  This  assumes  polar  signal' ng.  B  PO  ‘  P  rather  (han  |ar  signa|ing  is  used  for  PCM. 

achieve  the  same  Pe  (see  Sec.  7.2).  in  practice,  r 


J 


I 


594  PERFORMANCE  ANALYSIS  OF  MODULATED  COMMUNICATION  SYSTEMS  UNDER  NOISE 


Figure  10.15 

Performance 
of  PCM. 


Example  10. 


Figure  10.15  shows  two  interesting  features:  the  threshold  and  the  saturation.  When  y 
is  too  small,  a  large  pulse  detection  error  results,  and  the  decoded  pulse  sequence  yields  a 
sample  value  that  has  no  relation  to  the  actual  sample  transmitted.  The  received  signal  is  thus 
meaningless,  and  we  have  the  threshold  phenomenon.  To  explain  saturation,  we  observe  that 
when  y  is  sufficiently  large  (implying  sufficiently  large  pulse  amplitude),  the  detection  error 
Pe  — ►  0,  and  Eq.  (10.57)  becomes 


(10.61a) 

(10.61b) 


The  SNR  in  this  case  is  practically  independent  of  y.  Because  the  detection  error  approaches 
zero,  the  output  noise  now  consists  entirely  of  the  quantization  noise,  which  depends  only 
on  L.  Because  the  pulse  amplitude  is  so  large  that  there  is  very  little  probability  of  making  a 
detection  error,  a  further  increase  in  y  by  increasing  the  pulse  amplitude  buys  no  advantage, 
and  we  have  the  saturation  effect. 

In  the  saturation  region. 


log  3  +  2n  log  2  +  log 


=  a  +  6/7 


(10.61c) 


where  a  =4.77+  101og10(m2//n;;). 
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Example  1  0.8  For  PCM  with  n  =  8,  determine  the  output  SNR  for  a  Gaussian  m(/).  Assume  the  saturation 
region  of  operation. 


For  a  Gaussian  signal,  mp  =  oo.  In  practice,  however,  we  may  clip  amplitudes  >  3am  or 
4am,  depending  on  the  accuracy  desired.  For  example,  in  the  case  of  3cr  loading, 

P(|m|  >  3am)  =  20(3)  =  0.0026 


and  for  4a  loading. 


P(|m|  >  4 <rm)  =  20(4)  =  6x10  ? 
If  we  take  the  case  of  3 o  loading. 


(3om)2 


1 

9 


and 


For4o  loading. 


So 

No 


=  21.845  =  43.4  dB 


12,288  =  40.9  dB 


To  facilitate  the  comparison  of  PCM  with  other  types  of  modulation,  the  SNRs  of  FM 
and  DSB-SC  are  superimposed  on  the  SNR  of  PCM  in  Fig.  10.15.  The  theoretical  bandwidth 
expansion  ratio  for  PCM  is  Bpcm/B  =  »  assuming  polar  signaling  with  pulse  shape  satisfy- 
i„g  Nyauist-s  first  criterion  (Fig.  7.14).  In  practice,  this  can  be  achieved  by  using  duobmm, 
signalin.  Today's  PCM  systems  use  bipolar  signaling,  however,  requiring  Brcu/B  =  2n. 

J^Jr  P  in  Eo  ,10  59)  is  valid  only  for  polar  signaling.  Bipolar  signaling  requires  twice 

i"  Bg  10.15  is  valid  for  bipolar  signaling  if  3  dB  is  added  to 

each  value  of  y. 

From  Eq.  (10.61b)  we  have 


__  ^  ^  ->25pcm/^ 

N0  \ 


m\ 


\  J 


(10.62) 


where  1  <  k  <  2.  For  duobinary  k  =  1.  and  for  bipolar  k  =  2. 
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BSBara 


Trading  Bandwidth  for  Performance  It  is  clear  from  Eq.  (10.62)  that  in  PCM,  the  out¬ 
put  SNR  increases  exponentially  with  the  transmission  bandwidth.  Compare  this  with  angle 
modulation,  where  the  SNR  increases  as  a  square  of  the  transmission  bandwidth.  In  angle  mod¬ 
ulation,  doubling  the  transmission  bandwidth  quadruples  the  output  SNR.  From  Eq.  (10.61c) 
we  see  that  in  PCM,  increasing  n  by  1  quadruples  the  SNR.  But  increasing  n  by  1  increases 
the  bandwidth  only  by  the  fraction  l/n.  For  n  =  8,  a  mere  12.5%  increase  in  the  transmission 
bandwidth  quadruples  the  SNR.  Therefore  in  PCM,  the  exchange  of  SNR  tor  bandwidth  is 
much  more  efficient  than  in  angle  modulation.  This  is  particularly  evident  for  large  values  of 
/?,  as  can  be  seen  from  Fig.  10.15.  For  smaller  values  of  /?,  the  difference  between  FM  and 
PCM  is  not  as  impressive  as  for  large  values  of  n.* 


Figure  10.16 

Input-output 
characteristic  of 
a  PCM 
compressor. 


Companded  PCM 

The  output  SNR  of  PCM  is  proportional  to  m 2/mjr  Once  a  uniform  quantizer  has  been  designed, 

trip  is  fixed,  and  m2/mj)  is  proportional  to  the  speech  signal  power  m2  only.  This  can  vary  from 
talker  to  talker  (or  even  for  the  same  talker)  by  as  much  as  40  dB,  causing  the  output  SNR  to 
vary  widely.  This  problem  can  be  mitigated,  and  a  relatively  constant  SNR  over  a  large  dynamic 
range  of  m2  can  be  obtained,  either  by  nonuniform  quantization  or  by  signal  companding.  The 
methods  are  equivalent,  but  in  the  latter,  which  is  simpler  to  implement,  the  signal  amplitudes 
are  nonlinearly  compressed. 

Figure  6.16  showed  the  input-output  characteristics  of  the  two  most  commonly  used 
compressors  (the  /x-law  and  the  A-law).  For  convenience,  let  us  denote 

m 


Clearly,  the  peak  value  of  x  is  1  (when  m  =  mp).  Moreover,  the  peak  value  of  the  compressor 
output  y  is  also  1  (occurring  when  m  =  m^).  Thus,  x  and  y  are  the  normalized  input  and  output 
of  the  compressor,  each  with  unit  peak  value  (Fig.  10.16).  The  input-output  characteristics 
have  an  odd  symmetry  about  x  =  0.  For  convenience,  we  have  only  shown  the  region  x  >  0 
The  output  signal  samples  in  the  range  (-1,  1)  are  uniformly  quantized  into  L  levels,  with  a 
quantization  interval  of  2/L.  Figure  10.16  shows  the yth  quantization  interval  for  the  output  y 
as  well  as  the  input  x.  All  input  sample  amplitudes  that  lie  in  the  range  A j  are  mapped  into» 
For  the  input  sample  value  x  in  the  range  Ay,  the  quantization  error  is  q  =  (x  —  Xj),  and 

rXj+(Aj/2) 

2  /  (x  —  Xj)2 px(x)  dx 

is  the  part  of  q-  (the  mean  square  quantizing  error)  contributed  by  x  in  the  region  Ay.  The 
factor  2  appears  because  there  is  an  equal  contribution  from  negative  amplitudes  of  x  centered 
at  —xj.  Thus, 


—  _  fXj-HAj/2) 

=  /  C X-Xj)2px(x)dx 

Jxj-(Aj/2) 


The  FM  plots  in  Fig.  10.15  are  without  preemphasis  and  deemphasis. 
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Figure  10.16 

Input-output 
characteristic  of 
a  PCM 
compressor. 


Because  L  »  1,  the  quantizing  interval  (2/L)  and  Ay  are  very  small,  and  px(*>  can  be  assumed 
to  be  constant  over  each  interval.  Hence, 


_  /•*;+<  V2>  , 

q2  =2Yr>Axj)  /  (x-xj)  dx 


Px(Xj)  A] 
”12 


(10.63) 


Because  2/L  and  Ay  are  very  small,  the  compression  characteristics  can  be  assumed  to  be 
linear  over  each  A/,  and 


Substituting  this  in  Eq.  (10.63),  we  have 

—  2  PxiXj) 

q"  "  3L2  y  ly(Xj)]2  1 
For  L  large  enough,  .he  preceding  sum  can  be  approximated  by  an  integral 


2  f  P 

~  3L2  io  ly 


Px(x) 


[y(*)]2 


dx 


(10.64) 


For  the  ju-law  [Eq.  (6.35a)], 


In  (1  +  px) 

y  = 


0  <  x  <  1 


In  ( 1  +M) 
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and 


m  = 


In  (1  + 11) 


leading  to 


q!  =  (S) 

If  PxW  is  symmetrical  about  x  =  0, 


CTx  =  2  / ' 

Jo 


x2pAx)dx 


and  |x|,  the  mean  of  the  rectified  x,  is 


|x|  =  2  f 
Jo 


xp\(x)  dx 


We  can  express  q2  as 


=  pn(l  +M)j 


—  _  fin  (1  +  p)Y  I  1  +  ;it2gx2  +  2>u|x| 


3Z,2 


nn(l+M)]2/2  2|x|  l' 

-  -  1  7X  H - p  — 

M  M2, 


(10.65) 


(10.66a) 


(10.66b) 


(10.67a) 

(10.67b) 


“  3Z.2  (" 

Recall  that  q-  in  Eqs.  ( 10.67)  is  the  normalized  quantization  error.  The  actual  error  is  nip  q'  The 
normalized  output  signal  is  \(t).  and,  hence,  the  normalized  output  power  S(>  =  = 

The  actual  S„  will  be  m2  a2.  Hence, 


So_  _  _  3  L2  a 2 

N0  q2  [In  (1  +  p)]2  (or2  -f  2\x\/p  +  \/p2) 

-  3  L2  1 

"  [In  (1  +  m)]2  (1  +  2 R/m<t2  +  1/m2<t2) 


(10.68a) 

(10.68b) 


To  get  an  idea  of  the  relative  importance  of  the  various  terms  in  parentheses  in  Eq.  (1 0.68b). 
we  notediat  x  is  an  RV  distributed  in  the  range  (-1,  1).  Hence,  o2  and  jx|  are  both  less  than 
1 .  and  |x|/ctx  is  typically  in  the  range  of  0.7  to  0.9.  The  values  of  //used  in  practice  are  greater 
than  100.  For  example,  the  D2  channel  bank  used  in  conjunction  with  the  T1  carrier  system 
has  p  255.  Thus,  the  second  and  third  terms  in  the  parentheses  in  Eq.  (10.68b)  are  small 
compared  to  I  if  cr2  is  not  too  small,  and  so 


^  3  L2 

No  [In  ( 1  +  p)]2 


(10.68c) 
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Figure  10.17 

PC  M 

performance 
with  and  without 
companding. 


which  is  independent  of  a2.  The  exact  expression  in  Eq.  ( 10.68b)  has  a  weak  dependence  on 
a2  over  a  broad  range  of  <rx.  Note  that  the  SNR  in  Eq.  (1068b)  also  depends  on  the  s.gnal 
statistics  lli  and  a2.  But  for  most  of  the  practical  PDFs.  |x|/<r*  is  practically  the  same  (in 
the  range  of  0.7-0.9).  Hence,  S„/N„  depends  only  on  or2.  This  means  the  plot  ot  S„/ N„  vs. 
?  pracically  independent  of  the  PDF  of  X.  Figure  10.1 7  plots  SJN  vs.  <r-  for  two 

different  PDFs:  Laplacian  and  Gaussian  (see  Example  10.9).  It  can  be  seen  that  there  is  hardly 

anv  difference  between  the  two  curves.  _  _ 

Because  x  =  m/mp,  <x2  =  <r2/m2,  and  |x|  =  |m| /mp,  Eq.  (10.68a)  becomes 


3  Ll 


S*  = _ 

N0  [In  (1  +  At)]2 


°m  /m2p  +  2\m\/nmr+  1/M‘ 


(10.69) 


One  should  be  careful  in  interpreting  mp  in  Eq.  (10.69).  Once  the  system  is  designed  for  some 
is  fixed.  Hence,  m,  is  a  constant  of  the  system,  no,  of  the  unknown  message  stgnal 

m(f)  that  may  be  subsequently  transmitted. 

Example  1  0.9  A  voice  signal  amplitude  PDF  can  be  closely  modeled  by  the  Laplace  density* 

—  \/2|m|/<7m 


Pm(m)  = 


V2 


-e 


For  a  voice  PCM  system  with  „  =  8  and  4  =  255.  find  and  sketch  the  output  SNR  as  afunetion 
of  the  normalized  voice  power  oJn\-p. 


*  A  better  but  more 
k 


complex  model  for  speech  signal  amplitude  m  is  the  gamma  density7 


600 


PERFORMANCE  ANALYSIS  OF  MODULATED  COMMUNICATION  SYSTEMS  UNDER  NOISE 


It  is  straightforward  to  show  that  the  variance  of  this  Laplace  PDF  is  a*.  In  practice,  the 
speech  amplitude  will  be  limited  by  either  3cr  or  4a  loading.  In  either  case,  the  probability 
of  observing  m  beyond  this  limit  will  be  negligible,  and  in  computing  |m|  (etc.),  we  may 
use  the  limits  0  to  oo, 

1ST  =  2  f°  dm  =  0.707am 

Jo  crmV 2 


PROBLE/ 


Hence,  from  Eq.  (10.69), 

So= _ 6394(a^/m^) _ 

N0  (a^/m^)  +  O.OOSSSfam/m,,)  +  1.53  x  10~5 

This  is  plotted  as  a  function  of  (a^/wj;)  in  Fig.  10.17. 


(10.70) 


Example  10.10  Repeat  Example  10.9  for  the  Gaussian  m(r). 

In  this  case, 

n  OO 

|m|  =  2  /  — dm  =  0.798am 

Jo  am  V27T 

and 


So  _ _ 6394(q^/m^) _  (,071) 

No  (ol/nj)  +  0.0063 (om/mp)  +  1 .53  x  10"5 

The  SNR  here  is  nearly  the  same  as  that  in  Eq.  (10.70).  The  plot  of  SNR  vs.  0mlmp 
(Fig.  10.17)  is  practically  indistinguishable  from  that  in  Example  10.9.  Thus,  this  result 
confirms  the  advantage  of  applying  companded  PCM  by  desensitizing  the  PCM  per¬ 
formance  to  message  signal  distribution.  The  treatment  of  difference  signals  by  the 
companded  PCM  results  in  more  fairness  of  performance. 
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PROBLEMS 

10.1- 1  A  baseband  channel  has  transfer  function 

10~2 

Hc{f)  -  j2nf  +  3000* 

The  message  signal  PSD  is  Sm(f)  =  8n (f/a),  with  a  =  8000.  The  channel  noise  PSD  is 
Sn(f)  =  10-8.  If  the  output  SNR  at  the  receiver  is  required  to  be  at  least  40  dB.  find  the 
minimum  transmission  bandwidth  required  for  the  SNR. 

10.1- 2  A  signal  m(f )  with  PSD  Sm{f)  =  P  /a)  and  or  =  8000  is  transmitted  over  a  telephone  channel 

with  transfer  function  Hc(f)  =  \0~3/(j2nf  +  or).  The  channel  noise  PSD  is  Sn{f)  =10  .To 
compensate  for  the  channel  distortion,  the  receiver  filter  transfer  function  is  chosen  to  be 

«W> -(*£*=)  n  (If) 


The  receiver  output  SNR  is  required  to  be  at  least  55  dB.  Determine  the  minimum  required  value 
of  P  and  the  corresponding  transmitted  power  Sr  and  the  power  S,  received  at  the  receiver  input. 


10.2-1 


For  a  DSB-SC  system  with  a  channel  noise  PSD  of  Sn{f)  =  I0"12  and  a  baseband  signal  of 
bandwidth  5  kHz.  the  receiver  output  SNR  is  required  to  be  at  least  47  dB.  The  receiver  is  as 

shown  in  Fig.  10.2. 

(a)  What  must  be  the  signal  power  5/  received  at  the  receiver  input  ? 

(b)  What  is  the  receiver  output  noise  power  N<?? 

(c)  What  is  the  minimum  transmitted  power  Sr  if  the  channel  transfer  function  is  Hc(f)  =  10  1 

over  the  transmission  band? 


10.2- 2  Repeat  Prob.  10.2-1  for  SSB-SC. 

10.2- 3  Assume  [rn(/)]max  =  -[m(/)]min  =  mP  in  an  AM  SyStem' 

(a)  Show  that  the  output  SNR  for  AM  [Eq.  ( 10. 10)]  can  be  expressed  as 

So  -y  (10.73) 

N0  k 2  +  /r2 


(b) 

(c) 


here  k2  =  ml/ m2. 

[sing  the  result  in  part  (a),  for  a  periodic  triangle  message  signal  (as  in  Fig.  P10.3-2)  with 
lodulation  index  fi  =  1,  find  the  receiver  output  SNR  S0/N0. 

:St  and  S'T  are  the  AM  and  DSB-SC  transmitted  powers,  respectively,  required  to  attain 
given  output  SNR,  then  show  that 


Sr  -  k2Sj  for  k 2  »  1 


(10.74) 


10  2-1  AGaussianbasebandra„domprocessm(/)istransmittedbyAM.  For  3o  loading  =  3<r), 

find  the  output  SNR  as  a  function  of  y  and  \i. 

,0  2-5  In  many  radio  transmilleis.  the  Iransmilied  signal  is  limited  ^  avera^e 

power  Under  such  a  limllalion.  AM  freer  much  worse  ton  DSB-SC  or  SSB-SC. 

Show  .ha,  for  ron.  mode, a, ion  for  a  h,ed  peak  pmm  rranannrred.  to  ourpu,  SNR  of  AM 

is  6  dB  below  that  of  DSB-SC  and  9  dB  below  that  of  SSB-SC. 
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(b)  What  would  be  the  difference  if  the  message  is  a  periodic  triangle  waveform  (as  in 
Fig.  PI 0.3-2). 

10.2-6  Determine  the  output  SNR  of  each  of  the  two  quadrature  multiplexed  channels  and  compare  the 
results  with  those  of  DSB-SC  and  SSB-SC. 


10.3-1  For  an  FM  communication  system  with  =  5  and  white  channel  noise  with  PSD  Sn  (Y)  =  10  l(), 
the  output  SNR  is  found  to  be  28  dB.  The  baseband  signal  m(r)  is  Gaussian  and  band-limited  to 

ICI.TT. I  T  _  1 I! '  _ A  ~  ...  Tka  n-l!n  if  Of  1  TU 1 C  mP£Vl<C 


15  kHz,  and  3a  loading  is  used  (i.e.,  trip  =  3a).  The  demodulator  gain  is  a  =  10“-.  This  means 
that  the  FM  demodulator  output  is  a\l/(t)  when  the  input  is  A  cos  [coct  4-  ^0)].  In  the  present 
case,  the  signal  at  the  demodulator  output  is  akfm(t).  The  output  noise  is  also  multiplied  by  a. 

(a)  Determine  the  received  signal  power  5/. 

(b)  Determine  the  output  signal  power  S0. 

(c)  Determine  the  output  noise  power  Na- 

10.3-2  Consider  the  modulating  signal  m(f)  shown  in  Fig.  PI 0.3-2.  Assume  that  the  bandwidth  of  m(f) 
is  the  frequency  of  its  fifth  harmonic  frequency.  Show  that  PM  is  superior  to  FM  by  a  factor  of 
approximately  20  from  the  SNR  point  of  view. 


Figure 


P.10.3-2 


-B 


10.3-3  Form(/)  =  a\  cos  co\t+a2  cos  o>2 /.show  that  PM  is  superior  to  FM  from  the  SNR  point  of  view 

when  (1  -Fat)2  <  (1  jc)2/3,  where  .v  =  a\/u2  and  v  =  io\/co2  .This  example  shows  that  two 

signal  spectra  with  the  same  mean  square  bandwidth  can  have  different  conventional  bandwidths. 

Hint:  Use  Eq.  (10.34). 


10.3-4  Show  that  an  alternative  expression  of  the  mean  square  bandwidth  for  m(t)  in  Eq.  (10.37a),  that 
is,  is  written  as 


m  ^f-oomHOdt 


(10.75) 


Hint:  Use  the  transfer  function  of  the  differentiator  j27rf  and  apply  Parse vaFs  theorem. 

10.3-5  Show  that  when  the  modulating  signal  m (t)  has  a  Butterworth  PSD,  that  is. 


then  the  mean  square  bandwidth  B ^  is  given  by 


(10.77) 
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10.3-6  A  modulating  signal  PSD  is  given  by 

Smtf)  =  (10.78) 

The  effective  bandwidth  B  of  m(f)  is  that  bandwidth  which  contains  x  percent  of  the  total  power 
of  m(f).  Determine  which  of  the  angle  modulations  is  superior  if  (a)  x  =  "'•  <b>  X  =  9°'. 
(c)  x  =  70. 


10.3- 7  Show  that 

(a)  For  tone  modulation,  the  dividing  line  between  narrowband  and  wideband  modulation  is 
P  =  0.47. 

(b)  For  a  Gaussian  modulating  signal  with  3 a  loading,  the  dividing  line  is  at  p  =  0.55. 

(c)  For  4a  loading  it  is  at  p  =  0.56. 

10.3- 8  For  FM  stereophonic  broadcasting  (Fig.  5.18).  show  that  the  (L  -  R)  channel  is  about  22  dB 

noisier  than  the  L  +  R  channel. 


Hint-  The  baseband  signal,  L  +  R.  is  preemphasized  to  obtain  (L+R)  .  The  signal  (L  -  R)  is 
preemphasized  to  obtain  (L  -  R)\  which  is  used  to  obtain  (L-R)'  cos  wct.  The  sum  (L  +  R)  + 
(L  -  R)'  cos  oct  now  frequency-modulates  a  carrier.  At  the  receiver,  after  frequency  demodu¬ 
lation,  (L  +  R)'  and  (L  -  R)'  cos  <oct  are  separated.  (L  -  R)'  is  deemphasized.  (L  +  R)'  cos  a>ct 
is  multiplied  by  2  cos  wct  to  obtain  (L  -  R)'.  which  is  then  deemphasized. 


10  3-9  In  Prob.  10.3-3.  FM  and  PM  were  compared  for  a  given  transmission  bandwidth.  II  we  compare 
them  for  a  given  mean  square  transmission  bandwdith,  we  get  a  slightly  ditterent  result.  Show 

that  PM  is  superior  to  FM  if  (1  +x2y2)  <  0  +*2)/3  when  ,hey  are  «>mPared  for  a  mean 
square  transmission  bandwidth.  This  comparison  shows  that  two  signal  spectra  with  the  same 
mean  square  transmission  bandwidth  can  have  different  conventional  bandw.dths.  Hint:  In  this 
case  SmCO  >s  discrete.  Show  that  normalizing  Sm(f)  and  taking  the  second  moment  about  the 

origin  yield 

F  °\!hM 

tint  —11 


10  4-1  We  need  to  design  a  binary  PCM  system  to  transmit  an  analog  TV  signal  m(t)  of  bandwidth 

4. .5  MHz.  The  receiver  output  ratio  of  signal  to  quantization  no.se  ratio  is  required  to  be  at  least 

50  dB. 

(a)  The  TV  signal  m(f)  value  is  uniformly  distributed  in  the  range  (—mp,  nip).  Find  the  minimum 
number  of  quantization  levels  L  required.  Select  the  nearest  value  of  L  to  satisfy  L  =  2  . 

(b)  For  the  value  of  L  in  part  (a),  compute  the  receiver  output  SNR  assuming  the  nonthreshold 
region  of  operation. 

(c)  For  the  value  of  L  in  part  (a),  compute  the  required  channel  bandwidth  for  the  PCM 
transmission. 

,d.  If  the  ouipul  SNR  is  squired  .o  be  increased  b,  6  JB  (fonr  limesl.  wha,  an:  ibe  new  .nine 
„f  /  and  ihe  percentage  increase  in  iransmission  bandwidih? 


10.4-2 


In  M- ary  PCM.  pulses  can  take  M  distinct  amplitudes  (as  opposed  to  two  for  binary  PCM), 
(a)  Show  that  the  SQNR  for  M- ary  PCM  is 


^  =3W2n 
N„ 


(10.79) 
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(b)  Show  the  amount  of  channel  bandwidth  needed  to  transmit  the  TV  signal  in  Prob.  10.4-1 
using  8-ary  PCM. 

10.4- 3  A  zero  mean  Gaussian  message  signal  m(r)  band-limited  to  4  kHz  is  sampled  at  a  rate  of  12,000 

samples  per  second.  The  samples  are  quantized  into  256  levels,  binary-coded,  and  transmit¬ 
ted  over  a  channel  with  noise  spectrum  Sn(f)  =  2.5  x  10~7.  Each  received  pulse  has  energy 
Ep  =  2  x  10-5.  Given  that  the  signal  loading  is  3a  (i.e.,  mp  =  3a),  do  the  following. 

(a)  Find  the  output  SNR  assuming  polar  signal,  and  the  error  probability  given  in  Eq.  (10.59). 

(b)  If  the  transmitted  power  is  reduced  by  10  dB,  find  the  new  SNR. 

(c)  State  whether,  at  the  reduced  power  level  in  part  (b),  it  is  possible  to  increase  the  out¬ 
put  SNR  by  changing  the  value  of  L.  Determine  the  peak  output  SNR  achievable  and  the 
corresponding  value  of  L. 

10.4- 4  In  a  PCM  channel  using  k  identical  regenerative  links,  we  have  shown  that  the  error  probability 

Pe  of  the  overall  channel  is  kPe ,  where  Pe  is  the  error  probability  of  an  individual  link  (Example 
8.7).  This  shows  that  Pe  is  approximately  cumulative. 

(a)  If  A:  —  1  links  are  identical  with  error  probability  Pe  and  the  remaining  link  has  an  error 
probability  P' ,  find  the  Pe  of  the  new  relay  system. 

(b)  For  a  certain  chain  of  repeaters  with  k  =  100  ( 100  repeaters),  it  is  found  that  y  over  each 
of  99  links  is  23  dB,  and  over  the  remaining  link  y  is  20  dB.  Using  Eq.  (10.59)  to  calculate 
Pe  and  P'e  (with  n  =  8).  Now  compute  Pe  and  show  that  Pe  is  primarily  dominated  by  the 
weakest  link  in  the  chain. 

10.4- 5  For  companded  PCM  with  n  =  8,  fi  —  255,  and  amplitude  m  uniformly  distributed  in  the  range 

(-A,  A)  (A  <  mp ),  show  that 

Sq  _  _ 6394 (cr^/nip) _  (10.80) 

N°  (Vm/mp)  +  0.0068 (am/wp)  -f  1.53  x  10"5 

Note  that  mp  is  a  parameter  of  the  system,  not  of  the  signal.  The  peak  signal  A  can  vary  from  talker 
to  talker,  whereas  mp  is  fixed  in  a  given  system  by  limiting  the  peak  voltage  of  all  possible  signals. 


1  1  PERFORMANCE  ANALYSIS  OF 
I  I  DIGITAL  COMMUNICATION 
SYSTEMS 


In  analog  communications,  the  user  objective  is  to  achieve  high  fidelity  in  waveform  repro¬ 
duction-  Hence,  the  suitable  performance  criterion  is  the  output  signal-to-noise  ratio.  The 
choice  of  this  criterion  indicates  that  the  signal-to-noise  ratio  reflects  the  quality  of  the 
message  and  is  related  to  the  ability  of  the  listener  to  interpret  a  message. 

In  digital  communication  systems,  the  transmitter  input  is  chosen  from  a  finite  set  of 
possible  symbols  The  objective  at  the  receiver  is  not  to  reproduce  the  waveform  that  carries 
the  symbol  with  fidelity;  instead,  the  receiver  aims  to  accurately  determine  which  particular 
symbol  was  transmitted  among  the  set  of  possible  ones.  Because  each  symbol  is  represented  by 
a  particular  waveform  at  the  transmitter,  our  goal  is  to  decide,  from  the  noisy  received  signal 
which  particular  waveform  was  originally  transmitted.  Logically,  the  appropriate  hgure  of 
merit  in  a  digital  communication  system  is  the  probability  of  error  m  this  decision  at  the 
receiver  In  particular,  the  probability  of  bit  error,  also  known  as  the  bit  error  rate  (BER).  is  a 
direct  quality  measure  of  the  communication  system.  Not  only  is  the  BER  important  to  d.g.ta 
signal  sources,  it  is  also  directly  related  to  the  quality  of  signal  reproduction  for  analog  signal 

S°UrinStwTcSlpter0'wI)present  two  important  aspects  in  the  performance  analysis  of  digital 
■  cvctems  The  first  part  focuses  on  the  error  analysis  of  several  specific  binary 
detection  receivers  The  goal  is  for  students  to  learn  how  to  apply  the  fundamental  tools  of 
probability  theory  and  random  processes  for  BER  performance  analysis.  Our  second  focus  ,s  ,o 
musrrale  derailed  deriva, ion  of  op, mum  de„c,i,m  receiver,  for  general  drgual  common, cal, on 
systems  such  that  the  receiver  BER  can  be  minimized. 


11.1  OPTIMUM  LINEAR  DETECTOR  FOR  BINARY 
POLAR  SIGNALING 


,  •  Vitinn  systems  the  information  is  transmitted  as  0  or  1  in  each  time  interval 

^consider , he  bina^  polar  signaling  sysiem  of  Fig.  Ilia,  in  which  ,he  source 
LI  hi,  1  and  0  are  represen, ed  by  ±n( 0,  respectively.  Having  passed  a  d,s,on,onless.  bu, 
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Figure  11.1 

Typical  binary 
polar  signaling 
and  linear 
receiver. 


±p(t)  +  n(f) 

H(f) 

+p„(t)  +  na(r)  t  =  tm 

Threshold 

Decision 

- > - 

h(t) 

device 

(a) 


noisy,  channel,  the  received  signal  waveform  is 

y(t )  =  ±p(t)  +  n(r)  0  <  /  <  7o  (H.1) 

where  n(f)  is  a  Gaussian  channel  noise. 


1  1.1.1  Binary  Threshold  Detection 

Given  the  received  waveform  of  Eq.  (1 1.1),  the  binary  receiver  must  decide  whether  the 
transmission  was  originally  a  1  or  a  0.  Thus,  the  received  signal  y(f)  must  be  processed  to 
produce  a  decision  variable  for  each  symbol.  The  linear  receiver  for  binary  signaling,  as  shown 
in  Fig.  11.1a,  has  a  general  architecture  that  can  be  optimum  (to  be  shown  later  in  Section  1 1.6). 
Given  the  receiver  filter  H (f)  or  h(t),  its  output  signal  for  0  <  t  <  7b  is  simply 

y(f)  =  ±p(t)  *h(t)  +  n(f)  *h(t)  =  ±p0(t)  +  n„(f)  (ll2) 

PoU)  n„(r) 

The  decision  variable  of  this  linear  binary  receiver  is  the  sample  of  the  receiver  filter  output 
at  t  =  tm: 


rUm)  —  ±Po(tm)  +  n  oUm)  (H-3) 

Based  on  the  properties  of  Gaussian  variables  in  Section  8.6, 

no(0=  [  n(r)/t(r  —  r)dr 
Jo 

is  Gaussian  with  zero  mean  so  long  as  n(f)  is  a  zero  mean  Gaussian  noise.  If  we  define 

AP=Po{tm)  (H-4a) 

crt  =  E{n0(tm)2}  (H-4b) 

then  this  binary  detection  problem  is  exactly  the  same  as  the  threshold  detection  ot 
Example  8.16.  We  have  shown  in  Example  8.16  that,  if  the  binary  data  are  equally 
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to  be  0  or  1,  then  the  optimum  threshold  detection  is 


dec{r(fm)}  = 


1  if  r(tm)  >  0 
0  if  r(tm)  <  0 


(11.5a) 


whereas  the  probability  of  (bit)  error  is 


Pe  =  Q(p) 


(11.5b) 


in  which 


(11.5c) 


cr„ 

To  minimize  Pe ,  we  need  to  maximize  p  because  Q(p)  decreases  monotonically  with  p. 

1  1 .1 .2  Optimum  Receiver  Filter- Matched  Filter 

Let  the  received  pulse  p(t)  be  time-limited  to  Ta  (Fig.  11.1).  We  shall  keep  the  discussion  as 
general  as  possible  at  this  point.  To  minimize  the  BER  or  P„  we  should  determine  the  best 
receiver  filter  H(f)  and  the  corresponding  sampling  instant  such  that  Q(p)  is  minimized. 
In  other  words,  we  seek  a  filter  with  a  transfer  function  H (f)  that  maximizes 


(11.6) 


which  is  coincidentally  also  the  signal-to-noise  ratio  at  time  instant  /  —  tm. 

First  denote  the  Fourier  transform  of  p(t)  as  P(f)  and  the  PSD  of  the  channel  no.se  n(r) 
as  Sn(f).  We  will  determine  the  optimum  receiver  filter  in  the  frequency  domain.  Starting  with 


Po(t)  =  f-'\P{f)H(f)] 

=  P(f  )H(f)e>2nf'  df 


we  have  the  sample  value  at  t  —  tm 


Po(tm)  =  r  P(f)H(f)e*2nfi"df 
J-OO 


(11.7) 


On  the  other  hand,  the  filtered  noise  has  zero  mean 


'o 


while  its  variance  is  given  by 


a\  =  nl(t)  =  fX  Sn(f)\H(f)\2df 

J-OO 


(11.8) 
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Hence,  the  signal-to-noise  ratio  is  given  in  the  frequency  domain  as 

2  |/-"W)/W2^#|2 

P  ”  f-ooS« I2  df 


The  Cauchy-Schwarz  inequality  (Appendix  B)  is  a  very  powerful  tool  for  finding  the 
optimum  filter  H(f).  We  can  simply  identify 


Yif)  = 


~7m~ 


Then  by  applying  the  Cauchy-Schwarz  inequality  to  the  numerator  of  Eq.  ( 1 1 .9),  we  have 

P  f-oo\X(f)\2df 

<  /-°^IX(^)i2#-/^0|yy)i2# 

"  f-oo\x<f)\2df 


\Y(f)\2df 


UW 

sn(f) 


df 


(11.10a) 


with  equality  if  and  only  if  X(f)  =  k[Y(f)]*  or 


Figure  1 1 .2 

Optimum  choice 
for  sampling 
instant. 


H(f)s/S^f)  =  k 


P(f)ei^fhn 


kP{—f)e  j2nf'm 

~vwr 


Hence,  the  SNR  is  maximized  if  and  only  if 


H(f)  —  k 


P(—f)e-jl*fim 

sn{f) 


(11.10b) 


where  k  is  an  arbitrary  constant.  This  optimum  receiver  filter  is  known  as  the  matched  filter 
This  optimum  result  states  that  the  best  filter  at  the  binary  linear  receiver  depends  on  several 
important  factors:  (1)  the  noise  PSD  Sn(f),  (2)  the  sampling  instant  tm,  and  (3)  the  pulse  shape 
P(f).  It  is  independent  of  the  gain  at  the  receiver  k ,  since  the  same  gain  would  apply  to  both 
the  signal  and  the  noise  without  affecting  the  SNR. 

For  white  channel  noise  Sn(f)  =  M/2,  Eq.  (1 1.10a)  becomes 

where  Ep  is  the  energy  of  p(t),  and  the  matched  filter  is  simply 

Hif)  =  k'P(-f)e~j27tfim  (11.1  lb) 

where  k  =  2 k/fl  is  an  arbitrary  constant. 


11.1  Optimum  Linear  Detector  for  Binary  Polar  Signaling 


609 


Figure  11.2 

Optimum  choice 
for  sampling 
instant. 


The  unit  impulse  response  k(t)  of  the  optimum  filter  is  obtained  from  the  inverse  Fourier 
transform 

h(t)  =  T-X[k'Pi-f)e-j2nfim] 

Note  that  p  i-t )  <=►  />(-/)  and  e~^  represents  the  time  delay  of  tm  seconds.  Hence, 

h(t)  =  k'p(tm-t )  (11.11c) 

The  response  p{tm-t)  is  the  signal  pulse />(-!)  delayed  by  tm.  Three  cases./™  <  Tatm  =  T„, 
and  >  T„,  are  shown  in  Fig.  1 1 .2.  The  first  case,  <  Ta,  yields  a  noncausa.  .mpuNe 
response  which  is  unrealizable.* *  Although  the  other  two  cases  yield  phys.cally  realizable 
Z  fc  to  case.  >  To,  delays  .he  decision-making  instant nnnecessanly.  The  ca.se 


u  nnrfpr^tood  intuitively  when  the  decision-making  instant  is  tm  <  T„.  In  this 

*  The  filter  unreahzabihtj '  can  be  rea  b  d  ^  ha's  ^  fcd  (Q  ,he  fi)ter  (,m  <  T/)),  This  calls  for  a 
case,  we  are  forced  to  make  a  decis  o  P  d  As  wg  know  on,  unreali/able  (noncausal) 

prophetic  filter,  which  can  respond  to  inputs  before  tney  ar  w 

filters  can  do  this  job. 
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tm  =  T0  gives  the  minimum  delay  for  decision  making  using  a  realizable  filter.  In  our  future 
discussion,  we  shall  assume  tm  =  T(),  unless  otherwise  specified. 

Observe  that  both  p(t)  and  h(t)  have  a  width  of  T„  seconds.  Hence,  p„(f),  which  is  a 
convolution  of  p(t)  and  h(t),  has  a  width  of  2 T„  seconds,  with  its  peak  occurring  at  t  =  T„  where 
the  decision  sample  is  taken.  Also,  because  P0(f)  =  P(f)H (f)  =  k'\P(J)\~e  •/_7r^r",  p0(t)  is 
symmetrical  about  t  =  T()* 

Since  the  gain  k'  does  not  affect  the  SNR  p,  we  choose  k'  =  1.  This  gives  the  matched 
filter  under  white  noise 


h(t)  =p(T„  -t) 


(11.12a) 


or  equivalently 


H(f)  =  P(-f)e~j27Tfr°  (11.12b) 

for  which  the  signal  to  noise  ratio  is  maximum  at  the  decision-making  instant  t  =  T0. 

The  matched  filter  is  optimum  in  the  sense  that  it  maximizes  the  signal-to-noise  ratio 
at  the  decision-making  instant.  Although  it  is  reasonable  to  assume  that  maximization  of  this 
particular  signal-to-noise  ratio  will  minimize  the  detection  error  probability,  we  have  not  proven 
that  the  original  structure  of  linear  receiver  with  threshold  detection  (sample  and  decide)  is  the 
optimum  structure.  The  optimality  of  the  matched  filter  receiver  under  white  Gaussian  noise 
will  be  shown  later  (Section  1 1.6). 

Given  the  matched  filter  under  white  Gaussian  noise,  the  matched  filter  receiver  leads  to 
Pmax  of  Eq.  ( 1 1 . 1 1  a)  as  well  as  the  minimum  BER  of 


Pe  =  C(Pmax)  =  Q  (jjf\  <lU3) 

Equation  ( 1 1 . 1 3)  is  quite  remarkable.  It  shows  that,  as  far  as  the  system  performance  is  con¬ 
cerned,  when  the  matched  filter  receiver  is  used,  various  waveforms  used  forp(f)  are  equivalent 
as  long  as  they  have  the  same  energy 

/°°  ,  rT» 

\Ptf)\2df=  \p(t)\2dt 

-00  JO 

The  matched  filter  may  also  be  implemented  by  the  alternative  arrangement  shown  m 
Fig.  1 1 .3.  If  the  input  to  the  matched  filter  is  y(/),  then  the  output  r(/)  is  given  by 

r(t)=  f  y(x)h(t-x)dx  (O.H) 

J — 00 


where  h{t)  =  p(T0-  t)  and 


Figure  11 .3 

Correlation 

detector. 


1 1.2  GE 


Ht-x)=p[n-<,-x)]=p(x+T<>-t)  nos 


*  This  follows  from  the  fact  that  because  \P{f)\2  is  an  even  function  off,  its  inverse  transform  is  symmetrical  abo 
t  =  0  (see  Prob.  3.1-1).  The  output  from  the  previous  input  pulse  terminates  and  has  a  zero  value  at  /  ==  T0- 
Similarly,  the  output  from  the  following  pulse  starts  and  has  a  zero  value  at  /  =  T().  Hence,  at  the  decision-making 
instant  T() ,  no  intersymbol  interference  occurs. 


1  1 .2  General  Binary  Signaling 


611 


Figure  1 1 .3 

Correlation 

detector. 


y(0  =  ±p(t)  +  n(/) 


Threshold 

device 


Decision 


Hence, 


oo 


r(0  =  /  y  (x)p  (x  +  T0  —  t)dx 


(11.16a) 


-OO 


At  the  decision-making  instant  t  =  TOJ  we  have 


(11.16b) 


Because  the  input  y(jt)  is  assumed  to  start  at  x  —  0  and  p  (.x)  —  0  tor  x  >  Tn ,  we  have  the 
decision  variable 


(11.16c) 


We  can  implement  Eqs.  ( 1 1 . 16)  as  shown  in  Fig.  1 1 .3.  This  type  of  arrangement,  known  as  the 
correlation  receiver,  is  equivalent  to  the  matched  filter  receiver. 

The  right-hand  side  of  Eq.  (11.16c)  is  the  cross-correlation  of  the  received  pulse  with 
p(t)  Recall  that  correlation  basically  measures  the  similarity  of  signals  (Sec.  2.6).  Thus,  the 
optimum  detector  measures  the  similarity  of  the  received  signal  with  the  pulse  pit).  Based 
on  this  similarity  measure,  the  sign  of  the  correlation  decides  whether  pit)  or  -pit)  was 

transmitted.  .  .  x  r 

Thus  far  we  have  discussed  polar  signaling  in  which  only  one  basic  pulse  pit)  of  opposite 

signs  is  used  Generally,  in  binary  communication,  we  use  two  distinct  pulses  pit)  and  qit)  to 
represent  the  two  symbols.  The  optimum  receiver  for  such  a  case  will  now  be  discussed. 


1 1 .2  GENERAL  BINARY  SIGNALING 

1 1 .2.1  Optimum  Linear  Receiver  Analysis 


Thp  incoming  signal  v(0  is  transmiucuuiiuu^i.  a  i.itw.  v  - * - w  ~ 

T  The  decision  of  whether  0  or  1  was  present  at  the  input  depends  on  whether  is  or  is  not 
T„)  is  less  than  a0,  where  is  the  optimum  threshold. 


p(r)  +  n(/)  0  <  t  <  Tb  for  data  symbol  1 

qit)  +  n(f)  0  <t<Tb  for  data  symbol  0 

sjgnal  y(/)  is  transmitted  through  a  filter  Hif),  and  the  output  r(f)  is  sampled 

_  _ *u~  : _ _  ^lrv«nnrlr  rtn  titVusfKpr  1  c  nr  ic  nnt 


at  Tb ■  The  decision  ( 
r(7fc)  is  less  than  a0 


mmmmsm 


^  d 
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Figure  1 1 .4 

Optimum  binary 

threshold 

detection. 


y(0 


ii 

H(f) 

Threshold 

r(f)  ’’  r (Tb) 

device 

(a) 


Decision:  m  —  0  if  r(7^)  <  ao 
m  =  1  if  r{Tb)  <  a0 


Let  p()(t)  and  qQ(t)  be  the  response  of  H(f)  to  inputs  p(t)  and  q(t ),  respectively.  From 
Eq.  ( 1 1 .7)  it  follows  that 

/oo 

P(f)H{f)e>lnrr»df  (1117a) 

-OO 

/OO 

Q{f)H(f)e>2nfr»df  (|117b) 

-OO 

and  ,  the  variance,  or  power,  of  the  noise  at  the  filter  output,  is 

/OO 

Sn(f)\H(f)\2df  U»-17c) 

-OO 

Without  loss  of  generality,  we  let  P„(Tb)  >  P„(Tb).  Denote  n  as  the  noise  output  at  Tb- Tben 
the  sampler  output  r(7/,)  =  q„(Th)  +  nor  p„(Tb)  +  n,  depending  on  whether  m  =  Oorm  =  1 
is  received.  Hence,  r  is  a  Gaussian  RV  of  variance  a2  with  mean  q„(Th)  or  p0(Tb)<  depending 
on  whether  m  =  0  or  1.  Thus,  the  conditional  PDFs  of  the  sampled  output  r(7/,)  are 


PrlmMO) 

Primal) 


d^exp(- 


[r-<?0(7*)]2\ 

2  o2  ) 

[r~Po(Th)]2\ 

2o2  J 


Optimum  Threshold 

The  two  PDFs  are  shown  in  Fig.  1 1 ,4b.  If  a„  is  the  optimum  threshold  of  detection,  then  the 
decision  rule  is 


m  =  I  °  if  r  <  a° 

1 1  if  r  >  a„ 

The  conditional  error  probability  P(e  |  m  =  0)  is  the  probability  of  making  a  wrong  decision 
when  m  =  0.  This  is  simply  the  area  A<)  under  pr|m(/'|0)  from  a„  to  oo.  Similarly,  /,(^ln1  " 
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is  the  area  4 1  under  pr|m(r|l)  from  -oo  to  a()  (Fig.  11.4b),  and 


Pe  =  /J(f|m,)P(m,)  =  -(4o  +  ^i) 


1 

2 


Qo(T()) 

On 


Po(T0)  ~ 
<*n 


)] 


(11.18) 


assuming  /^(O)  =  Pm(l)  =  0.5.  From  Fig.  1 1.4b  it  can  be  seen  that  the  sum  Aq  +  A\  of  the 
shaded  areas  is  minimized  by  choosing  a„  at  the  intersection  of  the  two  PDFs.  This  optimum 
threshold  can  also  be  determined  directly  by  setting  to  zero  the  derivative  of  Pe  in  Eq.  ( 1 1 . 1 8) 
with  respect  to  a0  such  that 


dPe 

da() 


1  ~  <jo(To)  ^  J qI  ^Po(Tq)  °0  j  _j_j 

i  r  i  r  ia0-<io(Tb)}2 1  i  _ 

2j;L^e,p[  J 


(  \Po(Tb)  -  a„\2  j  j 


=  0 


Thus,  the  optimum  a„  is 


a0  = 


Po(Th)  +  q«(Tb) 

2 


(1 1.19a) 


and  the  corresponding  Pe  is 


pe  =  P(e|0)  =  P(e|l) 


[r-q0(Tb)]2\ 

2  J 


dr 


=  Q 
=  Q 
=  Q 


-  q0(Tb)  j 
\p„(Tb)  -  qoiTh)' 

2o„ 


(f) 


where  we  define 


/?  = 


Po(Th)  -  q„(Th) 
On 


Substituting  Eq.  (11.17)  into  Eq.  ( 1 1  -20),  vve  get 

I  f^L[P(f)  -  Q(f)]H(f)ei2nfrh  df  |" 


(11.19b) 

(11.19c) 


(11.20) 
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This  equation  is  of  the  same  form  as  Eq.  (11.9)  with  Pif)  replaced  by  Pif)  -  Qif).  Hence, 
Cauchy-Schwarz  inequality  can  again  be  applied  to  show 


£max 


-r 


—  <2(/~)l2 

Sn(f ) 


df 


and  the  optimum  filter  H  if)  is  given  by 

IP(-f)  -  Q(-f)\e~i2nfr> 


H(f)  =  k- 


Sn{f) 


(11.21a) 


(11.21b) 


where  k  is  an  arbitrary  constant. 

The  Special  Case  of  White  Gaussian  Noise 

For  white  noise  Sn(f)  =  A/"/ 2,  and  the  optimum  filter  //(/*)  is  given  by* 

H(f)  =  [P(-f)  -  Q(-f)]e-i2nfr»  0 1.22a) 

and 

h(t)  =  p (Tb  —  t)  —  q  (Tb  —  t)  0‘-22b> 

This  is  a  filter  matched  to  the  pulse  p(t)  —  q(t).  The  corresponding  is  [Eq.  (1 1.21a)] 


0max  = 


\P(f)~Q(f)\2df 


2  CT»  , 

=  Jrl  d' 

_  Ep  +  Eq  —  2  Epq 

-  Jf]2 


where  Ep  and  Eq  are  the  energies  of  pit)  and  q(t),  respectively,  and 


(11.23a) 

(11.23b) 

(11.23c) 


rTh 

Epq  =  /  Pit)q(t)  dt 
Jo 


(11.24) 


So  far,  we  have  been  using  the  notation  Pe  to  denote  error  probability.  In  the  binary  case, 
this  error  probability  is  the  bit  error  probability  or  bit  error  rate  (BER)  and  will  be  denote 
by  Ph  (rather  than  Pe).  Thus,  from  Eqs.  (1 1.19c)  and  (1 1.23c), 


(11.25a) 

(11.25b) 


Because  k  in  Eq.  (1 1.21b)  is  arbitrary,  we  choose  k  =  M/2  for  convenience. 
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The  optimum  threshold  aQ  is  obtained  by  substituting  Eqs.  (11.17a,  b)  and  (1 1.22a)  into 
Eq.  ( 1 1 .19a)  and  recognizing  (via  variable  substitution)  that 

/oo  roo 

P  (f)Q  (-/)  df=  P  ( -f)Q  if)  df  =  Epg  (11 .26) 

-00  J -oo 

This  gives 

a0  =  \{Ep-Eq)  (11.27) 

In  deriving  the  optimum  binary  receiver,  we  assumed  a  certain  receiver  structure  (the  threshold 
detection  receiver  in  Fig.  1 1 .4).  It  is  not  clear  yet  whether  there  exists  another  structure  that 
may  have  better  performance  than  that  in  Fig.  1 1 .4.  It  will  be  shown  later  (in  Sec.  1 1 .6)  that  for 
a  Gaussian  noise,  the  receiver  derived  here  is  the  definite  optimum.  Equation  ( 1 1 ,25b)  gives  Ph 
for  the  optimum  receiver  when  the  channel  noise  is  white  Gaussian.  For  the  case  of  nonwhite 
noise,  P[,  is  obtained  by  substituting  /Wx  from  Eq.  (1 1.21a)  into  Eq.  ( 1 1.25a). 

Equivalent  Optimum  Binary  Receivers 

For  the  optimum  receiver  in  Fig.  1 1 .4a, 

H(f)  =  P(-f)e~j2j,JTh  -  Q(-f)e~j2nfTh 

This  filter  can  be  realized  as  a  parallel  combination  of  two  filters  matched  to  p(t)  and  q(t), 
respectively,  as  shown  in  Fig.  11.5a.  Yet  another  equivalent  form  is  shown  in  Fig.  11.5b. 
Because  the  threshold  is  (Ep  -  Eq)/2,  we  subtract  Ep/2  and  Eq/ 2,  respectively,  from  the  two 
matched  filter  outputs.  This  is  equivalent  to  shifting  the  threshold  to  0.  In  the  case  of  Ep  =  Eq, 
we  need  not  subtract  Ep/2  and  Eq/ 2  from  the  two  outputs,  and  the  receiver  simplifies  to  that 
shown  in  Fig.  1 1.5c. 

1  1 .2.2  Performance  Analysis  of  General  Binary  Systems 

In  this  section,  we  analyze  the  performance  of  several  typical  binary  digital  communication 
systems  by  applying  the  techniques  derived  in  the  last  section  for  general  binary  receivers. 

Polar  Signaling 

For  the  case  of  polar  signaling,  q(t )  =  -p(t).  Hence, 

Ep  =  Eq  and  Epq  =  -  J  P‘(0  dt  =  —Ep  ( 1 1 .28) 

Substituting  these  results  into  Eq.  (1 1.25b)  yields 

<u'291 


Also  from  Eq.  ( 1 1 .22b), 


h(t)  =  2p(Th  -  t ) 


(11.30) 


A 


" 
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Figure  11. 5 

Realization  of 
the  optimum 
binary  threshold 
detector. 


(b) 


Figure  1 1 .6 

(a)  Optimum 
threshold 
detector  and 

(b)  its  error 
probability  for 
polar  signaling. 


^<  =  Th 


Recall  that  the  multiplication  of  h(t)  by  any  constant  amplifies  both  the  signal  and  the  noise 
by  the  same  factor,  and  hence  does  not  affect  the  system  performance.  For  convenience,  ue 
shall  multiply  h(t)  by  0.5  to  obtain 

h(t)  =  p(Tb  —  t)  01-3D 


From  Eq.  ( 1 1 .27),  the  threshold  a0  is 


a„  =  0 


(11.32) 


Therefore,  for  the  polar  case,  the  receiver  in  Fig.  1 1 ,5a  reduces  to  that  shown  in  Fig.  1 1  -6a  w‘l 
threshold  0.  This  filter  is  equivalent  to  that  in  Fig.  1 1 .3. 

The  error  probability  can  be  expressed  in  terms  of  a  more  basic  parameter  Eb-  the  energ) 
per  bit: 


Eb  =  energy  per  bit 


In  the  polar  case,  Ep  =  Eq  and  the  bit  energy  Eb  is 

Eb  =  Ep  P  (m  =  1)  +  Eq  P  (m  =  0) 

=  £pP(m  =  1)  +  Ep[l  —  P(m  =  1)] 
=  Ep 
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Figure  1 1 .6 

(a)  Optimum 
threshold 
detector  and 

(b)  its  error 
probability  for 
polar  signaling 


h(t)  =  p(Tb-t) 


T(t) 


t=Tb 


r  (Tb) 


Decision  :  p(t)  if  r  >  0 
-p(t)  if  r  <  0 


(a) 


and  from  Eq.  (1 1.29), 


Pb  =  Q 


(11.33) 


The  parameter  Eb/M  is  the  normalized  energy  per  bit,  which  will  be  seen  in  future  discussions 
as  a  fundamental  parameter  serving  as  a  figure  of  merit  in  digital  communication.*  Because 
the  signal  power  is  equal  to  Eb  times  the  bit  rate,  a  given  Eb  is  equivalent  to  a  given  signal 
power  (for  a  given  bit  rate).  Hence,  when  we  compare  systems,  for  a  given  value  of  Eb,  we  are 
comparing  them  for  a  given  signal  power. 

Figure  1 1.6b  plots  Pb  as  a  function  of  Eb/N  (in  decibels).  Equation  (1 1.33)  indicates 
that,  for  optimum  threshold  detection,  the  polar  system  performance  depends  not  on  the  pulse 
shape,  but  on  the  pulse  energy. 


On-Off  Signaling  lfC  .  . 

In  the  case  of  on-off  signaling,  q(t)  =  0,  and  the  rece. ver  of  Fig.  1 1 ,5a  can  remove  the  lower 
branch  filter  of  q(Tb  -  t ).  Based  on  Eq.  ( 1 1 .27),  the  optimum  threshold  for  on-off  signaling 

receiver  is 


a0  —  £p/2 


*  If  the  transmission  rate  is  Rb  pulses  per 
that  5/ /A/7?/,  is  similar  to  the  parameter  y 


second,  the  signal  power  5,  is  5,  =  EhRh,  and  EbfM  =  5,7 A  Rb.  Observe 
(signal-to-noise  ratio  5,/AZ'fi)  used  in  analog  systems. 
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Additionally,  substituting  q(t )  =  0  into  Eqs.  (11 .24)  and  ( 1 1 .25)  yields 


Eq  =  0,  Epg  =  0, 


(11.34) 


If  both  symbols  m  =  0  and  m  =  1  have  equal  probability  0.5,  then  the  average  bit  energy  is 
given  by 


Eb  = 


Ep  +  Eq  _  Ep 

2  -  y 


Therefore,  the  BER  can  be  written  as 


11.3CC 
C  A 


Pb  =  Q 


(11.35) 


A  comparison  of  Eqs.  ( 1 1 .35)  and  ( 1 1 .33)  shows  that  on-off  signaling  requires  exactly  twice 
as  much  energy  per  bit  (3  dB  more  power)  to  achieve  the  same  performance  (i.e.,  the  same  P/») 
as  polar  signaling. 


Orthogonal  Signaling 

In  orthogonal  signaling,  pit)  and  q(t)  are  selected  to  be  orthogonal  over  the  interval  (0,7/?)* 
This  gives 


fTb 

Epq=  /  p(t)q(t)dt  =  0  0136) 

Jo 

On-off  signaling  is  in  fact  a  special  case  of  orthogonal  signaling.  Two  additional  examples  ot 
binary  orthogonal  pulses  are  shown  in  Fig.  11.7.  From  Eq.  (1 1.25), 


Assuming  1  and  0  to  be  equiprobable, 


(11.37) 


Eb  = 


Ep  Eq 
2 


F»gure  1 1 .8 

Digital 

modulated 

waveforms. 


Figure  1 1.7 

Examples  of 

orthogonal 

signals. 


pit) 


q(t) 

Th 

TJ2 
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and 


Pb  =  Q 


(11.38) 


This  shows  that  the  performance  of  any  orthogonal  binary  signaling  is  inferior  to  that  of  polar 
signaling  by  3  dB.  This  naturally  includes  on-off  signaling. 


1 1 .3  COHERENT  RECEIVERS  FOR  DIGITAL 
CARRIER  MODULATIONS 

We  introduced  amplitude  shift  keying  (ASK),  frequency  shift  keying  (FSK).  and  phase  shift 
keying  (PSK)  in  Section  7.9.  Figure  1 1 .8  uses  a  rectangular  baseband  pulse  to  show  the  three 
binary  schemes.  The  baseband  pulse  may  be  specifically  shaped  (e.g..  a  raised  cosine)  to 
eliminate  intersymbol  interference  and  to  stay  within  a  finite  bandwidth. 

BPSK 

In  particular,  the  binary  PSK  (BPSK)  modulation  transmits  binary  symbols  via 

1:  V2p'(t)cos  wct 
0:  -  \/2p’ (t)  cos  wct 

Here  p'(t)  denotes  the  baseband  pulse  shape.  When  p(t)  =  Vi p\t)  cos  ojct,  this  has  exactly 
the  same  signaling  form  as  the  baseband  polar  signaling.  Thus,  the  optimum  binary  receiver 
also  takes  the  form  of  Fig.  11.5a.  As  a  result,  for  equally  likely  binary  data,  the  optimum 
threshold  a0  =  0  and  the  minimum  probability  of  detection  error  is  identically 


Figure  11.8 

Digital 

modulated 

waveforms. 


1  0  1  0  0 


Baseband 

1  1  — 

i  :  :  1,1  i 

•  III 
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where  the  pulse  energy  is  simply 

Th 


rib 

-p  =  /  P2(t)dt 

Jo 

rTb 

=  2  I p'{ 

Jo 

rTh 

=  \P' 

Jo 


(f)]2  cos2  coctdt 


(t)]dt 


=  En 


This  result  requires  a  carrier  frequency  sufficiently  high  such  that/, T/,  >  >  1 . 

Binary  ASK 

Similarly,  for  binary  ASK,  the  transmission  is 

1:  \/2//(f)cos  coct 

0:  0 

This  coincides  with  the  on-off  signaling  analyzed  earlier  such  that  the  optimum  threshold 
should  be  aQ  =  Ep/2  and  the  minimum  BER  for  binary  ASK  is 


(11.40) 


where 


Figure  11 .9 

Coherent 
detection  of 
digital 
modulated 
signals. 


Figure  11.10 

Optimum 
coherent 
detection  of 
binary  FSK 
signals. 


F  _  ep  _  Ep' 

Comparison  of  Eq.  (1 1.39)  and  Eq.  ( 1 1.40)  shows  that  for  the  same  performance,  the  pultf 
energy  in  ASK  must  be  twice  that  in  PSK.  Hence,  ASK  requires  3  dB  more  power  than  PSK. 
Thus,  in  optimum  (coherent)  detection,  PSK  is  always  preferable  to  ASK.  For  this  reason.  ASK 
is  of  no  practical  importance  in  optimum  detection.  But  ASK  can  be  useful  in  noncoherent 
systems  (e.g.,  optical  communications).  Envelope  detection,  for  example,  can  be  applied  to 
ASK.  In  PSK,  the  information  lies  in  the  phase,  and.  hence,  it  cannot  be  detected  noncoherent!) 

The  baseband  pulses  p(t)  used  in  carrier  systems  should  be  shaped  to  minimize  the  IS! 
The  bandwidth  of  the  PSK  or  ASK  signal  is  twice  that  of  the  corresponding  baseband  signal 
because  of  modulation.* 


Bandpass  Matched  Filter  as  a  Coherent  Receiver 

For  both  PSK  and  ASK,  the  optimum  matched  filter  receiver  of  Fig.  1 1.5a  can  be  implemented- 
As  shown  in  Fig.  1 1.9a,  the  received  RF  pulse  can  be  detected  by  a  filter  matched  to  the  R 
pulse  p(t)  followed  by  a  sampler  before  a  threshold  detector. 

On  the  other  hand,  the  same  matched  filter  receiver  may  also  be  modified  into  Fig-  1 1  ■ 
without  changing  the  signal  samples  for  decision.  The  alternative  implementation  first  demo 
ulates  the  incoming  RF  signal  coherently  by  multiplying  it  with  Jlcos  coct.  The  product  is 


We  can  also  use  QAM  (quadrature  multiplexing)  to  double  bandwidth  efficiency. 
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Figure  1 1 .9 

Coherent 
detection  of 
digital 
modulated 
signals. 


(b) 


Figure  11.10 

Optimum 
coherent 
detection  of 
binary  FSK 
signals. 


the  baseband  pulse*  p'(t)  plus  a  baseband  noise  with  PSD  jV/2  (see  Example  9. 13),  and  this  is 
applied  to  a  filter  matched  to  the  baseband  pulse //(f ).  The  two  receiver  schemes  are  equivalent. 
They  can  also  be  implemented  as  correlation  receivers. 


Frequency  Shift  Keying 

In  FSK,  RF  binary  signals  are  transmitted  as 

0  :  y/2p'(t)  cos  [<dc  -  ( Acu/2)]/ 
1  :  >/2p'U)  cos  [a>c  +  ( Aco/2 )]/ 


Such  a  waveform  may  be  considered  to  be  two  interleaved  ASK  waves.  Hence,  the  PSD  will 
consist  of  two  PSDs,  centered  at  \f,  -  ( A//2)]  and  If,  +  ( A//2)].  For  a  large  A///c,  the  PSD 
will  consist  of  two  nonoverlapping  PSDs.  For  a  small  A///,,  the  two  spectra  merge,  and  the 
bandwidth  decreases.  But  in  no  case  is  the  bandwidth  less  than  that  of  ASK  or  PSK. 

The  optimum  correlation  receiver  for  binary  FSK  is  given  in  Fig.  11.10.  Because  the  pulses 
have  equal  energy,  when  the  symbols  are  equally  likely,  the  optimum  threshold  a„  =  0. 

Consider  the  rather  common  case  of  rectangular  p'(t)  =  A.  that  is,  no  pulse  shaping 

in  FSK. 

q(t)  =  n/2A  cos  - — 

p(t)  =  Via  cos  + 


*  There  is  also  a  spectrum  of  p'(D 


centered  at  2 a>c.  which  is  eventually  eliminated  by  the  filter  matched  to  pU). 


"■  11  ■ 
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To  compute  Pi,  from  Eq.  (1 1.25b),  we  need  Epq, 

rTb 

Epq  =  /  P(t)q(t)dt 

Jo 

=  2 A2  J  cos^Wc-  cos^o)c  +  ^-^tdt 


-Lt 
[ 


Tb  fTb 

cos  (A co)t  dt  +  /  cos  2 coct  dt  \ 


J  cos 


=  AzTb 


sin  (A co)Tb  sin  2cocTb 


■1 


(A  (o)Tb  2(ocTb 

In  practice  <yc7*  »  1,  and  the  second  term  on  the  right-hand  side  can  be  ignored.  Therefore, 

Enq  =  A2Tb  sine  (A coTb) 


Figure  11.11 

(a)  The  minimum 
of  the  sine 
function  and 

(b)  the  MSK 
spectrum. 


Similarly, 


Eb  =  Ep  =  Eq=  [  h[p(t)]2  dt  =  A2Th 

Jo 

The  BER  analysis  of  Eq.  (1 1.25b)  for  equiprobable  binary  symbols  1  and  0  becomes 


Pb  =  Q 


I  Eb  —  Eb  sine  (AcoTb) 


AT 


It  is  therefore  clear  that  to  minimize  P/,,  we  should  select  A co  for 
sine  (AcoTb)  is  minimum.  Figure  11.11a  shows  sine  (AcoTb)  as  a 
minimum  value  of  Epq  is  —  0.2M  A2Tb  at  A  co  •  Tb  =  \A3n  or  when 


the  binary  FSK  such  that 
function  of  ( AcoTb)-  Tte 


A  co 


0.715 


0.715/?*, 


This  leads  to  the  minimum  binary  FSK  BER 


Pb  =  Q 


(11.41a) 


When  Epq  =  0,  we  have  the  case  of  orthogonal  signaling.  From  Fig.  11.11a, 
that  Epq  =  0  for  A /  =  n/2Tb,  where  n  is  any  integer.  Although  it  appears  that  binary  F  ^ 
can  use  any  integer  n  when  selecting  A/,  larger  A f  means  wider  separation  between  signa  mc 
frequencies  coc  —  (Aa>/2)  and  wc  +  (Aw/2),  and  consequently  larger  transmission  bandwi 
To  minimize  the  bandwidth,  A f  should  be  as  small  as  possible.  Based  on  Fig.  1 1-1  la, 
minimum  value  of  A f  that  can  be  used  for  orthogonal  signaling  is  1  /27V  FSK  using  this  va  u 
of  A/  is  known  as  minimum  shift  keying  (MSK). 


Minimum  Shift  Keying  d 

In  MSK,  not  only  are  the  two  frequencies  selected  to  be  separated  by  1  /27V  but  we  shou  ^ 
also  take  care  to  preserve  phase  continuity  when  switching  between/  ±  A f  at  the  transmits 
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Figure  11.11 

(a)  The  minimum 
of  the  sine 
function  and 

(b)  the  MSK 
spectrum. 


0 


-0.217 


(A io)Tb 


(a) 


(b) 


This  is  because  abrupt  phase  changes  at  the  bit  transition  instants  when  we  are  switching 
frequencies  would  significantly  increase  the  signal  bandwidth.  FSK  schemes  maintaining  phase 
continuity  are  known  as  continuous  phase  FSK  (CPFSK),  of  which  MSK  is  one  special  case. 
These  schemes  have  rapid  spectral  roll-off  and  better  spectral  efficiency. 

To  maintain  phase  continuity  in  CPFSK  (or  MSK).  the  phase  at  every  bit  transition  is  made 
dependent  on  the  past  data  sequence.  Consider,  for  example,  the  data  sequence  1001. . .  starting 
at  f  _  O.  The  first  pulse  corresponding  to  the  first  bit  1  is  cos  [wc  +  (Aw/2)  ]/  over  the  interval 
0  to  Tb  seconds  At  t  =  Tb,  this  pulse  ends  with  a  phase  [a>c  +  (Aa>/2)]Tb.  The  next  pulse, 
corresponding  to  the  second  data  bit  0,  is  cos  [a>c-  (Aa>/2)]t.  To  maintain  phase  continuity 
at  the  transition  instant,  this  pulse  is  given  additional  phase  (co(  +  A (o)Tb.  We  achieve  this 

continuity  at  each  transition  instant  kTb. 

MSK  being  an  orthogonal  scheme,  its  error  probability  is  given  by 


(1 1.41b) 
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MSK  also  has  other  useful  properties.  It  has  self-synchronization  capabilities  and  its 
bandwidth  is  only  1 .5 Rh,  as  shown  in  Fig.  11.11b.  This  is  only  50%  higher  than  for  duobinary 
signaling.  Moreover,  the  MSK  spectrum  decays  much  more  rapidly  as  l//4,  in  contrast  to  the 
PSK  (or  bipolar)  spectrum,  which  decays  only  as  \/f~  [see  Eqs.  (7.15)  and  (7.22)].  Because  of 
these  properties,  MSK  has  received  a  great  deal  of  practical  attention.  For  more  discussions, 
see  Refs.  1  and  2. 

1 1 .4  SIGNAL  SPACE  ANALYSIS  OF 
OPTIMUM  DETECTION 

Thus  far,  our  discussions  on  digital  receiver  optimization  have  been  limited  to  the  simple  case 
of  linear  threshold  detection  for  binary  transmissions  under  Gaussian  channel  noise.  Such 
receivers  are  constrained  by  their  linear  structure.  To  determine  the  truly  optimum  receivers, 
we  need  to  answer  the  question:  Given  an  M-ary  transmission  with  channel  noise  n (t)  and 
channel  output 

y(0  =  Pi(t)  +  n(r)  0  <  t  <  T0  i  =  1,  . . . ,  M 

what  receiver  is  optimum  that  can  lead  to  minimum  error  probability? 

To  answer  this  question,  we  shall  analyze  the  problem  of  digital  signal  detection  from 
a  more  fundamental  point  of  view.  Recognize  that  the  channel  output  is  a  random  process 
y (r),  0  <  t  <  Ta.  Thus,  the  receiver  must  make  a  decision  by  transforming  y(f)  int0  a 
finite-dimensional  decision  space.  Such  an  analysis  is  greatly  facilitated  by  a  geometrical 
representation  of  signals  and  noises. 

A  Word  about  Notation:  Let  us  clarify  the  notations  used  here  to  avoid  confusion.  As 
before,  we  use  roman  type  to  denote  an  RV  or  a  random  process  [e.g.,  x  or  x(r)].  A  particular 
value  assumed  by  the  RV  in  a  certain  trial  is  denoted  by  italic  type.  Thus,  x  represents  the  value 
assumed  by  x.  Similarly,  x(t)  represents  a  particular  sample  function  of  the  random  process 
\(t).  For  random  vectors,  we  follow  the  same  convention:  a  random  vector  is  denoted  by  roman 
boldface  type,  and  a  particular  value  assumed  by  the  vector  in  a  certain  trial  is  represented  b\ 
boldface  italic  type.  Thus,  r  denotes  a  random  vector,  but  r  is  a  particular  value  of  r. 

1  1 .4.1  Geometrical  Signal  Space 

We  now  formally  show  that  a  signal  in  an  A/ -ary  transmission  system  is  in  reality  an  w- 
dimensional  vector  and  can  be  represented  by  a  point  in  an  /i-dimensional  hyperspace  (n  <  ^  • 
The  foundations  for  such  a  viewpoint  were  first  laid  during  the  introduction  of  the  signal  space 
in  Section  2.7. 

rpUp 

To  begin,  an  ordered  n-tuple  (jci,  x2 .  xn)  is  an  n-dimensional  vector  x. 

//-dimensional  (signal)  vector  space  is  spanned  by  n  unit  vectors  <p ( .  <pi,  ...  •  Vn 

<p\  =  (1,  0,  0 . 0) 

<p2  =  (0,  1,  0 . 0) 


<Pn  =  (0,  o,  0 . 1) 


(11.42) 
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Any  vector  jc  =  (jcj  ,  JC2,  . . . ,  xn)  can  be  expressed  as  a  linear  combination  of  n  unit  vectors. 


X  =X\ (px  +JC2^2  “I - 1 -*n<Pn  0  l-43a) 

=  YlXk<Pk  (11.43b) 

k= 1 

This  vector  space  is  characterized  by  the  definitions  of  the  inner  product  between  two 
vectors 

<x,y>  =  Y^xkyk  (11.44) 

*=i 

and  the  vector  norm 

IMIJ  =  <*.*>  =  £■<*  C1.4S) 

k= 1 


The  norm  ||x||  is  the  length  of  a  vector.  Vectors  x  andy  are  said  to  be  orthogonal  if  their  inner 
product 


<x ,  y>  =  0 


(11.46) 


A  set  of  n-dimensional  vectors  is  said  to  be  independent  if  none  of  the  vectors  in  the 
set  can  be  represented  as  a  linear  combination  of  the  remaining  vectors  in  that  set.  Thus,  if 
y  | ,  y->,  . . . ,  ym  is  an  independent  set,  then  the  equality 


aiJ’i 


+  0^2  4"  “  '  4"  an>y m  ~  ® 


(11.47) 


would  require  that  a,  =  <U  =  1 . m.  A  subset  of  vectors  m  a  given  /.-dimensional  space 

can  have  dimensionality  less  than  n.  For  example,  in  a  three-dimensional  space,  all  vectors 
lying  in  one  plane  can  be  specified  by  two  dimensions,  and  all  vectors  lying  along  a  line  can 

be  specified  by  one  dimension. 

An  n-dimensional  space  can  have  at  most  n  independent  vectors.  If  a  space  has  a  maxi¬ 
mum  of  n  independent  vectors,  then  every  vector  *  in  this  space  can  be  expressed  as  a  linear 
combination  of  these  n  independent  vectors.  Thus,  any  vector  m  this  space  can  be  specified 
by  n-tuples.  For  this  reason,  a  set  of  n  independent  vectors  m  an  n- dimensional  space  can  be 

viewed  as  its  busis  vectors. 

The  members  of  a  set  of  basis  vectors  form  coordinate  axes,  and  they  are  not  unique.  The 
n  unit  vectors  in  Eq.  ( 1 1 .42)  are  independent  and  can  serve  as  basis  vectors.  These  vectors  ave 
an  additional  property  in  that  they  are  (mutually)  orthogonal  and  have  normalized  length. 

that  is. 


<«V  Vk>  = 


0 

1 


j 

j=k 


(11.48) 
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Such  a  set  is  an  orthonormal  set  of  vectors.  They  capture  an  orthogonal  vector  space.  Any 
vector  x  =  (jti,  X2, . . x„)  can  be  represented  as 


x  =X\<P\  +X2V2  4 - \-Xn<P„ 


where  x,  is  the  projection  of  x  on  the  basis  vector  <pk  and  is  the  A'th  coordinate.  By  using 
Eq.  ( 1 1 .48).  the  kth  coordinate  can  be  obtained  from 


(11.49) 


<x,<pk>=xk  ft  =  1,2,...,  n 


Since  any  vector  in  the  //-dimensional  space  can  be  represented  by  this  set  of  n  basis  vectors, 
this  set  forms  a  complete  orthonormal  (CON)  set. 

1 1 .4.2  Signal  Space  and  Basis  Signals 

The  concepts  of  vector  space  and  basis  vectors  can  be  generalized  to  characterize  continuous 
time  signals  defined  over  a  time  interval  ©.  As  described  in  Section  2.7,  a  set  of  orthonormal 
signals  {<pi(t)}  can  be  defined  for  t  e  0  if 


(11.50) 


If  {(pi(t)}  form  a  complete  set  of  orthonormal  basis  functions  of  a  signal  space  defined  over  (M), 
then  every  signal  *(f)  in  this  signal  space  can  be  expressed  as 


x(t)  =  y}Txk<pk(t)  t  e  0 


(11.51) 


k 

where  the  signal  component  in  the  direction  of  <pk{t)  is* 


One  such  example  is  for  ©  =  (-00,  00).  Based  on  sampling  theorem,  all  low-pass  signals 
with  bandwidth  B  Hz  can  be  represented  by 


(11.53a) 


k 


*  If  {^*(01  is  complex,  orthogonality  implies 


<Pj(t)(pl(t)dt  =0 


and  Eq.  ( 1 1 .52)  becomes 
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with 


(11.53b) 


Just  as  there  are  an  infinite  number  of  possible  sets  of  basis  vectors  for  a  vector  space,  there  are 
an  infinite  number  of  possible  sets  of  basis  signals  for  a  given  signal  space.  For  a  band-limited 
signal  space,  {  sflB  ■  sine  (2j xBt  -  kn)}  is  one  possible  set  of  basis  signals. 

Note  that  r  (k /2  B)  are  the  Nyquist  rate  samples  of  the  original  band-limited  signal.  Since  a 
band-limited  signal  cannot  be  time-limited,  the  total  number  of  Nyquist  samples  needed  will  be 
infinite.  Samples  at  large  k ,  however,  can  be  ignored,  because  their  contribution  is  negligible. 
A  rigorous  development  of  this  result,  as  well  as  an  estimation  of  the  error  in  ignoring  higher 
dimensions,  can  be  found  in  Landau  and  Poliak.  ' 

Scalar  Product  and  Signal  Energy 

In  a  certain  signal  space,  let  x(t)  and  y(t)  be  two  signals.  If  { <pk (/))  are  the  orthonormal  basis 
signals,  then 


x(t)  = 


i 


yU )  = 


Hence, 


Because  the  basis  signals  are  orthonormal,  we  have 

f  x(t)y{t)dt  =  YJ  xkyk 


(11.54a) 


The  right-hand  side  of  Eq.  (1 1.54a),  however,  is  by  the  inner  product  of  vectors  x  and  y 
Therefore,  we  again  arrive  at  Parseval’s  theorem. 


<x(t),  y(t)>  =  f  x(t)y(t)  dt  =  H  =  <x,y> 


xkyk  =  <x,y> 


(11.54b) 


The  signal  energy 


for  a  signal  x(t)  is  a  special  case.  The  energy  Ex  is  given  by 


=  <x,  x>  =  ||x  II 


2 


(11.55) 


Hence,  the  signal  energy  is 


is  equal  to  the  square  of  the  length  of  the  corresponding  vector. 
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Example  11.1  A  signal  space  consists  of  four  signals  5i(f),  52(0.  53(0.  and  54(f),  as  shown  in  Fig.  11.12. 

Determine  a  suitable  set  of  basis  vectors  and  the  dimensionality  of  the  signals.  Represent  these 
signals  geometrically  in  the  vector  space. 


Figure  11.12 

Signals  and  their 
representation  in 
signal  space. 


(a) 


The  two  rectangular  pulses  <p\ ( t )  and  <p2(t)  in  Fig.  1 1 . 1 2b  are  suitable  as  a  basis  signal  set. 
In  terms  of  this  set,  the  vectorssi,  52,  53,ands4Correspondingtosignals5|(f),  52(f).  s3(f)’ 
and  54(f)  aresi  =  (1,  -0.5),  s2  =  (-0.5,  1),  s3  =  (0,  -1),  ands4  =  (0.5,  1).  These 
points  are  plotted  in  Fig.  11.1 2c.  Observe  that  the  inner  product  between  s  1  and  54  is 


<si,  $4>  =  0.5  —  0.5  =  0 


Hence,  S|  and  54  are  orthogonal.  This  result  may  be  verified  from  the  fact  that 


5 1  (f )s4(f)  dt  =  0 


Note  that  each  point  in  the  signal  space  in  Fig.  1 1.12c  corresponds  to  some  waveform. 


Determining  an  Orthonormal  Basis  Set 

It  there  are  a  finite  number  of  signals  jr,-(f)  in  a  given  signal  set  of  interest,  then  the  orthonorm 
signal  basis  can  either  be  selected  heuristically  or  systematically.  A  heuristic  approach  requiresa 
good  understanding  of  the  relationships  among  the  different  signals  as  well  as  a  certain  amount 
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of  luck.  On  the  other  hand,  Gram-Schmidt  orthogonalization  is  a  systematic  approach  to 
extract  the  basis  signals  from  the  known  signal  set.  The  details  ot  this  approach  are  given  in 
Appendix  C. 


1 1 .5  VECTOR  DECOMPOSITION  OF  WHITE  NOISE 
RANDOM  PROCESSES 


In  digital  communications,  the  message  signal  is  always  one  ot  the  M  possible  waveforms.  It 
is  therefore  not  difficult  to  represent  all  M  waveforms  via  a  set  of  CON  basis  functions.  The 
real  challenge,  in  fact,  lies  in  the  vector  decomposition  of  the  random  noise  n(/)  at  the  receiver. 
A  deterministic  signal  can  be  represented  by  one  vector,  a  point  in  a  signal  space.  Is  it  possible 
to  represent  a  random  process  as  a  vector  of  random  variables?  If  the  answer  is  positive,  then 
the  detection  problem  can  be  significantly  simplified. 

Consider  a  complete  orthonormal  (CON)  set  of  basis  functions  {<?*(/)}  for  a  signal  space 
defined  over  [0,  T0].  Then  any  deterministic  signal  s(t)  in  this  signal  space  will  satisfy  the 
following  condition: 


(11.56a) 


This  implies  that  for  /  €  [0,  T()],  we  have  the  equality* 

s(l)  =  ^Sktpkit) 


However,  for  random  processes  defined  over  [0.  Tc],  this  statement  is  generally  not  true 


Certain  modifications  are  necessary. 


1  1 .5.1  Determining  Basis  Functions  for  a  Random  Process 

First  of  all  a  general  random  process  x(f)  cannot  strictly  satisfy  Eq.  ( 1 1 .56a).  Instead,  a  proper 
convergence  requirement  is  in  the  mean  square  sense,  that  is. 


(11.56b) 


This  equality  can  be  denoted  as 


x(f)  =s']Tx*wt(r) 


(11.56c) 


k 


If  x(t)  and  y(t)  are  equal  in 
these  two  random  processes  h 
signals  (or  signal  differences) 


equal  in  the  mean  square  sense,  then  physically  the  difference  between 
•ocesses  have  zero  energy.  As  far  as  we  are  concerned  in  communications, 
Terences)  with  zero  energy  have  no  physical  effect  and  can  be  viewed  as  0. 


*  Strictly  speaking,  this  equality  is  true  not 


for  the  entire  interval  [0.  T„).  The  set  of  points  for  which  equality  does 


not  hold  is  a  measure  zero  set. 
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For  a  set  of  deterministic  signals,  the  basis  signals  can  be  derived  via  the  Gram-Schmidt 
orthogonalization  procedure.  However,  Gram-Schmidt  is  invalid  for  random  processes. 
Indeed,  a  random  process  x(/)  is  an  ensemble  of  signals.  Thus,  the  basis  signals  \(pk(t))  must 
also  depend  on  the  characteristics  of  the  random  process. 

The  full  and  rigorous  description  of  the  decomposition  of  a  random  process  can  be  found 
in  some  classic  references.4  Here,  it  suffices  to  state  that  the  orthonormal  basis  functions  must 
be  solutions  of  the  following  integral  equation 

fT° 

ki  ■  (pi(t)  =  /  Rx(t,  1 1)  •  <pi(t\)dt\  0  <t  <Ta  (11.57) 

«/o 

The  solution  Eq.  (11.57)  is  known  as  the  Karhunen-Loeve  expansion.  The  auto-correlation 
function  Rx(t ,  t\)  is  known  as  its  kernel  function.  Indeed,  Eq.  (1 1.57)  is  reminiscent  of  the 
linear  algebra  equation  with  respect  to  eigenvalue  A  and  eigenvector  0. 

A0  =  /?x0 

in  which  0  is  a  column  vector  and  Rx  is  a  positive  semidefinite  matrix;  A/  are  known  as  the 
eigen values,  whereas  the  basis  functions  (p/(t)  are  the  corresponding  eigenfunctions. 

e  Karhunen-Loeve  expansion  clearly  establishes  that  the  basis  functions  of  a  random 
process  x(t)  depend  on  its  autocorrelation  function  Rx(t,  /,).  We  cannot  arbitrarily  select  a 
unction  set.  In  fact,  solving  the  Karhunen-Loeve  expansion  can  be  a  nontrivial  task. 

1  1 .5.2  Geometrical  Representation  of 
White  Noise  Processes 


*  V7I  U 


^luuviuu;  wuuc 


a vi me  autocorrelation 

**('.  f|)  =  —  8(t  —  t\ ) 

For  this  special  kernel,  the  integral  equation  Eq.  ( 1 1 .57)  is  reduced  to  a  simple  form  of 

ki  ■  <pi(t)  =  ^  __5(,  _  ,,) .  <£>,(,!)<*,  =  —^t)  t  €  (0,  To)  (U-51 

whiteMise  nm^eS  C°N  SC*  °f  baSIS  functions  can  be  used  to  represent  stationai 

Additionally,  the  eigenvalues  are  identically  =  M/2, 

cations  we  focn  ^  '  'S°  utmost  lmPortance  to  us.  In  most  digital  communication  appl 

channels  In  the  rT  fT™™  receiver  des'gn  and  performance  analysis  under  white  nois 

tZ to  1 Tm™°nS-  W  h3Ve  a"  orthonorma.  set  of  basis  functior 
m(t) '  rePresent  the  M  waveforms  {*,(/)},  such  that 

si(t)  =  J^SiJcViU)  /  =  l . m  (11  -59a 

* 

channel  noiVnlmsuchtha^ ,UnCt'°nS  are  a,so  suitable  for  the  representation  of  the  whit 


nw(t)  =s  n k<pk(t)  0  <  t  <  T„ 

k 


Fl9ure  11.13 

Geometrical 
^presentation  of 
0  Gaussian 
random  process. 


(11.59b) 
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Consequently,  when  the  transmitter  sends  s,(f).  the  received  signal  can  be  decomposed  into 


y(0  =  s,(f)  +  n„  (/) 

=S'  X  si.k<PkV)  +  X  nkVkU) 

k  k 

ls-£v*«<'>  (1 1.59c) 

k 


by  defining 


yk  =  f  y (t)<pl(t)dt  =  Sijc  +  n*  if  5/(r)  i 

Jo 


is  sent 


(1 1.59d) 


As  a  result,  when  the  channel  noise  is  white,  the  received  channel  output  signal  can  be 
effectively  represented  by  a  sequence  of  random  variables  {y* }  of  Eq.  ( 1 1  -59d).  In  other  words, 
the  optimum  receiver  for  white  noise  channels  can  be  derived  from  the  information  contained  in 

(yt.  y2 . y *’  •••)• 

of  an  ensemble  of  sample  functions.  The  coefficients 


We  note  that  white  noise \(t)  consists 


x*  =  f  \(t)<p*k(t)dt 

Jo 


k  =  1.  2, 


,  ^  *.*  n  nf  Fn  (||  59b)  will  be  different  for  each  sample  function.  Consequently. 

,n  the  decomposition  of  Eq.  ^  ^  *  specific  vector  (xu  X2> . .  and 

the  coefficients  are  s.  ac  This  means  that  the  ensemble  of  sample  functions 

will  map  into  one  point  in  the  s  g  •  P  ^  of  points  in  ,he  signal  space,  as  shown 

for  the  random  process  x  /  wdl  p  three-dimensional  graph  (because  it  is  not 

in  Fig.  11.13.  Although  this  figure  shows  on  y  tQ  jndicate  the  idea. 

possible  to  show  a  higher  dimen*lona  ju  ‘  (the  sample  function)  is  a  certain  point  x. 
For  each  trial  of  the  experiment  the  outcome  (  ^  #  JP  ^  ^  the  density  of  points 

The  ensemble  of  points  in  the  sig  P  f  %  jn  that  region.  ,f  we  denote  the  joint 

directly  proportional  to  the  proba  y 
PDF  of  x | ,  . x„  by  p*(x),  then 


»v(X)  =  Pxixi  -x„(X|-  *2.  •  •  •»  V'|) 


(11.60) 
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Thus,  px(x)  has  a  certain  value  at  each  point  in  the  signal  space,  and  px(x)  represents  the 
relative  probability  (dust  density)  of  observing  x  =  x. 


1  1 .5.3  White  Gaussian  Noise 

If  the  channel  noise  n w(t)  is  white  and  Gaussian,  then  from  the  discussions  in  Section  8.6,  the 
expansion  coefficients 


n k  =  [  MO  (pk(t)  dt  (11.61) 

Jo 

are  also  Gaussian.  Indeed,  (ni,  n2 ,  ...»  n*, . . . )  are  jointly  Gaussian. 

Here,  we  shall  provide  some  fundamentals  on  Gaussian  random  variables.  First,  we  define 
a  column  vector  of  n  random  variables  as 


xi 

X2 

X  = 

_  X"  . 

Note  that  x T  denotes  the  transpose  of  x.  and  x  denotes  the  mean  of  x.  Random  variables  (RVs) 
X|,  X2,  ...»  x„  are  said  to  be  jointly  Gaussian  if  their  joint  PDF  is  given  by 


Px|X2...x„ C*1  *  •  •  •  *  *n)  — 


1 


(2n)"/2J3tt(Kd 


exp 


-  —  (x  —  \)tKx  {(x 


-«*] 


(11.62) 


where  Kx  is  the  n  x  n  covariance  matrix 


•••  o\n 
n 

' ' '  ann 

and  the  covariance  of  x,  and  x,  is 

°ij  =  (Xj  -  Xi)(\j  -  Xj) 


Kx  =  (x  -  x)  •  (x  -  x)r  = 


<711 

0 12 

(721 

<722 

®n\ 

0/12 

(11.63a) 


(11.63b) 


Here,  we  use  conventional  notations  det(Kx)  and  K~]  to  denote  the  determinant  and  the  inverse 
of  matrix  Kx ,  respectively. 

Gaussian  variables  are  important  not  only  because  they  are  frequently  observed,  but  also 
because  they  have  certain  properties  that  simplify  many  mathematical  operations  that  are 
otherwise  impossible  or  very  difficult.  We  summarize  these  properties  as  follows: 


P- 1 :  The  Gaussian  density  is  completely  specified  by  only  the  first-  and  second-order  statistics 
x  and  Kx.  This  follows  from  Eq.  ( 1 1 .62). 

P-2:  If //jointly  Gaussian  variablesxi,  x2,  ...,  x„  are  uncorrelated,  then  they  are  independent 
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If  the  n  variables  are  uncorrelated,  axj  =  0  (i  ^  j),  and  Kx  reduces  to  a  diagonal 
matrix.  Thus,  Eq.  ( 1 1 .62)  becomes 


1  -(*,  -  x7)2 


(II. 64a) 


(11.64b) 


=  />X|(*l)Px2(*2)  •  •  -PxMn) 


As  we  observed  earlier,  independent  variables  are  always  uncorrelated,  but  uncorrelated 
variables  are  not  necessarily  independent.  For  the  case  of  jointly  Gaussian  RVs,  however, 
uncorrelatedness  implies  independence. 

P-3:  When  xj,  X2,  ....  x„  are  jointly  Gaussian,  all  the  marginal  densities,  such  as />X|  (•*/)<  ar>d 
all  the  conditional  densities,  such  as Pxjx.j\xt\i  -x.p(xi’  xj\xk>  x/,...,  Xp),  are  Gaussian. 
This  property  can  be  readily  verified  (Prob.  8.2-11). 

P-4:  Linear  combinations  of  jointly  Gaussian  variables  are  also  jointly  Gaussian.  Thus,  if  we 
form  m  variables  y j ,  y2,  •  •  • .  y»i  (m  —  n )  obtained  from 


n 


y i  —  aik  *k 

*= I 


(11.65) 


then  yi,  y2,  ....  ym  arc  also  jointly  Gaussian  variables. 


1  1 .5.4  Properties  of  Gaussian  Random  Process 


Gaussian  [Eq.  (11.62)]  tor  every  n  ana  torevc.y  ^  o,,  .2,  .... 

RVs  x  (/ 1 ),  x(/2) . x(f„)  of  a  Gaussian  random  process  is  given  by  Eq.  ( 1 1 .62)  in  which  the 


RVsx(f|),  x(r2).  • 
mean  and  the  cova 


-  * 

and  the  covariance  matrix  Kx  are  specified  by 


(11.66) 


and  atj  -  Rx(ti,  tj)  -  x(6)  •  x(tj) 


This  shows  that  a  Gaussian  random  process  is  completely  specified  by  its  autocorrelation 


function  /?x(t„  tj)  and  its  mean  value  x(/). 
As  discussed  in  Chapter  9,  if  the  Gi 


Chapter  9.  if  the  Gaussian  random  process  satisfies  two  additional 


conditions: 


Rx (ti,  tj)  —  R\(ti  tj) 


(11.67a) 


and 


x(/)  =  constant  for  all  t 


(11.67b) 


then  it  is  a  wide-sense  stationary  pn 
of  the  Gaussian  RVs  x(ri),  x(^2)»  • 


stationary  process.  Moreover.  Eqs.  (1 1.67)  also  mean  that  the  joint  PDF 
it  )  x(/-> )....,  x(r„)  is  also  invariant  to  a  shift  of  time  origin.  Hence, 
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we  can  conclude  that  a  wide-sense  stationary  Gaussian  random  process  is  also  strict-sense 
stationary. 

Another  significant  property  of  the  Gaussian  process  is  that  the  response  of  a  linear  system 
to  a  Gaussian  process  is  also  a  Gaussian  process.  This  arises  from  property  P-4  of  the  Gaussian 
RVs.  Let  x(f )  be  a  Gaussian  process  applied  to  the  input  of  a  linear  system  whose  unit  impulse 
response  is  h(t).  If  y(r)  is  the  output  (response)  process,  then 


00 


is  a  weighted  sum  of  Gaussian  RVs.  Because  x(/)  is  a  Gaussian  process,  all  the  variables 
\(t  -  k At)  are  jointly  Gaussian  (by  definition).  Hence,  the  variables  y(fj),  y(t2),  •••»  y(^) 
for  all  n  and  every  set  (t\9  ti,  . ..,  tn)  are  linear  combinations  of  variables  that  are  jointly 
Gaussian.  Therefore,  the  variables  y(/i),  y(^),  . . . ,  y  (tn)  must  be  jointly  Gaussian,  according 
to  the  earlier  discussion.  It  follows  that  the  process  y(t)  is  a  Gaussian  process. 

To  summarize,  the  Gaussian  random  process  has  the  following  properties: 

1 .  A  Gaussian  random  process  is  completely  specified  by  its  autocorrelation  function  and  mean 


value. 


2.  If  a  Gaussian  random  process  is  wide-sense  stationary,  then  it  is  stationary  in  the  strict 
sense. 


3.  The  response  of  a  linear  system  to  a  Gaussian  random  process  is  also  a  Gaussian  random 
process. 


Consider  a  white  noise  process  n  w(t)  with  PSD  A/*/2.  Then  any  complete  set  of  orthonormal 
basis  signals  <p\  (r),  <p2(t ),  . . .  can  decompose  n w(t)  into 


nH.(f)  =  n\cp\(t)  +  r\2(p2(t)  H - 

= 


* 


White  noise  has  infinite  bandwidth.  Consequently,  the  dimensionality  of  the  signal  space  is 
infinity. 


We  shall  now  show  that  RVs  ni ,  m,  . . .  are  independent,  with  variance  M /2  each,  fwsl 


we  have 


ro  Jo 
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Because  /?nii,(r)  =  (A/72)  <$(r),  then 


nm*  =  f  [  -  a)<Pj(a)<pk(P)dadp 

Jo  Jo  L 

M  fT° 

=  —  I  <pj(a)<pk(a)  da 
Jo 


|  0  j*k 


(11.68) 


Hence,  n,  and  n*  are  uncorrelated  Gaussian  RVs,  each  with  variance  A/7 2.  Since  they  are 
Gaussian,  uncorrelatedness  implies  independence.  This  proves  the  result. 

For  the  time  being,  assume  that  we  are  considering  an  /V -dimensional  case.  The  joint  PDF 
of  independent  joint  Gaussian  RVs  ni ,  n2,  . . . ,  nyv,  each  with  zero  mean  and  variance  A  /2, 
is  [see  Eq.  ( 1 1 .64)] 


N 


Pn(«)  =  n 
7=1 


1  _-"?/2(A/~/2) 

y/2n7T/2 


_ *  -(it? +«?+•••+»»«  )/A/~ 

"  (ttAO*/2 

_  1  -iinn2/-A/~ 

“  (tt^2 


(11.69a) 

(11.69b) 


This  shows  that  the  PDF  pn(n )  depends  only  on  the  norm  ||»||,  which  is  the  sampled  length 
of  the  noise  vector  n  in  the  hyperspace,  and  is  therefore  spherically  symmetrical  if  plotted  in 
the  N -dimensional  hyperspace. 


1 1 .6  OPTIMUM  RECEIVER  FOR  WHITE  GAUSSIAN 
NOISE  CHANNELS 

11.6.1  Geometric  Representations 

We  shall  now  consider,  from  a  more  fundamental  point  of  view,  the  problem  of  A*  -ary  commu¬ 
nication  in  the  presence  of  additive  white  Gaussian  noise  (AWGN).  Such  a  channel  is  known 
as  the  AWGN  channel.  Unlike  the  linear  receivers  previously  studied  in  Secs.  11.1  to  11.3, 
no  constraint  is  placed  on  the  optimum  structure.  We  shall  answer  the  fundamental  quest.on: 

What  receiver  will  yield  the  minimum  error  probability  ?  .... 

The  comprehension  of  the  signal  detection  problem  is  greatly  facilitated  by  geometrical 
representation  of  signals.  In  a  signal  space,  we  can  represent  a  signal  by  a  fixed  point  (or  a 
«cL)  A  random  process  can  be  represented  by  a  random  potn.  (or  a  random  vector).  The 
region  in  which  the  random  point  may  lie  will  be  shown  shaded,  with  the  shading  mtensi  y 

proportional  to  the  probability  of  observing  the  signal  in  that  region.  In  the  M- ary  scheme, 

proportion  v  . mM.  Each  of  these  symbols  is  represented  by 

we  use  M  symbols,  or  messages,  l*  2  ,  .  Thu «  thp 

a  suecified  waveform.  Let  the  corresponding  wavefomts  be  si  <!).  «('<•_■  .  s«  (D  ■ 
syE  (or  message,  m,  is  sen,  b,  transmitting  the  waveform  «(,).  These  waveforms  are 
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Figure  11.14 

M-ary  communi¬ 
cation  system. 


Figure  11.16 

Binary  commu¬ 
nication  in  the 
presence  of 
noise. 


corrupted  by  AWGN  n w(t)  (Fig.  11.14)  with  PSD 

SnJ<»)  =  — 

At  the  receiver,  the  received  signal  r (t)  consists  of  one  of  the  M  message  waveforms  $*(*) 
plus  the  channel  noise, 


r(0  =  sk(t)  +  nw(t) 


(11.70a) 


Because  the  noise  nw.(r)  is  white,  we  can  use  the  same  basis  functions  to  decompose  both 
Sk(t)  and  nw(t).  Thus,  we  can  represent  r (t)  in  a  signal  space  by  denoting  r,  s*,  and  nM  as  the 
vectors  representing  signals  r(/),  sk(t ),  and  n^f),  respectively.  Then  it  is  evident  that 

r  =  sk  +  nM  (11.70b) 

The  signal  vector  sk  is  a  fixed  vector,  because  the  waveform  sk(t)  is  nonrandom,  whereas  the 
noise  vector  is  random.  Hence,  the  vector  r  is  also  random.  Because  nM-(/)  is  a  Gaussian 
white  noise,  the  probability  distribution  of  nu.  has  spherical  symmetry  in  the  signal  space  (as 
shown  in  the  last  section).  Hence,  the  distribution  of  r  is  a  spherical  distribution  centered  at  a 
fixed  point  sk ,  as  shown  in  Fig.  11.15.  Whenever  the  message  mk  is  transmitted,  the  probability 
of  observing  the  received  signal  r(/)  in  a  given  scatter  region  is  indicated  by  the  intensity  of  the 
shading  in  Fig.  11.15.  Actually,  because  the  noise  is  white,  the  space  has  an  infinite  number  of 
dimensions.  For  simplicity,  however,  we  have  shown  the  space  to  be  three-dimensional.  This 
will  suffice  to  indicate  our  line  of  reasoning.  We  can  draw  similar  scatter  regions  for  variou* 
points  si,  • .  •,  sm .  Figure  1 1.16a  shows  the  scatter  regions  for  two  messages  m}  and  m 
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Figure  11.16 

Binary  commu¬ 
nication  in  the 
presence  of 
noise. 


when  Sj  and  s,  are  widely  separated  in  signal  space.  In  this  case,  there  is  virtually  no  overlap 
between  the  two  scattered  regions.  If  either  m,  orw,  is  transmitted,  the  received  signal  will  lie 
in  one  of  the  two  scatter  regions.  From  the  position  of  the  received  signal,  one  can  decide  with 
a  very  small  probability  of  error  whether  mj  or  mk  was  transmitted.  In  Fig.  1 1 . 1 6a.  the  received 
signal  r  is  much  closer  to  s,  than  to  sj.  It  is  therefore  more  likely  that  mk  was  transmitted. 
Note  that  theoretically  each  scatter  extends  to  infinity,  although  the  probability  of  observing 
the  received  signal  diminishes  rapidly  as  a  point  is  scattered  away  from  the  center.  Hence, 
there  will  always  be  some  overlap  between  the  two  scatter  sets,  resulting  in  a  nonzero  error 
probability.  Thus,  even  though  the  received  r  is  much  closer  to  sk  in  Fig.  1 1 . 16a.  it  may  still 

be  generated  by  s;  plus  channel  noise. 

Figure  II  16b  illustrates  the  case  of  stronger  noise.  In  this  case,  there  is  a  considerable 
overlap  between  the  two  scattered  regions.  Because  the  received  signal  r  is  closer  to  sj  than 
to  st  it  is  more  likely  that  mj  was  transmitted.  But  in  this  case  there  is  also  a  considerable 
probability  that  mk  may  have  been  transmitted.  Hence  in  this  situation,  there  will  be  a  much 

higher  probability  of  error  in  any  decision  scheme. 

The  optimum  receiver  must  decide,  from  a  knowledge  of  r.  which  message  has  been 
transmitted.  The  signal  space  must  be  divided  into  M  nonoverlapping,  or  disjoint,  decision 

regions  R\  Ri  ...  Rm  ,  corresponding  to  the  M  messages  m,,  m2 . mM .  If  r  falls  in  the 

region/?,, ’the  decision  is  m,.  The  problem  of  designing  the  receiver  then  reduces  to  choosing 
the  boundaries  of  these  decision  regions  /?i .  *2 . to  minimize  the  probability  of  error 

in  decision  making.  .  r  hA 

To  recapitulate:  A  transmitter  sends  a  sequence  of  messages  from  a  set  of  M 
messages  rm  mi  .,mM.  These  messages  are  represented  by  finite  energy  waveforms 
7m  s,(t)  sM(t)  One  waveform  is  transmitted  every  T0  =  Tm  seconds.  We  assume  that 
the  receiver  is  time-synchronized  with  the  transmitter.  The  waveforms  are  corrupted  during 
transmissions  by  an  AWGN  of  PSD  Af/2.  Knowing  the  received  waveform,  the  receiver  must 
decide  which  waveform  was  transmitted.  The  merit  criterion  of  the  receiver  is  the  minimum 
probability  of  error  in  making  this  decision. 


1  1 .6.2  Dimensionality  of  the  Detection  Signal  Space 

Let  us  now  discuss  the  dimensionality  of  the  signal  space  in  our  detection  problem.  If  there  was 
Let  us  now  discu  with  only  M  waveforms  s,(f),  s2(t) . sM(t).  In  this  case 

jTsignaf 'spaced,  at  most,  ^'dimensions  would  suffice.  Tltis  is  because  the  dimensionality 


mmsm 
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of  a  signal  space  is  always  equal  to  or  less  than  the  number  of  independent  signals  in  the 
space  (Sec.  1 1 .4).  For  the  sake  of  generality,  we  shall  assume  the  space  to  have  N  dimensions 
(N  <  M).  Let  (p\  (f),  <p2(r),  . . . ,  (pN(t)  be  the  orthonormal  basis  set  for  this  space.  Such  a  set 
can  be  constructed  by  using  the  Gram-Schmidt  procedure  discussed  in  Appendix  C.  We  can 
then  represent  the  signal  waveform  $*(/)  as 

Sj(t)  =  Sjt\<p\(t)  +Sj,2(p2(t)  H - MyWWvW  (11.71a) 

N 

=  ^,sj*9k(t)  7  =  1.2,...,  A#  (11.71b) 

k=\ 


where 


Sj(t)<pk(t)dt 


(1 1.71c) 


Now  consider  the  white  Gaussian  channel  noise  nH.  (t).  This  signal  has  an  infinite  bandwidth 
( B  =  oo).  It  has  an  infinite  number  of  dimensions  and  obviously  cannot  be  fully  represented 
in  a  finite  W-dimensional  signal  space  discussed  earlier.  We  can,  however,  split  nw(0  int0 
two  components:  (1)  the  portion  of  n w(t)  inside  the  N-dimensional  signal  space,  and  (2)  the 
remaining  component  orthogonal  to  the  N-dimensional  signal  space.  Let  us  denote  the  two 
components  by  n  (t)  and  no(0-  Thus, 


nw(r)  =  n(/)  +  n0(r) 


where 


N 


n  (o  = 


k=\ 


and 


no(0  =  ^  n j<pj(t) 

k=N+ 1 


where 


nj=  (  n 


(t)<Pj(t)dt 


(11.72) 


(11.73a) 


(11.73b) 


(11.73c) 


Because  no(0  is  orthogonal  to  the  W-dimensional  space,  it  is  orthogonal  to  every  signal  in  that 
space.  Hence, 


Therefore, 


/  no(f)ipj(t) dt  =  0  j  =  1,  2,  ...,N 

JTm 

t\j=  /  [n(0  +  no(t)](pj(t)dt 

=  /  n w(t)<pj(t)dt  j  =  1,2,...,W 
JTm 


(11.74) 


Figure  11.17 

Eliminating  the 
noise  orthogon 
to  signal  space 
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From  Eqs.  ( 1 1.73a)  and  (1 1.74),  it  is  evident  that  we  can  filter  out  the  component  no(f)  from 
nM,(/).  This  can  be  seen  from  the  fact  that  the  received  signal.  r(f),  can  be  expressed  as 

r(f)  =  skU)  +  n  At) 

=  sk(t)  +  n(r)  +  no(0 

=  q(r)  +  no(r)  (11.75) 

where  q(r)  is  the  projection  of  r(f)  on  the  ^-dimensional  space: 

q(0  =  sk(t)  +  n(f)  (11-76) 

We  can  obtain  the  projection  q(r)  from  r(r)  by  observing  that  [see  Eqs.  (1 1.71b)  and  (1 1.73a)| 

N 

q(r)  =  ]T( skj  +  n j)<Pj(t)  (1 1.77) 

7=1 

From  Eqs.  (1 1.71c),  (1 1.74),  and  ( 1 1.77)  it  follows  that  if  we  feed  the  received  signal  r(r) 
into  the  system  shown  in  Fig.  11.17,  the  resultant  outcome  will  be  q(f).  Thus,  the  orthogonal 
noise  component  can  be  filtered  out  without  disturbing  the  message  signal. 

The  question  here  is:  Would  such  filtering  help  in  our  decision  making?  We  can  easily  show 
that  it  cannot  hurt  us.  The  noise  n„  (/)  is  independent  of  the  signal  waveform  sk(t).  Therefore, 
its  component  n0(f)  is  also  independent  of  s*  (f).  Thus,  n(>(0  contains  no  information  about  the 
transmitted  signal,  and  discarding  such  a  component  from  the  received  signal  r(f)  will  not  cause 
any  loss  of  information  regarding  the  signal  waveform  sk(t).  This,  however,  is  not  enough. 
We  must  also  make  sure  that  the  noise  being  discarded  [n0(/)|  is  not  in  any  way  related  to  the 
remaining  noise  component  n (t).  If  n o(0  and  n(/)  are  related  in  any  way,  it  will  be  poss.ble  to 
obtain  some  information  about  n(f)  from  n0(f),  thereby  enabling  us  to  detect  that  signal  w.th 
less  error  probability.  If  the  components  n 0(r)  and  n(r)  are  independent  random  processes,  the 
component  n0(0  does  not  carry  any  information  about  n(r)  and  can  be  discarded.  Under  these 
conditions,  n0(/)  is  irrelevant  to  the  decision  making  at  the  receiver. 

The  process  n(/)  is  represented  by  components  nj ,  n2,  .  •  •  *  hn  along  ip\  (/),  •  •  •  * 

mit)  and  n0(0  is  represented  by  the  remaining  components  (infinite  number)  along  the 
remaining  basis  signals  in  the  complete  set,  {?*(*)}•  Because  the  channel  no.se  is  white 
Gaussian  from  Eq  (11.68)  we  observe  that  all  the  components  are  independent.  Hence, 


Figure  11.17 

Eliminating  the 
noise  orthogonal 
to  signal  space. 
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the  components  representing  no(f)  are  independent  of  the  components  representing  n(/). 
Consequently,  no(/)  is  independent  of  n(r)  and  contains  only  irrelevant  data. 

The  received  signal  r(r)  is  now  reduced  to  the  signal  q(f ),  which  contains  the  desired  signal 
waveform  and  the  projection  of  the  channel  noise  on  the  TV -dimensional  signal  space.  Thus, 
the  signal  q(f)  can  be  completely  represented  in  the  signal  space.  Let  the  vectors  representing 
n(r)  and  q(r)  be  denoted  by  n  and  q.  Thus, 

q  =  s  +  n 


where  s  may  be  any  one  of  vectors  s  i ,  sj,  •  •  ■ .  -Vo . 

The  random  vector  n  =  (nj,  ....  n\)  is  represented  by  TV  independent  Gaussian 
variables,  each  with  zero  mean  and  variance  an2  =  Af/2.  The  joint  PDF  of  vector  n  in  such  a 
case  has  a  spherical  symmetry,  as  shown  in  Eq.  (1 1.69b), 


p„(n)  = 


1  p-\\n\\2lN 

(nAf)N/2e 


Note  that  this  is  actually  a  compact  notation  for 


Pni,n2 . n/v (nl <  n2,  •••,  ”/v)  = 


-(nj+n?+-+nj)/ N 

(TiN)"!2 


(11.78a) 


(11.78b) 


1  1 .6.3  (Simplified)  Signal  Space  and  Decision  Procedure 

Our  problem  is  now  considerably  simplified.  The  irrelevant  noise  component  has  been  filtered 
out.  The  residual  signal  q (t)  can  be  represented  in  an  jV-dimensional  signal  space.  We  proceed 
to  determine  the  M  decision  regions  R\ ,  Ri,  . . . ,  Rm  in  this  space.  The  regions  must  be  chosen 
to  minimize  the  probability  of  error  in  making  the  decision. 

Suppose  the  received  vector  q  =  q.  Then  if  the  receiver  decides  m  =  m*,  the  conditional 
probability  of  making  the  correct  decision,  given  that  q  =  q,  is 

P(C\q  =  q)  =  P(mk\q=q)  (llJ9) 

where  P(C |q  =  q)  is  the  conditional  probability  of  making  the  correct  decision  given  q  = 
and  P(mk  |q  =  q)  is  the  conditional  probability  that  mk  was  transmitted  given  q  =  <7  The 
unconditional  probability  P(C)  is  given  by 

P(C)=  f  P(C\q=q)pq(q)dq  (ll  80) 

Jq 

where  the  integration  is  performed  over  the  entire  region  occupied  by  q.  Note  that  this  is 

an  TV-fold  integration  with  respect  to  the  variables  q\,  q2 . qN  over  the  signal  waveform 

duration.  Also,  because  pq(qr)  >  0,  this  integral  is  maximum  when  P(C|q  =  q)  is  maximum. 
From  Eq.  (1 1.79)  it  now  follows  that  if  a  decision  m  =  mk  is  made,  the  error  probability  is 
minimized  if  the  probability 

P(C)  =  f  P(C\q=q)pq(q)dq 
Jq 

is  maximized.  The  probability  P(mk  |q  =  q)  is  called  the  a  posteriori  probability  of  ,nk ■  ^ 
is  because  it  represents  the  probability  that  mk  was  transmitted  when  q  was  being  received. 
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The  decision  procedure  to  maximizing  the  probability  of  correct  decision  P(C),  thereby 
minimizing  the  probability  of  error,  is  now  clear.  Once  we  receive  q  =  q.  we  evaluate  all  M 
a  posteriori  probability  functions  {P(mj\q  =  q)}.  Then  we  make  the  decision  in  favor  of  that 
message  for  which  the  a  posteriori  probability  is  highest — that  is,  the  receiver  decides  that 
m  =  mk  if 


(11.81) 


P(mk  |q  =  q)  >  P(roy|q  =  q)  for  all;  ^  k 


Thus,  the  detector  that  minimizes  the  error  probability  is  the  maximum  a  posteriori 
probability  (MAP)  detector. 

We  can  use  Bayes’  rule  (Chapter  8)  to  determine  the  a  posteriori  probabilities.  We  have 

,  ,  P(mk)pq(q\mk) 

P(mk\q  =  q)  = - —r~ -  U1.8Z) 

Pq(q) 

Hence,  the  receiver  decides  m  =  mk  if  the  decision  function 


P(mj)pq(q\mj) 

Pq(q) 


is  maximum  for  i  =  k. 

Note  that  the  denominator  pq(?)  is  common  to  all  decision  functions  and  is  not  effected 
by  the  decision.  Hence,  it  may  be  ignored  during  the  decision.  Thus,  the  receiver  sets  m  =  mk 
if  the  decision  function 


(11.83) 


P(mi)pq(q\mj) 


is  maximum  for  /  =  it.  Thus,  once  q  is  obtained,  we  compute  the  decision  function  (Eq.  ( 1 1 .83)] 


lo  iiiaAimum  ivm  1  —  ....  — — 7 M  ... 

for  all  messages  m,.  m2,  ....  «w  and  decide  that  the  message  for  wh.ch  the  funct.cn  is 

maximum  is  the  one  most  likely  to  have  been  sent. 

We  now  turn  our  attention  to  finding  the  decision  functions.  The  a  prion  probability  P(m,) 
represents  the  probability  that  the  message  m,  will  be  transmitted.  These  probabilities  must  be 
known  if  the  criterion  discussed  is  to  be  used.*  The  term  pq(?K)  represents  the  PDF  ot  q 
when  the  transmitter  sends  5(f)  =  s,(t).  Under  this  condition. 


q  =  Si  +  n 


and 


n  =  q  -  Si 


/\llCIllttllVtlj,  U1V  j 

nr -q  -  s\.  Hence  [Eq.  (1 1.78a)], 


pq(q\mi)  =Pn(q~si)  ~  („tf)N/2e 


(11.84) 


*  In  case  these  probabilities  are  unknown,  one 
minimax,  as  will  be  discussed  later. 


must  use  other  merit  criteria,  such  as  maximum  likelihood  or 
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The  decision  function  in  Eq.  (1 1.83)  now  becomes 

-,-II<7-s.II2AM  (1185) 

(ttAO"/2 

Note  that  the  decision  function  is  always  nonnegative  for  all  values  of  i.  Hence,  comparing  these 
functions  is  equivalent  to  comparing  their  logarithms,  because  the  logarithm  is  a  monotone 
function  for  the  positive  argument.  Hence,  for  convenience,  the  decision  function  will  be 
chosen  as  the  logarithm  of  Eq.  ( 1 1 .85).  In  addition,  the  factor  (nA 0,v  ' '  is  common  for  all  i 
and  can  be  left  out.  Hence,  the  decision  function  to  maximize  is 

ln/*(m,-)  —  -j^\\q-Si\\2  ( 1 1  -86) 

Note  that  \  \q  —  s/||2  is  the  square  of  the  length  of  the  vector  q  —  S/.  Hence, 

II?  -  Sill2  =  <q  —  Si,  q  -Si> 

=  ll?ll2  +  INI2  -  2<?,  Si>  (H-87) 

Hence,  the  decision  function  in  Eq.  (1 1.86)  becomes  (after  multiplying  throughout  by  M/2) 

yin  />(«,•)  -  ^  (||?||2  +  ||s,||2  -  2<?,  s,>)  01-881 

Note  that  the  term  ||s,||2  is  the  square  of  the  length  of  s,  and  represents  E, ,  the  energy  of  signal 
Sj(t).  The  terms  AT  In  P(nij)  and  Et  are  constants  in  the  decision  function.  Let 

a\  =  j[Af\n  P{n\i)  —  E(\  (11.89) 

Now  the  decision  function  in  Eq.  (1 1.88)  becomes 


ai  -f  <q ,  Si> 


The  term  ||^||2/2  is  common  to  all  M  decision  functions  and  can  be  omitted  for  the  purpose 
of  comparison.  Thus,  the  new  decision  function  bi  is 


We  compute  this  function  bi  for 
function  is  the  largest  for  i  =  k. 
whose  impulse  response  is  /?(r). 


bi=aj  +  <q,Si>  (11-90) 

i  =  I,  2,  . . . ,  N,  and  the  receiver  decides  that  m  =  mk  •ltl|lS 
If  the  signal  q(t)  is  applied  at  the  input  terminals  of  a  system 
the  output  at  t  =  Tm  is  given  by 


q(T)h(TM  -x)dx 


If  we  choose  a  filter  matched  to  j,(r),  that  is,  h(t)  =  Si(TM  -  t). 


h(TM  -  r)  =  s,(r) 
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and  based  on  Parseval’s  theorem,  the  output  is 


q(T)si(x)dr  =  <q,  s,> 


Hence,  <q ,  s,>  is  the  output  at  t  =  Tm  of  a  filter  matched  to  Sj(t)  when  q(t)  is  applied  to 
its  input. 

Actually,  we  do  not  have  q(t).  The  incoming  signal  r(t)  is  given  by 


r(f)  =  Sj(t)  +  nw(/) 

-  Sj(t)  +  n(f)  +  no(Q 
q(t)  irrelevant 


where  n0(0  is  the  (irrelevant)  component  of  n„.(/)  orthogonal  to  the  N -dimensional  signal 
space.  Because  n0(t)  is  orthogonal  to  this  space,  it  is  orthogonal  to  every  signal  in  this  space. 
Hence,  it  is  orthogonal  to  the  signal  and 


/OO 

n0U)Si(t)dt  =  0 

-00 


and 


/oo  roo 

q(t)sj(t)dt  +  /  no  (t)sj(t)dt 

-oo  J-oo 

/oo 

[?(/)  +  no(t)\si(t)dt 

-OO 


-OO 

/OO 

r(t)s,-(t )  dt 

-OO 


(11-91) 


Hence  it  is  immaterial  whether  we  use  q(t)  or  r(t)  at  the  input.  We  thus  apply  the  incoming 
signal  V(f)  to  a  parallel  bank  of  matched  filters,  and  the  output  of  the  filters  is  sampled  at 
t  =  Tm  Then  a  constant  a,  is  added  to  the  ith  filter  output  sample,  and  the  resulting  outputs 
are  compared.  The  decision  is  made  in  favor  of  the  signal  for  which  this  output  is  the  largest. 
The  receiver  implementation  for  this  decision  procedure  is  shown  in  Fig.  1 1 . 1 8a.  Section  11.1 
has  already  established  that  a  matched  filter  is  equivalent  to  a  correlator.  One  may  therefore 
use  correlators  instead  of  matched  filters.  Such  an  arrangement  is  shown  in  Fig.  1 1.18b. 

We  have  shown  that  in  the  presence  of  AWGN,  the  matched  filter  receiver  is  the  optimum 
receiver  when  the  merit  criterion  is  minimum  error  probability.  Note  that  the  optimum  system 
is  found  to  be  linear,  although  it  was  not  constrained  to  be  so.  Therefore,  for  white  Gaussian 
noise  the  optimum  receiver  happens  to  be  linear.  The  matched  filter  obtained  in  Sec.  1 1. 1  and 
Sec  112  as  well  as  the  decision  procedure  are  identical  to  those  derived  here. 

The  optimum  receiver  can  be  implemented  in  another  way.  From  Eq.  (11  -91),  we  have 


<qs  Si>  =  <r,  s/> 


From  Eq.  ( 1 1 .44),  we  can  rewrite  this  as 

N 

<9-  *<>  =  !]  rJsU 
j=  i 
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Figure  11.18 

Optimum  M -ary 
receiver: 

(a)  matched  filter 
detector; 

(b)  correlation 
detector. 


Sample  at  t  =  TM 
(a) 


Sample  at  /  -Tm 
(b) 


The  term  <q,  s,>  is  computed  according  to  this  equation  by  first  generating  r7  and  then  comput 
ing  the  sum  of  rjs-,j  (remember  that  the  sy  are  known),  as  shown  in  Fig.  1 1 . 1 9a.  The  M  correlator 
detectors  in  Fig.  1 1.18b  can  be  replaced  by  N  filters  matched  to  <p\ ( t ),  y?2 ( ? ) ,  •  •  •>  * ' 1 '  as 
shown  in  Fig.  1 1. 19b.  These  types  of  optimum  receiver  (Figs.  11. 18  and  1 1 . 1 9)  perform  identi 
cally.  The  choice  will  depend  on  the  hardware  cost.  For  example,  if  N  <  M  and  signals  {(ft(nl 
are  easier  to  generate  than  (s;(r)},  then  the  design  of  Fig.  11.19  would  be  chosen. 


1  1 .6.4  Decision  Regions  and  Error  Probability 

To  compute  the  error  probability  of  the  optimum  receiver,  we  must  first  determine  deu 
sion  regions  in  the  signal  space.  As  mentioned  earlier,  the  signal  space  is  divided  into 

nonoverlapping,  or  disjoint,  decision  regions  R\,  R2 . Rm  ,  corresponding  to  M  messages- 

If  q  falls  in  the  region  Rk,  the  decision  is  that  mk  was  transmitted.  The  decision  region^ 
are  chosen  to  minimize  the  probability  of  error  in  the  receiver.  In  light  of  this  geometric 
representation,  we  shall  now  try  to  interpret  how  the  optimum  receiver  sets  these  dec1- 
regions. 
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Figure  11.19 

Another  form  of 
optimum  M-a ry 
receiver: 

(a)  correlator; 

(b)  matched  filter. 


Sample  at  /  =  TM 


(a) 


Sample  at  t  =  TM 


(b) 


The  decision  function  is  given  by  Eq.  (1 1.86).  The  optimum  receiver  sets  m  =  mk  if  the 
decision  function 

Af\nP(mi)-\\q-Si\\2 

is  maximum  for  /  =  k.  This  equation  defines  the  decision  regions. 


Geometric  Interpretation  in  Signal  Space 

For  simplicity,  let  us  first  consider  the  case  of  equtprobable  messages,  that  .s  )  =  1/M 
for  all  /  In  this  case,  the  first  term  in  the  decision  function  ts  the  same  for  all  »  and.  hence, 
can  be  dropped.  Thus,  the  receiver  decides  that  m  -  mk  if  the  term  II?  s/ll  is  argest 
(numerically  the  smallest)  for  i  =  *.  Alternatively,  this  may  be  stated  as  follows,  the  receiver 
decides  that  m  =  mk  if  the  decision  function  ||?  -  s,||2  is  minimum  for  ,  =  k.  Note  that 
Z-  S  |  is  the  distance  of  point  q  from  point  s, .  Thus,  the  decision  procedure  in  this  case  has  a 


— *■  -  - 
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Figure  1 1 .20 

Determining 
optimum 
decision  regions 
in  a  binary  case. 


» 


simple  interpretation  in  geometrical  space.  The  decision  is  made  in  favor  of  that  signal  which 
is  closest  to  q ,  the  projection  of  r  [the  component  of  r(t)]  in  the  signal  space. 

This  result  is  expected  on  qualitative  grounds  for  Gaussian  noise,  because  the  Gaussian 
noise  has  a  spherical  symmetry.  If,  however,  the  messages  are  not  equiprobable,  we  cannot  go 
too  far  on  purely  qualitative  grounds.  Nevertheless,  we  can  draw  certain  broad  conclusions. 
If  a  particular  message  m,  is  more  likely  than  the  others,  one  will  be  safer  in  deciding  more 
often  in  favor  of  m,-  than  other  messages.  Hence,  in  such  a  case  the  decision  regions  will  be 
biased,  or  weighted,  in  favor  of  m,-.  This  is  shown  by  the  appearance  of  the  term  In  P(mi)  in 
the  decision  function.  To  better  understand  this  point,  let  us  consider  a  two-dimensional  signa 
space  and  two  signals  s\  and  52,  as  shown  in  Fig.  1 1 .20a.  In  this  figure,  the  decision  regions 
R\  and  R2  are  shown  for  equiprobable  messages;  P(m\ )  =  P(ni2)  =  0.5.  The  boundary  of  the 
decision  region  is  the  perpendicular  bisector  of  the  line  joining  points  s  \  and  52-  Note  that  an) 
point  on  the  boundary  is  equidistant  from  s\  and  $2-  If  q  happens  to  fall  on  the  boundary,  we 
just  “flip  a  coin”  and  decide  whether  to  select  m\  or  m2.  Figure  1 1 .20b  shows  the  case  ot  two 
messages  that  are  not  equiprobable.  To  delineate  the  boundary  of  the  decision  regions,  we  use 
Eq.  (1 1.86).  The  decision  is  m\  if 


II Q  “-sill2  -  M\nP(m\)<\\q  -  s2\\2  -Af\nP(m2) 

Otherwise,  the  decision  is  m2. 

Note  that  \\q  -  s\  ||  and  \\q  -  s2||  represent  distances  d\  and  d2 ,  the  distance  of  q  from  51 
and  52,  respectively.  Thus,  the  decision  is  mi  if 


d\  <  AAln 


P(m  1) 

P(m2) 


The  right-hand  side  of  this  inequality  is  a  constant  c: 


c  =  Af\n 


P(m\) 


\ 

/ 


P(mi) 
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Thus,  the  decision  rule  is 


Decision  (q)  =  <  m2 


m  1 


if  t/j2  —  d\  <  c 
if  d ,2  —  d}  >  c 


randomly  m\  or  m2  if  d\  —  d\  =  c 


The  boundary  of  the  decision  regions  is  given  by  d\  —  d\  =  c.  We  now  show  that  such  a 
boundary  is  given  by  a  straight  line  perpendicular  to  line  s\ — S2  and  passing  through  s\ — s 2 
at  a  distance  /x  from  s\y  where 


(11.92) 


where  d  is  the  distance  between  s\  and  S2 .  To  prove  this,  we  redraw  the  pertinent  part  of 
Fig.  1 1 .20b  as  Fig.  1 1.20c,  from  which  it  is  evident  that 


djW+M2 

dn  =  CT  +  (d  —  fir 


Flence, 


d 2  -  d\  —  2d [i  -  d2  —c 


Therefore, 


c  +  d2 


This  is  the  desired  result.  Thus,  along  the  decision  boundary  df  -  dj  is  constant  and  equal  to  c. 
The  boundaries  of  the  decision  regions  for  M  >2  may  be  determined  via  similar  argument. 
The  decision  regions  for  the  case  of  three  equiprobable  two-dimensional  signals  are  shown 
in  Fig  1 1 .2 1 .  The  boundaries  of  the  decision  regions  are  perpendicular  bisectors  of  the  lines 
joining  the  original  transmitted  signals.  If  the  signals  are  not  equiprobable.  then  the  boundaries 
will  be  shifted  away  from  the  signals  with  larger  probabilities  of  occurrence. 

For  signals  in  N -dimensional  space,  the  decision  regions  will  be  W-dimensional  hyper¬ 
cones  If  there  are  M  messages  mi,  m2,  mM  with  decision  regions  R\,  R2 . Rm . 

respectively  then  P(C\mt),  the  probability  of  a  correct  decision  when  m,  is  transmitted,  is 


(11.93) 


M 


P(C )  =  P(m,j/)(C|m,j 


(11.94) 


i=l 


and  PeM,  the  probability  of  error,  is  given  by 

PeM  =  1  -  HO 


(11.95) 
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Figure  1 1.21 

Determining 
optimum 
decision  regions. 


t 


<P 9 


Example  1  1 .2  Binary  data  is  transmitted  by  using  polar  signaling  over  an  AWGN  channel  with  noise  PSD 
A/72.  The  two  signals  used  are 


S\(t)  =  -s2(t)  =  '/E<p(t)  (U-96) 

The  symbol  probabilities  P(m\)  and  P(nii)  are  unequal.  Design  the  optimum  receiver  and 
determine  the  corresponding  error  probability. 


Figure  1 1 .22 

Decision  regions 
for  the  binary 
case  in  this 

*2 

P  d  P 

. . 

example. 

h 

*1 

(a) 

(b) 


m2  if  r  < —  In 
4  VE 


P(m2) 
P(m , ) 
P(m2) 
Pimx) 


The  two  signals  are  represented  graphically  in  Fig.  1 1.22a.  If  the  energy  of  each  sig^ 
is  £,  the  distance  of  each  signal  from  the  origin  is  \/£.  The  distance  d  between  the  t 
signals  is 


d  =  2  y/E 
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The  decision  regions  R\  and  R2  are  shown  in  Fig.  11.22a.  The  distance  /x  is  given  by 
Eq.  ( 1 1 .92).  Also,  the  conditional  probability  of  correct  decision 

P(C  |m  =  m\)  =  P(  noise  vector  originating  atsi  remains  in  R\) 


=  P(n  >  -/x) 


Similarly, 


P(C|m  =  m2)  =  1  -  Q 


and  the  probability  of  correct  decision  is 


P(C)  =  />(/»!,) 


=  1  -P(m\)Q 


>te) 

(tsti) 


+  P(m2) 


l-Q 


te)i 


-  P(m2)Q 


(j-*\ 


and 

nc)  - ( ~m) + '>(raj,c  te) 

where 

d  =  2sfE 


and 


-AT  ,  P(mi) 

:  In 


+ 


M  4\/£  P(m2) 

When  />(m, )  =  P(m)  =  0.5,  ^  =  VE  =  d/2,  and  Eq.  ( 1 1 ,97a)  reduces  to 


Pe=Q 


(11.97a) 


(11.97b) 


(11.97c) 


(1 1.97d) 


In  this  problem,  because  N  =  1  and  M  =  2,  the  receiver  in  Fig.  1 1 . 19  is  preferable 
to  that  in  Fi<J  11.18.  For  this  case  the  receiver  of  the  form  in  Fig.  1 1 . 1 9b  reduces  to  that 
shown  in  Fig  1 1.22b.  The  decision  threshold  d'  as  seen  from  Fig.  1 1.22a  is 


d'  =  VE-n  = 


U  ,  P(m2) 

— —  In - 

4  s[E  P(m\) 


J 
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I  Note  that  d'  is  the  decision  threshold.  Thus,  in  Fig.  1 1.22b,  if  the  receiver  output  r  >  d\ 
the  decision  is  mi.  Otherwise  the  decision  is  /m. 

When  P{m\)  =  P(ni2)  =  0.5,  the  decision  threshold  is  zero.  This  is  precisely  the 

result  derived  in  Sec.  1 1 . 1  for  polar  signaling. 


Figure  1 1 .23 

(a)  Conditional 
PDFs  in  PAM. 

(b)  Error  pro¬ 
bability  in  PAM. 


1 1.6.5  Multiamplitude  Signaling  (PAM) 

We  now  consider  the  M  -ary  generalization  of  the  binary  polar  signaling,  often  known  as  pulse 
amplitude  modulation  (PAM).  In  the  binary  case,  we  transmit  two  symbols,  consisting  of  the 
pulses  p(t)  and  -p(r),  where  p(t)  may  be  either  a  baseband  pulse  or  a  carrier  modulated  by  a 
baseband  pulse.  In  the  multiamplitude  (PAM)  case,  the  M  symbols  are  transmitted  by  M  pulses 
±p(t),  ±3 p(t),  ±5 p(t),  ....  ±(M  -  1  )p(t).  Thus,  to  transmit  Rm  A/ -ary  digits  per  second, 
we  are  required  to  transmit  Rm  pulses  per  second  of  the  form  kp(t).  Pulses  are  transmitted 
every  Tm  seconds,  so  that  Tm  =  1  /RM .  If  Ep  is  the  energy  of  pulse  p(t),  then  assuming  that 
pulses  ±p(t),  ±3 p(t),  ±5  p(t), ±(Af  -  1  )p(t)  are  equally  likely,  the  average  pulse  energy 
EpM  is  given  by 


Epm  =  jj[Ep  +  9 Ep  +  25 EP  +  ---  +  (M  -  1  )2EP] 

M-2 

2  Ep  9 

=  1 + 


k=  0 


A/--  1 


3  Ep 


A/2 

~  _ r 

-  3  Ep 


M  »  1 


(11.98a) 

(11.98b) 


Recall  that  an  A/  -ary  symbol  carries  an  information  of  log2  A/  bits.  Hence,  the  bit 


energy  Eb  is 


_  EpM  _  3Y2  -  1  E  (11.98c) 

log2  M  3  log2  M  ’’ 


Because  the  transmission  bandwidth  is  independent  of  the  pulse  amplitude,  the  M -ary  ban 
width  is  the  same  as  in  the  binary  case  for  the  given  rate  of  pulses,  yet  it  carries  more  information. 
This  means  that  for  a  given  information  rate,  the  PAM  bandwidth  is  less  than  that  of  the  binary 
case  by  a  factor  of  log2  M . 

To  calculate  the  error  probability,  we  observe  that  because  we  are  dealing  with  the  same 
basic  pulse  p(t),  the  optimum  A/  -ary  receiver  is  a  filter  matched  to  p(t).  When  the  input  pu  * 
is  kp(t),  the  output  at  the  sampling  instant  will  be 

r(Tvf )  =  kAp  +  na(TM ) 

Note  that  Ap  =  Ep,  the  energy  of  p(t),  and  that  a2,  the  variance  of  n„(f),  is  A/'fp/2- ThuS 'j 
optimum  receiver  for  the  multiamplitude  A/  -ary  signaling  case  is  identical  to  that  ot  the  pl 
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Figure  1 1 .23 

(a)  Conditional 
PDFs  in  PAM. 

(b)  Error  pro¬ 
bability  in  PAM. 


(a) 


Eb/K 

dB  ^ 


binary  case  (see  Fig.  1 1 .3  or  1 1 .6a).  The  sampler  has  M  possible  outputs 

±kAp  +  n0(TM)  k=  1,3,5 . M-  1 

that  we  wish  to  detect.  The  conditional  PDFs  p(r|m,)  are  Gaussian  with  mean  ±kAp  and 
variance  <r2,  as  shown  in  Fig.  11.23a.  Let  PeM  be  the  error  probability  detecting  a  symbol  and 
P(€\m)  be  the  error  probability  given  that  the  symbol  m  is  transmitted. 

To  calculate  PeM ,  we  observe  that  the  case  of  the  two  extreme  symbols  [represented  by 
±(M  -  !)»(/)]  is  "similar  to  the  binary  case  because  they  have  to  guard  against  only  one 
neighbor  As  for  the  remaining  symbols,  they  must  guard  against  neighbors  on  both  sides,  and, 
hence  PU\m)  in  this  case  is  twice  that  of  the  extreme  symbol.  From  Fig.  1 1 .23a  it  is  evident 
that  P(€\mj)  is  Q(Ap/on)  for  the  two  extreme  signals  and  is  2Q(Ap/on )  for  the  remaining 
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(AY  —  2)  symbols.  Hence, 


M 


PeM  =  '^2,P{m,)P(€\m,) 

i—  1 

1  M 

=  iEpt<w 

i—  1 

■>K£)+cG 

2(M-\)  (Ap\ 

AY  \oJ 

— ^  +  (AY  —  2)2<2 

Tn/ 

For  a  matched  filter  receiver,  (Ap/an)2  =  2 Ep/M,  and 

(VM/5) 

'  / 6  log2 AY  /  Eh\ 
_Y  AY2  -  1  \M) 

(11.99a) 


(11.99b) 


(11.99c) 
(1  l.99d) 


Bit  Error  Rate  (BER) 

It  is  somewhat  unfair  to  compare  AY -ary  signaling  on  the  basis  of  P,M  .  the  error  probability 
of  an  AY  -ary  symbol,  which  conveys  the  information  of  k  =  log2  AY  bits.  Because  not  all  bits 
are  wrong  when  an  AY -ary  symbol  is  wrong,  this  weighs  unfairly  against  larger  AY .  Fora  lair 
comparison,  we  should  compare  various  schemes  in  terms  of  their  probability  of  bit  error  i’b- 
rather  than  PeM ,  the  probability  of  symbol  error  (symbol  error  rate).  We  now  show  that  tor 
multiamplitude  signaling  Pb  *  PeM /  log2  AY . 

Because  the  type  of  errors  that  predominate  are  those  in  which  a  symbol  is  mistaken  for 
its  immediate  neighbors  (see  Fig.  11.23a),  it  would  be  logical  to  assign  neighboring  AY -ary 
symbols,  binary  code  words  that  differ  in  the  least  possible  digits.  The  Gray  code*  is  suitab  e 
for  this  purpose  because  adjacent  binary  combinations  in  this  code  differ  only  by  one  digit- 
Hence,  an  error  in  one  AY  -ary  symbol  detection  most  likely  will  cause  only  one  error  in  a 
group  of  log2AY  binary  digits  transmitted  by  the  AY -ary  symbol.  Hence,  the  bit  error  r^e 
Pb  =  PeM  /  log2  AY .  Figure  1 1 ,23b  shows  PeM  as  a  function  of  Eh/M  for  several  values  ot  A  • 
Note  that  the  relationship  Ph  =  Pe\i /  log2  AY,  valid  for  PAM,  is  not  necessarily  valid  forot  er 
schemes  to  the  specific  code  structure.  One  must  recompute  the  relationship  between  Pb  an 
Pe\i  for  each  specific  scheme. 


The  Gray  code  can  be  constructed  as  follows.  Construct  an  n-digit  natural  binary  code  (NBC)  corresponding  ^ 
decimal  numbers.  If  b\ b2 . . .  b„  is  a  code  word  in  this  code,  then  the  corresponding  Gray  code  word  g\g2  ■  ■  gn 
obtained  by  the  rule 

Si  =b\ 

Sk=h®bk_  |  k>  2 

Thus  for  n  =  3.  the  binary  code  (MM),  (MU.  010,  (111,  100.  101, 110,  111  is  transformed  into  the  Gray 

011.010,  110.  Ill,  101,  too 


code  000. 0*H- 
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Trade-off  between  Power  and  Bandwidth 

To  maintain  a  given  information  rate,  the  pulse  transmission  rate  in  the  M -ary  case  is  reduced 
by  the  factor  A  =  log2  M .  This  means  the  bandwidth  of  the  M  -ary  case  is  reduced  by  the 
same  factor  k  =  log2  M .  But  to  maintain  the  same  P,\i ,  Eqs.  ( 1 1 .99)  show  that  the  power 
transmitted  per  bit  (which  is  proportional  to  £/,)  increases  roughly  as 


M2/\og2M  =  2  2k/k 


On  the  other  hand,  if  we  maintain  a  given  bandwidth,  the  information  rate  in  the  M  -ary  case 
is  increased  by  the  factor  A-  =  log2  M .  The  transmitted  power  is  equal  to  £/,  times  the  bit  rate. 
Hence,  an  increased  data  rate  also  necessitates  increased  power  by  the  factor 


(M  2 1  log2  M  )(log2  M )  =  2a 


Thus,  the  power  increases  exponentially  with  the  increase  in  information  rate  by  a  factor  of  A. 
In  high-powered  radio  systems,  such  a  power  increase  may  not  be  tolerable.  Multiamplitude 
systems  are  attractive  when  bandwidth  is  very  costly.  Thus  we  can  see  how  to  trade  power 
for  bandwidth.  Because  the  voice  channels  of  a  telephone  network  have  a  fixed  bandwidth, 
multiamplitude  (or  multiphase,  ora  combination  of  both)  signaling  is  a  more  attractive  method 
of  increasing  the  information  rate.  This  is  how  voiceband  computer  modems  achieve  high  data 
rate. 

All  the  results  derived  here  apply  to  baseband  as  well  as  modulated  digital  systems  with 
coherent  detection.  For  noncoherent  detection,  similar  relationships  exist  between  the  binary 
and  M  -ary  systems.* 

1  1 .6.6  M-ary  QAM  Analysis 

In  M -ary  QAM.  the  transmitted  signal  is  represented  by 


(11.100) 


where 


d  3 d  ,  (VM-  1  )d 

b,  =  ±-,  ±— ,  •  •  •  ± - ~ - 


It  is  easy  to  observe  that  tne  ^ 
(p\  (/)  and  Instead  ot  deteri 

arbitrary  QAM  constellation,  we^ 
configuration  shown  in  Fig.  1 1 
channel. 


It  is  easy  to  observe  that  the  QAM  signal  space  is  two-dimensional  with  basis  functions 
Wl  (t)  and  *,(,)  Instead  of  determining  the  optimum  receiver  and  its  error  probability  tor  an 
(p]  {t)  ana  (PA  J.  ,, onnrnurh  hv  tirmlv^ina  rhp  16-nmnt  OAM 


illation,  we  illustrate  the  basic  approach  by  analyzing  the  16-point  QAM 
in  Fig.  11.24a.  We  assume  all  signals  to  be  equiprobable  in  an  AWGN 


*  For  the  noncoherent  case. 


the  baseband  pulses  must  be  of  the  same  polarity:  for  example. 


0,  p(t),  2 p(t) . (M  )pUh 
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Figure  1 1 .24 

1 6-ary  QAM. 


1 

<Pz 

•  • 

•  • 

•  • 

•  • 

•  • 

•  •  <P1— " 

•  • 

•  • 

(a) 


*4 

•v4 

tdl2t 

*1 

dl2 

•*3 

r3 

•  s2 

*2 

r\ 

(b) 


(c) 


Let  us  first  calculate  the  error  probability.  The  first  quadrant  of  the  signal  spa 
duced  in  Fig.  1 1.24b.  Because  all  the  signals  are  equiprobable,  the  decision  region 
will  be  perpendicular  bisectors  joining  various  signals,  as  shown  in  Fig.  1 1.24b. 
From  Fig.  1 1 .24b  it  follows  that 

P(C\m\)  =  ^(noise  vector  originating  at  s\  lies  within  R\) 

-*(■■>—.  .>-0 

-H©r 

-Mjwr 

For  convenience,  let  us  define 

p=i-eby 


P(C\m\)  =  p2 


:e  is  repro- 

boundaries 


(11.101) 


Hence, 
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Using  similar  arguments,  we  have 


P(C\m2)  =  P(C\m4)  =  [l  -  Q  (  y=) 

=  p(2p  -  1) 


1-2  Q 


and 

P(C\mi)  =  (2p  —  l)2 


Because  of  the  symmetry  of  the  signals  in  all  four  quadrants,  we  get  similar  probabilities 
for  the  four  signals  in  each  quadrant.  Hence,  the  probability  of  correct  decision  is 

16 

P(C)  =  P{C\mi)P(mi) 

i=l 
1  16 

=  _£?(CM,) 

1=1 

=  _L[4p2  +  4p(2/7  -  1)  +  4p(2p  -  1)  +  4(2/7  -  l)2] 


=  \[9p2  -6/7+  1] 
4 


(11.102) 


and 


PeM  =  1  -  P<C)  =  \ 


(1  ~P) 


In  practice,  PeM  -*•  0  if  SNR  is  high  and.  hence,  P(C)  -*  1.  This  means  p  ~  1  and/7+  i  ~  1  $ 
[see  Eq.  (11. 102)],  and 


~  3(1  -p)  =  3Q 


(11.103) 


To  express  this  in  terms  of  the  received  power  S„  we  determine  £,  the  average  energy 
signal  set  in  Fig.  1 1.24.  Because  Ek,  the  energy  of  **,  is  the  square  of  the  distance  of  s* 


the  origin. 


Similarly, 


656 


PERFORMANCE  ANALYSIS  OF  DIGITAL  COMMUNICATION  SYSTEMS 


Hence, 


£ 


1  T9  l2  5  2  d2  5  21 

+2  +T  + 2d  J 


and  J2  =  0.4£.  Moreover,  for  M  =  16,  each  symbol  carries  the  information  of  log2 16=4 
bits.  Hence,  the  energy  per  bit  E t,  is 


Figure  1 1 .25 

MPSK  signals. 


and 


Eb  _  E  5 d2 

Jr  ~  JJr  ~  W 


Hence,  for  large  Eb/N 


PM=3Q(m) 

,luo4, 

A  comparison  of  this  with  binary  PSK  [Eq.  1 1 .33)]  shows  that  1 6-point  QAM  requires  almost 
2.5  times  as  much  power  as  does  binary  PSK;  but  the  rate  of  transmission  is  increased  by  a 
factor  of  log2  M  =  4.  This  comparison  does  not  take  into  account  the  fact  that  /’/, ,  the  BER.  is 
somewhat  smaller  than  Pem . 

In  terms  of  receiver  implementation,  because  N  —  2  and  M  =  16,  the  receiver  in 
Fig.  1 1.19  is  preferable.  Such  a  receiver  is  shown  in  Fig.  1 1.24c.  Note  that  because  all  signals 
are  equiprobable, 


PSK  is  a  special  case  of  QAM  with  all  signal  points  lying  on  a  circle.  Hence,  the  same  ana 
lytical  approach  applies.  However,  the  analysis  may  be  more  convenient  if  a  polar  coordinate 
is  selected.  We  use  the  following  example  to  illustrate  the  two  different  approaches. 


Example  1 1 .3  MPSK 

Determine  the  error  probability  of  the  optimum  receiver  for  equiprobable  MPSK  signals,  each 
with  energy  £. 

Figure  1 1 ,25a  the  MPSK  signal  configuration  for  M  =  8.  Because  all  the  signals  are 
equiprobable,  decision  regions  are  conical,  as  shown.  The  message  m\  is  transmit 

by  a  signal  represented  by  the  vector  S|  =  ($,,  0).  If  the  projection  in  the  sign 
space  ot  the  received  signal  r  is  q  =  (q\ ,  <72),  and  the-noise  is  n  =  («i ,  rt2)>  ^en 

9  =  (•si  +  ri|,  n2)  =  (n/e  +  nj,  n2  ) 

' - - v - '  ' 

<71 


<72 
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Figure  1 1 .25 

MPSK  signals. 


Also, 

P(C\m\)  =  P(q  lies  in  R\) 

This  is  simply  the  volume  under  the  conical  region  of  the  joint  PDF  of  q,  and  qz.  Because 
n,  and  n2  are  independent  Gaussian  RVs  with  variance  M/2  qi  and  q2  are  independent 
Gaussian  variables  with  means  VE  and  0,  respectively,  and  each  with  variance  A  /2. 

Hence, 


r  *  -(<?,-^)wi  r _ 

Pqiq2(9l42>  L  VirAT  JLs/ttA 


1  e-ftM 


] 


and 


P(C|m,)  =  ^  j k  dq ,  iqi 

=  _L  (  (  f 
nM  Jn ,  \Jq2  ' 


(11.105a) 

(11.105b) 


To  integrate  over 
the  border  ot  R  i , 


/?, .  we  first  integrate  over  the  solid  vertical  strip  in  Fig.  1 1 .25b.  Along 
<72  =  ±(tan^)<?t 


■  ■ 
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Hence, 

,  foo  //•<?!  tan (ir/A/)  \  , 

P(C\m{)  =  — -7  /  (  /  e  W*  dq2)e  V  >  dq\ 

J o  tan  (jt/M )  / 

Changing  the  variable  to  x  =  y]2/J\T  q\ ,  we  get 

P(C|mi)  =  —4=  (1 1.106a) 

V27T  Jo  L  v  M  n 

Using  the  fact  that  £*,  the  energy  per  bit,  is  £/log2  M ,  we  have 

™«>  -  vfe  l> - *  («?)] 

(11.106b) 

The  integration  can  also  be  performed  in  cylindrical  coordinates  using  the  transformation 
q\  =  py/M/2  cos  0  and  <72  =  py/M/2  sin  0.  The  limits  on  p  are  (0,  00)  and  those  on  0 
are  —n/M  to  n/M .  Hence, 

1  ptt/M  roo  _ 

P{C\m\)  =  —  I  del  pe-(P1-2P^2^co,e+2E/X)i2dp  (11.107a) 

J-n/M  Jo 

—  _L  f  do  l  pe~\p2~^’p'J (2  log2  M *  (£f>/A0 cos  8+(2  log2  M  )(£*/A'’>l/2jp 
2^T  J-n/M  Jo 

(11.107b) 

Because  of  the  symmetry  of  the  signal  configuration,  P(C\mt  )  is  the  same  for  all  i.  Hence, 

P(C)  =  P(C\m\) 

and 


Figure  1 1 .21 

Error  probabili 
of  MPSK. 


PeM  =  1  —  P(C\m\) 

On  the  other  hand,  because  s, (t)  =  y/2E/TM  cos  Gu„/-bft),  where  a)0  =  2n/T\i  ,  ft  = 
2 ni/M ,  the  optimum  receiver  turns  out  to  be  just  a  phase  detector  similar  to  that  shown 
in  Fig.  1 1.24  (Prob.  1 1.6-10).  Based  on  this  observation,  an  alternative  expression  of  PeM 
can  also  be  found.  Since  p®(0)  of  the  phase  ©  of  a  sinusoid  plus  a  bandpass  Gaussian 
noise  is  found  in  Eq.  (9.86d), 

/nJM 

ps(6)d0 

■n/M 

The  PDF  p(r>(6)  in  Eq.  (9.86d)  involves  A  (the  sinusoid  amplitude)  and  ol  0he  n0'*e 
variance).  Assuming  matched  filtering  and  white  noise  [see  Eq.  (11.11a)], 

A2  _  2 Ep  _  2£*log2A/ 

al  N  A/- 


■  ' 
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Hence, 


PeM 


1  ri r/A# 

=  1  -  —  / 

2n  J-ji/m 


-(Eb\og2M/Jsf) 


_J_  ^  ^JlEb  \og2  M  Qe(Ehcos2  0  \og2M/M) 


do 


(11.108) 


Figure  1 1.26  shows  the  plot  of  PeM  as  a  function  of  Eh/M.  For  Eh/Af  »  1  (weak  noise) 
and  M  »  2,  Eq.  (1 1.108)  can  be  approximated  by7 


(11.109a) 


(11.109b) 


Figure  1 1 .26 

Error  probability 
ofMPSK. 


mgw 


i 
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1 1 .7  GENERAL  EXPRESSION  FOR  ERROR 
PROBABILITY  OF  OPTIMUM  RECEIVERS 

Thus  far  we  have  considered  rather  simple  schemes  in  which  the  decision  regions  can  be  found 
easily.  The  method  of  computing  error  probabilities  from  knowledge  of  decision  regions  has 
also  been  discussed.  When  the  number  of  signal  space  dimensions  grows,  it  becomes  harder 
to  visualize  the  decision  regions  graphically,  and  as  a  result  the  method  loses  its  power.  We 
now  develop  an  analytical  expression  for  computing  error  probability  for  a  general  Af-ary 
scheme. 

From  the  structure  of  the  optimum  receiver  in  Fig.  11.18,  we  observe  that  if  m\  is 
transmitted,  then  the  correct  decision  will  be  made  only  if 

bj  >  b2,  b3 . b m 


In  other  words. 


P(C\m\)  =  probability  (bi  >  b2,  b^,  . . . ,  b m  \m\)  (1 1-1 10) 


If  m  i  is  transmitted,  then  (Fig.  11.18) 


cTm 

b*  =  /  [si(0  +  n(/)]5*(f)t/r  +  a* 

Jo 


Let 


Pij  = 


rT\i 

Jo 


Si(t)sj(t)dt 


i,j=  1,  2,  ...,  M 


where  the  are  known  as  cross-correlations.  Thus  (if  m\  is  transmitted). 


(11.111) 


(11.112) 


cTm 

b*  =  P\k  +  /  n  (t)sk(t)  dt  +  ak 

Jo 


AT 

=  P\k  +ak  +  Y^skj"j 
7=1 


(11.113a) 

(11.113b) 


where  n,  is  the  component  of  n(/)  along  Note  that  p]k  +  ak  is  a  constant,  and  variables 
n7  0  =  Ij  2,  . . . ,  N)  are  independent  jointly  Gaussian  variables,  each  with  zero  mean  and  a 
variance  ot  M  /  2.  Thus,  variables  b*  are  a  linear  combination  of  jointly  Gaussian  variables-  It 
follows  that  the  variables  bh  b2,  . . . ,  bM  are  also  jointly  Gaussian.  The  probability  of  making 
a  correct  decision  when  m\  is  transmitted  can  be  computed  from  Eq.  ( 1 1 . 1 10).  Note  that  bi 

can  lie  anywhere  in  the  range  (-oo,  oo).  More  precisely,  if  />(/>,,  b2 . bM  \m\ )  is  the  joint 

conditional  PDF  ot  b| ,  b2,  . . . ,  b m  ,  then  Eq.  (11.110)  can  be  expressed  as 


P(C\m\) 


P(b\,  b2 . bM\m\)db\,  db2,  ...,  db\i 


Example  1 


Figure  1 1.2 

Orthogonal 

signals. 


» 


(11.11 4a) 
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where  the  limits  of  integration  of  b\  are  (-oo,  oo),  and  for  the  remaining  variables  the  limits 
are  (— oo,  b\).  Thus, 

/oo  rb  i  rb  i 

db\  l  db2  --  I  p(b\,  b2 . b\t\m\)dbM  (11.114b) 

-OO  J- 00  J— oo 

Similarly.  P(C\m2) . P(C\mM)  can  be  computed,  and 

M 

P(C)  =  Y^P(C\mj)P(mj) 
j= I 

and 

PeM  =  1  ~  P(C) 


Exomple  1  ’  4  ** ».(')•  ««> . *« -  — » As - 

example,  a  signal  set  for  M  =  3  is  shown  in  Fig.  1 1 .27. 


The  orthogonal  set  { sk(t))  is  characterized  by 

JO 

<*/,**>—  |  £  j  —  k 


(11.115) 


Pij  =  <Si,Sj>  = 


Hence, 


0 

£ 


i*j 

i  =j 


(11.116) 
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Further,  we  shall  assume  all  signals  to  be  equiprobable.  This  yields 

=  +£) 

where  £*  =  E  is  the  energy  of  each  signal.  Note  that  a *  is  the  same  for  every  sig¬ 
nal.  Because  the  constants  a *  enter  the  expression  only  for  the  sake  of  comparison 
(Fig.  11.1 9b),  when  they  are  the  same,  they  can  be  ignored  (by  setting  a *  =  0).  Also  for  an 
orthogonal  set, 

sk(t)  =  VE<pk(t )  (11.117) 

Therefore, 


skj  — 


0 


k  =J 


(11.118) 


Hence,  from  Eqs.  (11.11 3b),  (11.116),  and  ( 1 1 . 1 1 8),  we  have  (when  m  \  is  transmitted) 


b*  = 


E  +  \fEn\ 
sFe 


n* 


k  =  1 

k  =  2,  3 . M 


(11.119) 


Note  that  nj,  n2,  . . .,  n m  are  independent  Gaussian  variables,  each  with  zero  mean  and 
variance  A f /2.  Variables  b;  that  are  of  the  form  (an*  +  ft)  are  also  independent  Gaussian 
variables.  Equation  (11.119)  shows  that  the  variable  b|  has  the  mean  E  and  variance 
(VE)2(Ar/2)  =  ME/2.  Hence, 


Pb,(6|)  = 


s/nAfE 


-(bt-E)2/tfE 


p"fh)=7m 


e~bh^E 


k  =  2,  3 . M 


Because  b|,  b2,  . . .,  b\j  are  independent,  the  joint  probability  density  is  the  product 
the  individual  densities: 


of 


p{b\,  bi,  ....  bM\m\)  = 


1 


\fnAfE 


M  , 
E)2/m  j-j  / 

k=2  ■ 


K  y/rcAfE  ) 


and 


ro o  M  / 

n(/_ 


rb'  _L _e-*ki#*dbk 

-oo  'JnAfE  ) 


b'  I 


-oo 

i  r°° 


-oo  v/ ttAIE 


M- 1 


s/nAfE 
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Changing  the  variable  so  that  b\lJME/2  =  v,  and  recognizing  that  E/M  =  (log2  M) 
Eb/N,  we  obtain 


P(C\m\)  =  —j= 
V27T 


e-(y-y/2E/tf)  /2  [!  _  0(y)]M-'  dy 
e~\y~ (2Io*2 m ]  /2 1 1  _  @(j)]w-1 


dy 


(11.120b) 

(11.120c) 


Note  that  this  signal  set  is  geometrically  symmetrical;  that  is,  every  signal  has  the  same 
relationship  with  other  signals  in  the  set.  As  a  result. 


P(C\nt\)  =  P(C\nt2)  =  ■•■  =  P(C\mM) 


Hence, 


and 


P(C)  =  P(C\m\) 


PeM  =  1  -  WO 


=  ,  _  J_  [°°  e-[y->/W°* 2W)£»/V]2/2  u  _  Qiy)]M- 1  dy  (11.1 20d) 

V27T  7-00 

In  Fig  1 1  28  the  result  of  PeM  vs.  Eb/M  is  computed  and  plotted.  This  plot  shows  an 
interesting  behavior  for  the  case  of  M  =  00.  As  M  increases,  the  performance  improves 
but  at  the  expense  of  larger  bandwidth.  Hence,  this  is  a  typical  case  of  trading  bandwidth 

for  performance. 


M  *e“*"Cof  symbols  am  •ransmined  by  M  orthogonal  pulses  of 

frequencies  <e„  «o . «.  each  of  dura, ion  T„.  Thus,  rhe  M  rransmmed  pulses  am  of 

the  form 


s/lp'il)  COS  (Dkt 


cok  = 


2n(N  +  k ) 


T  KA 


The  receiver  (Fig.  1 1-29)  is  a  simple  extension 
multiplied  by  the  corresponding  references  V- 
matched  to  the  baseband  pulse  p\t)  such  that 


of  the  binary  receiver.  The  incoming  pulse  is 
cos  a>jt  (i  =  1,  2,  . . . ,  M ).  The  filter  H[)  )  is 


h(t)  =p\T\i  -  t) 


•m.  -„if  k  obtained  if  in  the  ith  bank,  instead  of  using  a  multiplier  and  H(f),  we  use 

“Sd  "pulse  p\!) cos  The  M  bank  oulpuls  sampled  a. ,  =  T„  am 

b|,  bj,  ••  •.  t • 
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Figure  1 1 .28 

Error  probability 
of  orthogonal 
signaling  and 
coherent  MFSK. 


Figure  1 1 .29 

Coherent  MFSK 
receiver. 


Because  the  M  signal  pulses  are  orthogonal,  the  analysis  from  Example  1 1 .4  is  directly 
applicable  with  error  probability 

PeM  =  \-~f°°  e-(y-^2Eh  l0^/Af)2/2(1  _  Q(y)f~l  dy  (11.12D 


The  M-ary  results  were  shown  in  Fig.  1 1 .28. 
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The  integral  appearing  on  the  right-hand  side  ot  Eq.  (1 1.121)  is  computed  and  plotted  in 
Fig.  1 1.28  (PeM  vs.  Eb/J\f).  This  plot  shows  an  interesting  behavior  for  the  case  of  M  =  oo. 
By  properly  taking  the  limit  of  Pe\i  in  Eq.  (1 1.121)  as  M  — ►  oo,  it  can  be  shown  that 


lim 

M-*  oo 


Eh/Af  <  log,,  2 
Eh/Af>  log,  2 


Because  the  signal  power  S,  =  EbRh,  where  Rh  is  the  bit  rate,  it  follows  that  for  error-free 
communication, 


5, 

M 


i'»8.2=T44 


ArRh  ~  1.44 


Hence, 

Rb<\M%  bit/s  (11.122) 

Jv 

This  shows  that  M-ary  orthogonal  signaling  can  transmit  error-free  data  at  a  rate  of  up  to 
1 .44  5(- /Af  bit/s  as  M  ->  oo  (see  Fig.  11.28). 


Bit  Error  Rate  (BER)  of  Orthogonal  Signaling 

For  PAM  and  MPSK,  we  have  shown  that,  by  applying  the  Gray  code.  Ph  -  P,m  /  log2  M  ■ 
This  result  is  not  valid  for  MFSK  because  the  errors  that  predominate  in  PAM  and  MPSK.  are 
those  in  which  a  symbol  is  mistaken  for  its  immediate  neighbor.  We  can  use  the  Gray  code  to 
assign  the  adjacent  symbols  codes  that  differ  in  just  one  digit.  In  MFSK.  on  the  other  hand,  a 
symbol  is  equally  likely  to  be  mistaken  for  any  of  the  remaining  M  -  I  symbols.  Hence.  /  (e), 
the  probability  of  mistaking  one  particular  M  -ary  symbol  for  another,  is  equally  likely, 

P  eM  PeM 

P(f)  =  M  -  1  ~  2*  -  1 

If  an  M-ary  symbol  differs  by  1  bit  from  N\  number  of  symbols,  and  differs  by  2  bits  from  N2 
number  of  symbols,  and  so  on,  then  N(,  the  average  number  of  bits  in  error  in  reception  of  an 

M-ary  symbol,  is 


N(  =  J^nNnP(e) 


n=  1 
k 


n=\ 


=  I^Tn(k) 

2k  -  1  |  V»/ 

n—  1 


=  k  2‘ 


_t  _PeM_ 
2k  -  1 


This  is  an  average  number  of  bits  in  error  ,n  a  sequence  of  k  bits  (one  M- ary  symbol). 
Consequently,  the  BER.  fV  is  this  figure  divided  by  k. 


■>k-\ 


P  eM 
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From  this  discussion,  one  very  interesting  fact  emerges:  whenever  the  optimum  receiver 
is  used,  the  error  probability  does  not  depend  on  specific  signal  waveforms;  it  depends  only 
on  their  geometrical  configuration  in  the  signal  space. 


Bandwidth  and  Power  Trade-offs  of  M -ary  Orthogonal  Signals 

As  illustrated  by  Landau  and  Poliak,3  the  dimensionality  of  a  signal  is  2 BTm  +  1,  where  Tm  is 
the  signal  duration  and  B  is  its  essential  bandwidth.  It  follows  that  for  an  rV -dimensional  signal 
space  (N  <  M),  the  bandwidth  is  B  =  (N  —  \)/2Tm.  Thus,  reducing  the  dimensionality  N 
reduces  the  bandwidth. 

We  can  verify  that  N -dimensional  signals  can  be  transmitted  over  (N  —  \)/2Tm  Hz  by 
constructing  a  specific  signal  set.  Let  us  choose  the  following  orthonormal  signals: 


<Po(t)  = 


1 

•JTm 


<P\(t)  = 

<P2(t)  = 

<Pl(t)  = 

(p4(t)  = 


2  7T 


COn  = 


Tm 

0  <t  <Tm 


(11.123) 


G""' 

These  k  +  1  orthogonal  pulses  have  a  total  bandwidth  of  (k/2)(co0/2n)  =k/2TM  Hz.  Hence, 
when  k  +  1  =  N,  the  bandwidth*  is  (N  -  l)/2TM.  Thus,  N  =  2 TMB  +  1. 

To  attain  a  given  error  probability,  there  is  a  trade-off  between  the  average  energy  of  the 
signal  set  and  its  bandwidth.  It  we  reduce  the  signal  space  dimensionality,  the  transmission 
bandwidth  is  reduced.  But  the  distances  among  signals  are  now  smaller,  because  of  the  reduced 
dimensionality.  This  will  increase  P e\\.  Hence,  to  maintain  a  given  low  Pcm  ,  we  ntus*  n0'v 
move  the  signals  farther  apart:  that  is,  we  must  increase  energy.  Thus,  the  cost  of  reduced 
bandwidth  is  paid  in  terms  of  increased  energy.  The  trade-off  between  SNR  and  bandwidth 
can  also  be  described  from  the  perspective  of  information  theory  (Sec.  14.6). 

M  -ary  signaling  provides  us  with  additional  means  of  exchanging,  or  trading,  the  transrnis 
sion  rate,  transmission  bandwidth,  and  transmitted  power.  It  provides  us  flexibility  in  designinr 
a  proper  communication  system.  Thus,  for  a  given  rate  of  transmission,  we  can  trade  the  trans 
mission  bandwidth  for  transmitted  power.  We  can  also  increase  the  information  rate  by  a 


*  ^ere.  1®no™8  the  band  spreading  at  the  edge.  This  spread  is  about  I /TM  Hz.  The  actual  bandwidth  exceeds 

(N  -  I  )/2 Tm  by  this  amount. 
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factor  of  k  (k  =  log?  M)  by  paying  a  suitable  price  in  terms  of  the  transmission  bandwidth 
or  the  transmitted  power.  Figure  11.28  showed  that  in  multitone  signaling  the  transmitted 
power  decreases  with  M .  However,  the  transmission  bandwidth  increases  linearly  with  M,  or 
exponentially  with  the  rate  increase  factor  k(M  =  2k).  Thus,  multitone  signaling  is  radically 
different  from  multiamplitude  or  multiphase  signaling.  In  the  latter,  the  bandwidth  is  indepen¬ 
dent  of  M ,  but  the  transmitted  power  increases  as  M1  /  log2  M  —2 ~k/k;  that  is,  the  power 
increases  exponentially  with  the  information  rate  increase  factor  k .  Thus,  in  the  multitone  case, 
the  bandwidth  increases  exponentially  with  k,  and  in  the  multiamplitude  or  multiphase  case, 
the  power  increases  exponentially  with  k . 

The  practical  implication  is  that  we  should  use  multiamplitude  or  multiphase  signaling  if 
the  bandwidth  is  at  a  premium  (as  in  telephone  lines)  and  multitone  signaling  when  power  is 
at  a  premium  (as  in  space  communication).  A  compromise  exists  between  these  two  extremes. 
Let  us  investigate  the  possibility  of  increasing  the  information  rate  by  a  factor  k  simply  by 
increasing  the  number  of  binary  pulses  transmitted  by  a  factor  k.  In  this  case,  the  transmitted 
power  increases  linearly  with  k.  Also  because  the  bandwidth  is  proportional  to  the  pulse  rate, 
the  transmission  bandwidth  increases  linearly  with  k.  Thus,  in  this  case,  we  can  increase 
the  information  rate  by  a  factor  of  k  by  increasing  both  the  transmission  bandwidth  and  the 
transmitted  power  linearly  with  k ,  thus  avoiding  the  phantom  of  the  exponential  increase  that 
was  required  in  the  M  -ary  system.  But  here  we  must  increase  both  the  bandwidth  and  the 
power  whereas  formerly  the  increase  in  information  rate  can  be  achieved  by  increasing  either 
the  bandwidth  or  the  power.  We  have  thus  a  great  flexibility  in  trading  various  parameters  and 
thus  in  our  ability  to  match  our  resources  to  our  requirements. 


Example  1  1  .5  We  are  required  to  transmit  2.08  x  106  binary  digits  per  second  with  Ph  <  I0"6.  Three  possible 


schemes  are  considered: 


(a)  Binary 

(b)  16-ary  ASK 

(c)  16-ary  FSK 

The  channel  noise  PSD  is  Sn(a>)  =  KT8.  Determine  the  transmission  bandwidth  and  the  signal 
power  required  at  the  receiver  input  in  each  case. 

(a)  Binary:  We  shall  consider  polar  signaling  (the  most  efficient  scheme). 


This  yields  Eb/tf  =  1 1  -35.  The  signal  power  S,  =  EhRh.  Hence, 

5,  =  11.35 NRb  =  11-35(2  x  1(T8)(2.08  x  106)  =  0.47  W 
Assuming  raised-cosine  baseband  pulses  of  roll  off  factor  1 ,  the  bandwidth  BT  is 


BT  =  Rb  =  2.08  MHz 


(b)  16-ary 
of  log2  16  =  4  bit 
16-ary  pulses  per 
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and  1.04  MHz  for  modulated  pulses  (assuming  raised-cosine  pulses).  Also, 


Therefore, 


6 Eh  log;  >6 
N(M2  -  1) 


For  M  =  16,  this  yields  Eb  =  0.499  x  10  5.  If  the  M -ary  pulse  rate  is  Rm  ,  then 


Si  =  Epm  Rm  =  Eb  log2  M  ■  Rm 

=  0.499  x  10-5  x  4  x  (0.52  x  106)  =  9.34  W 


(c)  16-ary  PSK:  We  need  transmit  only  Rm  =  0.52  x  106  pulses  per  second.  For 
baseband  pulses,  this  will  require  a  bandwidth  of  520  kHz.  But  PSK  is  a  modulated  signal, 
and  the  required  bandwidth  is  2(0.52  x  106)  =  1 .04  MHz.  Also, 


This  yields  =  137.8  x  10~8  and 


Si  =  Eh  log2  16 Rm 

=  (137.8  x  1(T8)  x  4  x  (0.52  x  I06)  =  2.86  W 


1 1 .8  EQUIVALENT  SIGNAL  SETS 

The  computation  of  error  probabilities  is  greatly  facilitated  by  the  translation  and  rotation  of 


coordinate  axes.  We  now  show  that  such  operations  are  permissible. 

Consider  a  signal  set  with  its  corresponding  decision  regions,  as  shown  in  Fig.  1  F30a- 
The  conditional  probability  P(C\m\ )  is  the  probability  that  the  noise  vector  drawn  from  s  1  hes 
within  R\.  Note  that  this  probability  does  not  depend  on  the  origin  of  the  coordinate  system. 
We  may  translate  the  coordinate  system  any  way  we  wish.  This  is  equivalent  to  translating  the 
signal  set  and  the  corresponding  decision  regions.  Thus,  the  P(C\mj)  for  the  translated  system 
shown  in  Fig.  1 1.30b  is  identical  to  that  of  the  system  in  Fig.  1 1.30a. 

In  the  case  of  Gaussian  noise,  we  make  another  important  observation.  The  rotation  of  the 
coordinate  system  does  not  affect  the  error  probability  because  the  noise- vector  probability 
density  has  spherical  symmetry.  To  show  this,  we  shall  consider  Fig.  1 1 .30c,  which  shows  the 
signal  set  in  Fig.  11.30a  translated  and  rotated.  Note  that  a  rotation  of  the  coordinate  systcn 
is  equivalent  to  a  rotation  of  the  signal  set  in  the  opposite  sense.  Here  for  convenience 
rotate  the  signal  set  instead  of  the  coordinate  system.  It  can  be  seen  that  the  probability  that  the 


noise  vector  n  drawn  from  s\  lies  \n  R\  is  the  same  in  Fig.  1 1.30a  and  c,  since  this  probability 
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(b) 


<P  i— 


is  given  by  .he  integral  of  the  noise  probability  density  />„<» I  over  the  region  R,  Because 
„  ?„  ltM  spherical  symmetry  for  Gaussian  noise,  the  probabtl.ty  will  remit,  n  unaffected  by  a 
"  .  f/  •  n  o  rie  irlv  for  additive  Gaussian  channel  noise,  translation  and  rotation 

and  rotation  of  the  signal  set,  do  no,  affect  the  error 
probability  Note  that  when  we  rotate  or  translate  a  set  of  signals,  the  resulting  set  represents 
al  entirely  different  se,  of  signals.  Ye,  the  error  probabilities  of  the  two  sets  are  tdenucal.  Such 

S"S  SltTg'elmJ“Smonsua,es  the  utility  of  translatton  and  rotation  of  a  signal  se, 
in  the  computation  of  error  probability. 


Example  1  1 .6  A  quaternary  PSK  (QPSK)  signal  set  is  shown  in  Fig.  1 1.31a: 

S|  =  — *2  =  <P\ 

S3  =  -s4  = 


■■ 
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Assuming  all  symbols  to  be  equiprobable,  determine  PeM  for  an  AWGN  channel  with  noise 
PSD  jV/2. 


I 


Figure  1 1 .31 

Analysis  of 
QPSK. 


(a) 


(b) 


This  problem  has  already  been  solved  in  Example  1 1 .4  for  a  general  value  of  M .  Here  we 
shall  solve  it  for  M  =  4  to  demonstrate  the  power  of  the  rotation  of  axes. 

Because  all  the  symbols  are  equiprobable,  the  decision  region  boundaries  will  be 
perpendicular  bisectors  of  lines  joining  various  signal  points  (Fig.  1 1.31a).  Now 

P(C\m\)  =  P(noise  vector  originating  at  s i  remains  in  R\)  (1 1.124) 


This  can  be  found  by  integrating  the  joint  PDF  of  components  nj  and  n2  (originating  at 
si)  over  the  region  .  This  double  integral  can  be  found  by  using  suitable  limits,  as  in 
Eq.  (1 1.106).  The  problem  is  greatly  simplified,  however,  if  we  rotate  the  signal  set  by 
45°,  as  shown  in  Fig.  11.31b.  The  decision  regions  are  rectangular,  and  if  ni  and  n2  are 
noise  components  along  <p\  and  (p 2,  then  Eq.  (1 1.124)  can  be  expressed  as 


P(C\m\)  = 


=  /■(".  >-/f.n2>-y|) 

=  '>("!»  -/§)  P  (n,  >  -/f) 

■[-(/ 5)1 


II 

['-«(£)] 

= 

1 - 1 

1 

(O 
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2 

(11.125a) 

(11.125b) 


1  1.8.1  Minimum  Energy  Signal  Set 


As  noted  earlier,  an  infinite  number  of  possible  equivalent  signal  sets  exist.  Because  signal 
energy  depends  on  its  distance  from  the  origin,  however,  equivalent  sets  do  not  necessarily 
have^the  same  average  energy.  Thus,  among  the  infinite  possible  equivalent  signal  sets,  the 
one  in  which  the  signals  are  closest  to  the  origin,  has  the  minimum  average  signal  energy  (or 
transmitted  power). 

Let  mi,  m2,  be  M  messages  with  waveforms  si(r),  *2(0 . sM(t),  repre¬ 
sented,  respectively,  by  points  *i,  «2.  in  the  signal  space.  The  mean  energy  of  these 

signals  is  E ,  given  by 

M 

£=£>(/*,)  IlSill2 
1=1 

Translation  of  this  signal  set  is  equivalent  to  subtracting  some  vector  a  from  each  signal.  We 
now  use  this  simple  operation  to  yield  a  minimum  mean  energy  set.  We  basically  wish  to  find 
the  vector  o  such  that  the  new  mean  energy 


M 


(11.126) 


E'  =  ^>(m,)||Si  -a| 
i=i 

is  minimum  We  can  show  that  a  must  be  the  center  of  gravity  of  M  points  located  at 
S2 . sM  with  masses  />(m,),  P(m2) . P(mM),  respectively, 

M 


a  =  2>(m,)s,=S?  (H.127) 

(=1 

To  prove  this,  suppose  the  mean  energy  is  minimum  for  some  translation  b.  Then 
M 

E'  =  ^2P(nti)\\Si  -b\\- 

i=t 

M 

=  £>(m/)ll(*/-fl)  +  (a-b)\\- 

'=l  m  m 

=  YP(mi)\\Si-a\\2  +  2<(a-b),J^P(mi)(Si-a)>  +  J^P(mi)\\a-b\\- 

(=1  i=l 


/=! 


■  ' 
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Observe  that  the  second  term  in  the  foregoing  expression  vanishes  according  to  Eq.  (1 1.127) 
because 

M  MM 

Y  P(mi)(Si  -  a)  =  Y  ~aYl  P(m) 

/=1  1=1  i=  1 

=  a  -  a  •  1  =  0 

Hence, 


M  M 

E'  =  Y  p(m>)  11*1  -  all2  +  Y  P{nti)  II"  -  ^ll2 

1=1  1=1 

This  is  minimum  when  b  —  a.  Note  that  the  rotation  of  the  coordinates  does  not  change  the 
energy,  and,  hence,  there  is  no  need  to  rotate  the  signal  set  to  minimize  the  energy  after  the 
translation. 


Example  1  1  .7  For  the  binary  orthogonal  signal  set  of  Fig.  1 1.32a,  determine  the  minimum  energy  equivalent 
signal  set. 


Figure  1 1 .32 

Equivalent  signal 
sets. 


The  minimum  energy  set  for  this  case  is  shown  in  Fig.  1 1 ,32b.  The  origin  lies  at  the  center 
of  gravity  of  the  signals.  We  have  also  rotated  the  signals  for  convenience.  The  distances 
&i  and  ki  must  be  such  that 


k\  +  k2  =  d 
and 


k\P(m\)  =  k2p(m2 ) 
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Solution  of  these  two  equations  yields 

*1  =  P(m2)d 


and 

*2  =  P(m\)d 

Both  signal  sets  (Fig.  1 1 .32aand  b)  have  the  same  error  probability,  but  the  latter  has 
a  smaller  mean  energy.  If  £  and  £'  are  the  respective  mean  energies  of  the  two  sets,  then 

_  d2  dr  d2 

E  =  P(mi)— +  P(m2)~  =  y 


and 

T'  =  P(m\)k\  +  P(m2)kl 

=  P(m\)P2(mi)d 2  +  P(m2)P2(m\)d2 
=  P(mi)P(m2)d2 

Note  that  for  P(m\)  +  P(m2)  =  1.  the  product  P(m\)P(m2)  is  maximum  when  P(m\ )  = 
P(m2)  =  1/2,  in  which  case 

1 

P(m\)P(m2 )  =  - 


and  consequently 


Therefore, 


and  for  the  case  of  equiprobable  signals. 


In  this  case. 


d 

k\  =*2=2 

d2  —  d2 

E  =  —  and  £'  =  — 

2  4 


The  signals  in  Fig.  1 1 .32b  are  called  antipodal  signals  when  *,  = 
of  the  Signal  set  in  Fig.  1 1  32a  (and  1 1 .32b)  is  eqnal  to  that  in  Fig. 

from  Eq.  (1 1.97a). 


&2 •  The  error  probability 
1 1 .22a  and  can  be  found 
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As  a  concrete  example,  let  us  choose  the  basis  signals  as  sinusoids  of  frequency 
co()  =  2tt/Tm : 


0  <  t  <  Tm 


Hence, 


S|<,)  =  ^,<,)=vfc 


sin  co0t 


sin  2(o, ,t 


0<t<TM 


The  signals  si(f)  and  S2(t)  are  shown  in  Fig.  1 1.32c,  and  the  geometrical  representation 
is  shown  in  Fig.  1 1 .32a.  Both  signals  are  located  at  a  distance  d /V2  from  the  origin,  and 
the  distance  between  the  signals  is  d. 

The  minimum  energy  signals  s',  (t)  and  s'2(t)  for  this  set  are  given  by 


•s'i(')  =  J —P(m2)d  sin  <o0t 


0  <  t  <  Tm 


4W  =  -,/  Y^P('m^d  sin  Wot 


These  signals  are  sketched  in  Fig.  1 1.32d. 


1  1 .8.2  Simplex  Signal  Set 

A  minimum  energy  equivalent  set  of  an  equiprobable  orthogonal  set  is  called  a  simple 
or  transorthogonal,  signal  set.  A  simplex  set  can  be  derived  as  an  equivalent  set  from  the 
orthogonal  set  in  Eq.  ( 1 1 . 1 1 5). 

To  obtain  the  minimum  energy  set,  the  origin  should  be  shifted  to  the  center  of  gravity  of  the 
signal  set.  For  the  two-dimensional  case  (Fig.  1 1 .33a),  the  simplex  set  is  shown  in  Fig*  1  ^  c 
and  for  the  three-dimensional  case  (Fig.  1 1 .33b),  the  simplex  set  is  shown  in  Fig.  1 1 
that  the  dimensionality  of  the  simplex  signal  set  is  less  than  that  of  the  orthogonal  set  b>  • 
This  is  true  in  general  tor  any  value  of  M .  It  can  be  shown  that  the  simplex  signal  set  is 
optimum  (minimum  error  probability)  for  the  case  of  equiprobable  signals  embedded  in  whi 
Gaussian  noise  when  energy  is  constrained.4, 8 
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Figure  1 1 .33 

Simplex  signals. 


We  can  calculate  the  mean  energy  of  the  simplex  set  by  noting  that  it  is  obtained  by 
translating  the  orthogonal  set  by  a  vector  a.  gi  ven  in  Eq.  ( 1 1 . 1 27), 


M 


a  =  -ySi 

M  ^ 


1=1 


For  orthogonal  signals. 


Si  =  JE<Pi 


Therefore, 


-*r2> 


1=1 


where  E  is  the  energy  of  each  signal  in  the  orthogonal  set  and  Vi  is  the  unit  vector  along  the 
,th  coordinate  axis.  The  signals  in  the  simplex  set  are  g.ven  by 


s'k=Sk~a 

=  V£.,-^£>,  <"l28> 

i=  1 


The  energy  E’  of  signal  s'k  is  given  by  \s’k\2. 


E’  =  <s'k,s’k> 


(11.129) 
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Substituting  Eq.  (1 1.128)  into  Eq.  (1 1.129)  and  observing  that  the  set  <p,  is  orthonormal,  we 
have 


(11.130) 


Hence,  for  the  same  performance  (error  probability  ),  the  mean  energy  of  the  simplex  signal  set 
is  1  —  1  /M  times  that  of  the  orthogonal  signal  set.  For  M  1 ,  the  difference  is  not  significant. 
For  this  reason  and  because  of  the  simplicity  in  generating,  orthogonal  signals,  rather  than 
simplex  signals  are  used  in  practice  whenever  M  exceeds  4  or  5. 

In  Sec.  14.6,  we  shall  show  that  in  the  limit  as  M  — >  oo,  the  orthogonal  (as  well  as  the 
simplex)  signals  attain  the  upper  bound  of  performance  predicted  by  Shannon’s  theorem. 


1 1 .9  NONWHITE  (COLORED)  CHANNEL  NOISE 

Thus  far  we  have  restricted  our  analysis  exclusively  to  white  Gaussian  channel  noise.  Our 
analysis  can  be  extended  to  nonwhite,  or  colored,  Gaussian  channel  noise.  To  proceed, 
the  Karhunen-Loeve  expansion  of  Eq.  (11.57)  must  be  solved  for  the  colored  noise  with 
autocorrelation  function  Rx(t,  t\).  This  general  solution,  however,  can  be  quite  complex  to 
implement.4 

Fortunately,  for  a  large  class  of  colored  Gaussian  noises,  the  power  spectral  density  5nM 
nonzero  within  the  message  signal  bandwidth  B.  This  property  provides  an  effective  alternative. 
We  use  a  noise-whitening  filter  H(f)  at  the  input  of  the  receiver,  where 

H(f)  =  -  1  e-P*fid 

The  delay  t d  is  introduced  to  ensure  that  the  whitening  filter  is  causal  (realizable). 

Consider  a  signal  set  { j,  (r)|  and  achannel  noise  n(/)  that  is  not  white  [Sn  (f)  is  not  constant]. 
At  the  input  of  the  receiver,  we  use  a  noise-whitening  filter  H(f)  that  transforms  the  colored 
noise  into  white  noise  (Fig.  1 1.34).  But  it  also  alters  the  signal  set  {s,(r)J  to  {^(r)},  where 

s'i(t)  =  si(t)  *h(t) 

We  now  have  a  new  signal  set  {^(r)}  mixed  with  white  Gaussian  noise,  for  which  the  optimum 
receiver  and  the  corresponding  error  probability  can  be  determined  by  the  method  discussed 
earlier. 


Figure  1 1 .34 

Optimum  M-ary 
receiver  for 
non  white 
channel  noise. 
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11.10  OTHER  USEFUL  PERFORMANCE  CRITERIA 

The  optimum  receiver  uses  the  decision  strategy  that  makes  the  best  possible  use  of  the  observed 
data  and  any  a  priori  information  available.  The  strategy  will  also  depend  on  the  weights 
assigned  to  various  types  of  error.  In  this  chapter  we  have  thus  far  assumed  that  all  errors  have 
equal  weight  (or  equal  cost).  This  assumption  is  not  justified  in  all  cases,  and  we  may  therefore 
have  to  alter  the  decision  rule. 

Generalized  Bayes  Receiver 

If  we  are  given  a  priori  probabilities  and  the  cost  functions  of  errors  of  various  types,  the 
receiver  that  minimizes  the  average  cost  of  decision  is  called  the  Bayes  receiver,  and  the 
decision  rule  is  Bayes’  decision  rule.  Note  that  the  receiver  that  has  been  discussed  so  far 
is  the  Bayes  receiver  under  the  condition  that  all  errors  have  equal  cost  (equal  weight).  To 
generalize  this  rule,  let 

Ckj  =  cost  of  deciding  that  m  =  mk  when  m,  was  transmitted  (11.131) 


and,  as  usual, 

P(rtn\q)  =  conditional  probability  that  wt,  was  transmitted  when  q  is  received 

If  q  is  received,  then  the  probability  that  ntj  was  transmitted  is  P(mj \q )  for  ally  =  1.  2 . M. 

Hence,  the  average  cost  of  the  decision  m  =  mk  is  given  by 

Pk  =  CklP(mi\q)  +  Ck2P(m\<l)  H - b 

M 

=  CkjP(mj\q)  (11.132) 

j= l 

Thus,  if  q  is  received,  the  optimum  receiver  decides  that  in  =  mk  if 

Pk  <  Pi  f°r  all  /  ^  k 


or 


M 


M 


y  CkjP(ntj\q)  <  ^2  ctip(mjW  for  all  /  /  A-  (11.1 33) 

;=  i  j=l 


(11.134) 


Use  of  Bayes’  mixed  rule  in  Eq.  (1 1.133)  yields 
M  M 

Y  CkjP(inj)Pq(q\mj)  <  CijP(mj)pq(q\mj)  for  all  i  *  k 

u 

Note  that  Ckk  is  the  cost  of  setting  m  =  mk  when  mk  is  transmitted.  This  cost  is  generally  zero. 
If  we  assign  equal  weight  to  all  other  errors,  then 


1°  k  =j 

Ck>  '  1 1  k^j 


(11.135) 
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and  the  decision  rule  in  Eq.  (1 1.134)  reduces  to  the  rule  in  Eq.  (11.83),  as  expected.  The 
generalized  Bayes  receiver  for  M  =  2,  assuming  C\\  =  C22  =  0.  sets  m  =  m  1  if 


CnP(m2)pq(g\m2)  <  C2\P(,m\)Pq(q\m\) 

Otherwise,  the  receiver  decides  that  m  —  m2. 

Maximum  Likelihood  Receiver 

The  strategy  used  in  the  Bayes  receiver  discussed  in  the  preceding  subsection  is  general,  except 
that  it  can  be  implemented  only  when  the  a  priori  probabilities  P(m\ ),  P(m2),  . . . ,  P(^M )  are 
known.  Frequently  this  information  is  not  available.  Under  these  conditions  various  possibili¬ 
ties  exist,  depending  on  the  assumptions  made.  When,  for  example,  there  is  no  reason  to  expect 
any  one  signal  to  be  more  likely  than  any  other,  we  may  assign  equal  probabilities  to  all  the 
messages: 

1 

P(m\ )  =  P(m2)  =  •  •  •  =  P(mM )  =  — 

M 


Bayes’  rule  [Eq.  (1 1 .83)]  in  this  case  becomes:  set  m  =  ra*  if 

Pq(q\mk)  >  PqtoK)  for  all  i^k  (1 ll36) 

Observe  that  pq(q |m*)  represents  the  probability  of  observing  q  when  m *  is  transmitted.  Thus, 
the  receiver  chooses  that  signal  which,  when  transmitted,  will  maximize  the  likelihood  (proba¬ 
bility)  of  observing  the  received  q.  Hence,  this  receiver  is  called  the  maximum  likelihood 
receiver.  Note  that  the  maximum  likelihood  receiver  is  a  Bayes  receiver  for  the  cost  of 
Eq.  (11.135)  under  the  condition  that  the  a  priori  message  probabilities  are  equal.  In  terms 
of  geometrical  concepts,  the  maximum  likelihood  receiver  decides  in  favor  of  that  signal 
which  is  closest  to  the  received  data  q.  The  practical  implementation  of  the  maximum  like¬ 
lihood  receiver  is  the  same  as  that  of  the  Bayes  receiver  (Figs.  11.18  and  11.19)  under  the 
condition  that  all  a  priori  probabilities  are  equal  to  1 JM . 

If  the  signal  set  is  geometrically  symmetrical,  and  if  all  a  priori  probabilities  are  equal 
(maximum  likelihood  receiver),  then  the  decision  regions  for  various  signals  are  congruent. 
In  this  case,  because  of  symmetry,  the  conditional  probability  of  a  correct  decision  is  the  same 
no  matter  which  signal  is  transmitted,  that  is, 

P(C\mi)  =  constant  for  all  i 

Because 

M 

/><0  =  5>(m,)l»(C  K) 

1=1 

in  this  case 

P(C)  =  P(C\ntj)  (1,J37) 

Thus,  the  error  probability  of  the  maximum  likelihood  receiver  is  independent  of  the  at'*u  , 
source  statistics  P(m/)  for  the  case  of  symmetrical  signal  sets.  It  should,  however,  be  realize 


Figure  1 1 .35 
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that  if  the  actual  source  statistics  were  known  beforehand,  one  could  use  Bayes'  decision  rule 
to  design  a  better  receiver. 

It  is  apparent  that  if  the  source  statistics  are  not  known,  the  maximum  likelihood  receiver 
proves  very  attractive  for  a  symmetrical  signal  set.  In  such  a  receiver  one  can  specify  the  error 
probability  independently  of  the  actual  source  statistics. 

Minimax  Receiver 

Designing  a  receiver  with  a  certain  decision  rule  completely  specifies  the  conditional 
probabilities  P(C\m,).  The  probability  of  error  is  given  by 

PeM  =  1  -  P{C) 

M 

=  1  -  y:  P(mi)P(C\mi) 
i=  1 

Thus,  in  general,  for  a  given  receiver  (with  some  specified  decision  rule)  the  error  probability 
depends  on  the  source  statistics  P(m,).  The  error  probability  is  the  largest  for  some  source 
statistics.  The  error  probability  in  the  worst  possible  case  is  [  PeM  ]max  and  represents  the  upper 
bound  on  the  error  probability  of  the  given  receiver.  This  upper  bound  [PeM  Imax  serves  as  an 
indication  of  the  quality  of  the  receiver.  Each  receiver  (with  a  certain  decision  rule)  will  have  a 
certain  [ PeM  ]max-  The  receiver  that  has  the  smallest  upper  bound  on  the  error  probability,  that 
is,  the  minimum  [/\>W]max,  is  called  the  minimax  receiver. 

We  shall  illustrate  the  minimax  concept  for  a  binary  receiver  with  on-off  signaling.  The 
conditional  PDFs  of  the  receiving-filter  output  sample  r  at  /  =  Th  are  p(r  1 1 )  and  p(r  |0).  These 
are  the  PDFs  of  r  for  the  “on”  and  the  “off”  pulse  (i.e.,  no  pulse),  respectively.  Figure  1 1 .35a 
shows  these  PDFs  with  a  certain  threshold  a.  If  we  receive  r  >  a,  we  choose  the  hypothesis 
“signal  present”  (1).  and  the  shaded  area  to  the  right  of  a  is  the  probability  of  false  alarm 
(deciding  “signal  present”  when  in  fact  the  signal  is  not  present).  If  r  <  a,  we  choose  the 
hypothesis  “signal  absent”  (0),  and  the  shaded  area  to  the  left  of  a  is  the  probability  of  false 
dismissal  (deciding  “signal  absent”  when  in  fact  the  signal  is  present).  It  is  obvious  that  the 
larger  the  threshold  a,  the  larger  the  false  dismissal  error  and  the  smaller  the  false  alarm  error 

(Fig.  11.35b).  .  . 

We  shall  now  find  the  minimax  condition  for  this  receiver.  For  the  mirnmax  receiver,  we 

consider  all  possible  receivers  (all  possible  values  of  a  in  this  case)  and  find  the  maximum 
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error  probability  (or  cost)  that  occurs  under  the  worst  possible  a  priori  probability  distribution. 
Let  us  choose  a  =  au  as  shown  in  Fig.  1 1.35b.  In  this  case  the  worst  possible  case  occurs 
when  P(0)  =  1  and  P(  1)  =  0,  that  is,  when  the  signal  s(t)  is  always  absent.  The  type  of 
error  in  this  case  is  false  alarm.  These  errors  have  a  cost  C\ .  On  the  other  hand,  it  we  choose 
a  =  ci2,  the  worst  possible  case  occurs  when  />(())  =  0  and  P(l)  =  1,  that  is,  when  the  signal 
is  always  present,  causing  only  the  false-dismissal  type  of  errors.  These  errors  have  a  cost  Ci. 
It  is  evident  that  for  the  setting  a  =  a,  the  costs  of  false  alarm  and  false  dismissal  are  equal, 
namely,  Ca.  Hence,  for  all  possible  source  statistics  the  cost  is  Ca.  Because  Ca  <  C\  and  C2, 
this  cost  is  the  minimum  of  the  maximum  possible  cost  (because  the  worst  cases  are  considered) 
that  accrues  for  all  values  of  a.  Hence,  a  =  a  represents  the  minimax  setting. 

It  follows  from  this  discussion  that  the  minimax  receiver  is  rather  conservative.  It  is 
designed  under  the  pessimistic  assumption  that  the  worst  possible  source  statistics  exist.  The 
maximum  likelihood  receiver,  on  the  other  hand,  is  designed  on  the  assumption  that  all  mes¬ 
sages  are  equally  likely.  It  can,  however,  be  shown  that  for  a  symmetrical  signal  set,  the 
maximum  likelihood  receiver  is  in  fact  the  minimax  receiver.  This  can  be  proved  by  observing 
that  for  a  symmetrical  set,  the  probability  of  error  of  a  maximum  likelihood  receiver  (equal  a 
priori  probabilities)  is  independent  of  the  source  statistics  [Eq.  (1 1.137)].  Hence,  for  a  sym¬ 
metrical  set,  the  error  probability  PeM  =  ot  of  a  maximum  likelihood  receiver  is  also  equal 
to  its  [PeM  ]max-  We  now  show  that  no  other  receiver  exists  whose  [PeM  ]max  is  less  than  the 
a  of  a  maximum  likelihood  receiver  for  a  symmetrical  signal  set.  This  is  seen  from  the  tact 
that  for  equiprobable  messages,  the  maximum  likelihood  receiver  is  optimum  by  definition. 
All  other  receivers  must  have  PeM  >  o'  for  equiprobable  messages.  Hence,  \PeM  Imax  for  these 
receivers  can  never  be  less  than  o'.  This  proves  that  the  maximum  likelihood  receiver  is  indeed 
the  minimax  receiver  for  a  symmetrical  signal  set. 


11.11  NONCOHERENT  DETECTION 

If  the  phase  0  in  the  received  RF  pulse  \fl p'(t)  cos  (coct  +  6)  is  unknown,  we  can  no  longer 
use  coherent  detection  techniques.  Instead,  we  must  rely  on  noncoherent  techniques,  such  as 
envelope  detection.  It  can  be  shown9, 10  that  when  the  phase  6  of  the  received  pulse  is  random 
and  uniformly  distributed  over  (0,  2tt).  the  optimum  detector  is  a  filter  matched  to  the 
pulse  y/l p'(t)  cos  coct  followed  by  an  envelope  detector,  a  sampler  (to  sample  at  t  =  Tbh an 
a  comparator  to  make  the  decision  (Fig.  1 1 .36). 


Amplitude  Shift  Keying 

The  noncoherent  detector  for  ASK  is  shown  in  Fig.  1 1 .36.  The  filter  H  (f)  is  a  filter  matched  to 
the  RF  pulse,  ignoring  the  phase.  This  means  the  filter  output  amplitude  Ap  will  not  necessan  > 
be  maximum  at  the  sampling  instant.  But  the  envelope  will  be  close  to  maximum  at  the  samp in- 
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instant  (Fig.  1 1.36).  The  matched  filter  output  is  now  detected  by  an  envelope  detector.  The 
envelope  is  sampled  at  t  =  T\,  for  making  the  decision. 

When  a  1  is  transmitted,  the  output  of  the  envelope  detector  at  /  =  7),  is  an  envelope  of  a 
sine  wave  of  amplitude  Ap  in  a  Gaussian  noise  of  variance  a* .  In  this  case,  the  envelope  r  has 
a  Ricean  density,  given  by  [Eq.  (9.86a)] 


(11.138a) 


Also,  when  Ap  an  (small-noise  case)  from  Eq.  (9.86c),  we  have 


(11.138b) 


,-{r-Ap)2/2oi 


(11.138c) 


e 


Observe  that  for  small  noise,  the  PDF  of  r  is  practically  Gaussian,  with  mean  Ap  and  variance 
a\.  When  0  is  transmitted,  the  output  of  the  envelope  detector  is  an  envelope  of  a  Gaussian 
noise  of  variance  a~.  The  envelope  in  this  case  has  a  Rayleigh  density,  given  by  [Eq.  (9.81 )] 


Both  pT(r\n\  =  1)  and  pT(r\m  =  0)  are  shown  in  Fig.  1 1.37.  Using  the  argument  used  earlier 
(see  Fig.  1 1 .4),  the  optimum  threshold  is  found  to  be  the  point  where  the  two  densities  intersect. 
Hence,  the  optimum  threshold  aQ  is 


or 


This  equation  is  satisfied  to  a  close  approximation  for 


F'gure  11.37 

Conditional  PDFs  /?rlm 
in  the 

noncoherent 
detection  of  ASK 
signals. 


pT(  rim  =  0) 


r 


■f  ■  • 


6 


682  PERFORMANCE  ANALYSIS  OF  DIGITAL  COMMUNICATION  SYSTEMS 

Because  the  matched  filter  is  used,  Ap  =  Ep  and  a\  =  A  Ep/2.  Moreover,  tor  ASK  there  are, 
on  the  average,  only  Rh/ 2  nonzero  pulses  per  second.  Thus,  Eh  —  Ep/2.  Hence, 


_  —  A  — 

“  Af  ~  AT 


and 


a  /i  +  2  (11.139a) 

a°-bhf  +  Eb/ N 

Observe  that  the  optimum  threshold  is  not  constant  but  depends  on  Eb/Af  •  This  is  a  serious 
drawback  in  a  fading  channel.  For  a  strong  signal,  Eb/Af  1, 


a0  ~  Eh  = 


^ £ 
2 


(11.139b) 


and 


rOO 

P(e|m  =  0)  =  /  pr(r|m  =  0)  dr 
JAp/2 


r  dr 

Jap/2  <*i 


=  e-\Eb/M 


(11.140) 


Also, 


[AP!  2 

P(e|m  =  1)  =  /  pT(r\m  =  l)dr 

J— 00 

Evaluation  of  this  integral  is  somewhat  cumbersome.4  For  a  strong  signal  (that  is,  for  £/>/A  ^ 
1),  the  Ricean  PDF  can  be  approximated  by  the  Gaussian  PDF  [Eq.  (9.86c)],  and 


P(e|m  =  1)  % 


=  Q 


(11.14D 


As  a  result. 


Figure  1 1 ,3( 
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Pb  =  P m(0)(f|m  =  0)  +  Pm(l)P(e|rn  =  1) 
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Assuming  Pm(  1)  =  Pm(0)  =  0.5, 


(11.142a) 


Using  the  Q(  )  approximation  in  Eq.  (8.38a), 


Pb  ~  2  (‘  +  J2nEh/M  ) 


e-\Eb/M  Eb/x  »  j 


(11.142b) 


(11.142c) 


Figure  1 1 .38 
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Note  that  in  an  optimum  receiver,  for  Eb/N  »  1,  P(e|m  =  I)  is  much  smaller  than  P(t 
|m  =  0).  For  example,  at  Eb/M  =  10,  P(e |m  =  0)  ~  8.7  P(€|m  =  1).  Hence,  mistaking  0 
for  1  is  the  type  of  error  that  predominates.  The  timing  information  in  noncoherent  detection 
is  extracted  from  the  envelope  of  the  received  signal  by  methods  discussed  in  Sec.  7.5.2. 

For  a  coherent  detector. 


e-\Eb/N  Ebitfy>  i 


(11.143) 


sj2nEh/N 


This  appears  similar  to  Eq.  (1 1.142c)  (the  noncoherent  case).  Thus  for  a  large  Eh/M,  the 
performances  of  the  coherent  detector  and  the  envelope  detector  are  similar  (Fig.  1 1 .38).  This 
is  akin  to  the  behavior  observed  in  the  case  of  analog  signals  (see  Chapter  10). 

Frequency  Shift  Keying 

A  noncoherent  receiver  for  FSK  is  shown  in  Fig.  11.39.  The  filters  H0(f)  and  H\(f)  are 
matched  to  the  two  RF  pulses  corresponding  to  0  and  1,  respectively.  The  outputs  of  the 
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Figure  1  1 .39 

Noncoherent 
detection  of 
binary  FSK. 


envelope  detectors  at  t  =  7/ ,  are  ro  and  r\,  respectively.  The  noise  components  of  outputs  ot 
filters  H0(f)  and  H\ (f)  are  the  Gaussian  RVs  no  and  nj ,  respectively,  with  rxn<,  =  crn,  =  ^n- 
If  1  is  transmitted  (m  =  1),  then  at  the  sampling  instant,  the  envelope  n  has  the 
Ricean  PDF* 

CTn  V  ai  ) 

and  ro  is  the  noise  envelope  with  Rayleigh  density 


Pr0(r0)=r-\e-'2^ 


The  decision  is  m  =  1  if  ri  >  ro  and  m  =  0  if  ri  <  ro.  Hence,  when  binary  1  is  transmitted, 
an  error  is  made  if  ro  >  r  j , 


P(€|m  =  1)  =  P( r0  >  rj) 

The  event  ro  >  v\  is  the  same  as  the  joint  event  “r  j  has  any  positive  value7  and  ro  has  a  value 
greater  than  ri”  This  is  simply  the  joint  event  (0  <  rj  <  oo,  ro  >  rj).  Hence, 

P(e\m  =  1)  =  P(0  <  ri  <  oo,  ro  >  r\ ) 

pOO  poo 

=  /  Pr,r„(n,  ro)drt  dro 

J0  Jr  i 

Because  ri  and  r0  are  independent,  pr,ro  =  pripTu .  Hence, 

=..=n 

Jo 

f°°  n_ 

Jo 


P(e|m  =1)  =  /  -J^e  ('7+/,;;>/2<TS 


\°l  )  Jr,  °l 


Letting  .v  =  s/l  r\  and  a  =  A„/\/2,  we  have 


—  U  -  i  0~Kl^o, 


P(t\m  =  1)  =  je 


»  r  *-#+*/** h(”)  dx 

Jo  °n  W/ 


An  orthogonal  FSK  is  assumed.  This  ensures  that  ro  and  rj  have  Rayleigh  and  Rice  densities, 
is  transmitted. 

rl  is  the  envelope  detector  and  can  take  only  positive  values. 


respectively. 'vhenl 
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Observe  that  the  integrand  is  a  Ricean  density,  and,  hence,  its  integral  is  unity.  Therefore, 


P(€  |m  =  l)  = 


Note  that  for  a  matched  filter. 


2  _tf  =  2En 
P,nax  ~  al~  M 

For  FSK,  Eh  =  Ep,  and  Eq.  (1 1.144a)  becomes 


P(«|m  =  1)  =  \  e-^Eb,M 


(11.144a) 


(II. 144b) 


Similarly, 


P(*|m  =  0)  =  j  e~^EblM 


(1 1.144c) 


and 

Pb  =  \  e-iEb/M  (11.145) 

This  behavior  is  similar  to  that  of  noncoherent  ASK  [Eq.  (1 1.142c)].  Again  we  observe  that 
for  £/,/ A/"  1.  the  performance  of  coherent  and  noncoherent  FSK  are  essentially  similar. 

From  the  practical  point  of  view,  FSK  is  to  be  preferred  over  ASK  because  FSK  has  a 
fixed  optimum  threshold,  whereas  the  optimum  threshold  of  ASK  depends  on  Eh/N  (the  signal 
level)  Hence  ASK  is  particularly  susceptible  to  signal  fading.  Because  the  decision  ol  I  SK 
involves  a  comparison  between  r0  and  r,,  both  variables  will  be  affected  equally  by  signal 
fadin»  Hence,  channel  fading  does  not  degrade  the  noncoherent  FSK  performance  as  it  does 
the  noncoherent  ASK.  This  is  the  outstanding  advantage  of  noncoherent  FSK  over  noncoherent 
ASK  In  addition  unlike  noncoherent  ASK.  probabilities  P(t |m  =  1)  and  P(f|m  =  0)  are 
equal  in  noncoherent  FSK.  The  price  paid  by  FSK  for  such  an  advantage  is  its  larger  bandwidth 

requirement. 


Noncoherent  MFSK  .  .  .  . 

From  the  practical  point  of  view,  phase  coherence  of  M  frequencies  is  difficult  to  maintain. 
Hence  in  practice,  coherent  MFSK  is  rarely  used.  Noncoherent  MFSK  is  much  more  common 
The  receiver  for  noncoherent  MFSK  is  similar  to  that  for  binary  noncoherent  FSK  (Fig.  1139), 
but  with  M  banks  corresponding  to  M  frequences,  in  which  filter  H,(f )  is  matched  to  the  RF 
pulse p(t)  cos  ant.  The  analysis  is  straightforward.  if  m  =  1  is  transmitted,  then  r,  is  the  enve¬ 
lope  of  a  sinusoid  of  amplitude  Ar  plus  bandpass  Gaussian  noise,  and  r,  (j  -  2 . 

the  envelope  of  the  bandpass  Gaussian  noise.  Hence,  r,  has  Ricean  density,  and  r2  r3 . rM 

have  Rayleigh  density.  From  the  same  arguments  used  in  the  coherent  case,  we  have 


PcM  =  P(C |m  =  1)  =  P(0  <  r,  <  oo,  n2  <  n,  n3  <  r, . nM  < 


n) 

dr\ 
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Substituting  r^/2a~  =  x  and  ( Ap/an )2  —  2Ep/N  =  2Eb  log  M  / Af,  we  obtain 


PCM  =  e j f”  e-*(l  -  /„  ( 2V  (11J46a) 

Using  the  binomial  theorem  to  expand  (1  —  _1,  we  obtain 

m=0  '  ' 

Substitution  of  this  equality  into  Eq.  ( 1 1.146a)  and  recognizing  that 

rye-ay2I0(by)dy=^-e-*/4a 

Jo  2a 

we  obtain  (after  interchanging  the  order  of  summation  and  integration) 

(M  -  l\  (~1  )m  c-mEh  \og2M/M(m+\)  (11.146b) 

V  rn  )  m  +  \ 


( M  1 )  (~l)m+l  £)|  iog2M/Mm+l)  (1 1.146c) 

V  m  /  m  + 1 

The  error  probability  is  shown  in  Fig.  1 1.40  as  a  function  of  EbjH  ■  It  can  be  seen  that 
the  performance  of  noncoherent  MFSK  is  only  slightly  inferior  to  that  of  coherent  MFSK. 
particularly  for  large  M . 

Differentially  Coherent  PSK 

Just  as  it  is  impossible  to  demodulate  a  DSB-SC  signal  with  an  envelope  detector,  it  is  also 
impossible  to  demodulate  PSK  (which  is  really  DSB-SC)  noncoherently.  We  can,  however, 
demodulate  PSK  without  the  synchronous,  or  coherent,  local  carrier  by  using  what  is  known 
as  differential  PSK  (DPSK). 

The  optimum  receiver  is  shown  in  Fig.  1 1 .4 1 .  This  receiver  is  very  much  like  a  correlation 
detector  (Fig.  1 1 .3),  which  is  equivalent  to  a  matched  filter  detector.  In  a  correlation  detector,  we 
multiply  pulse p(t)  by  a  locally  generated  pulse p(t).  In  the  case  of  DPSK,  we  take  advantageot 
the  fact  that  the  two  RF  pulses  used  in  transmission  are  identical  except  for  the  sign  (or  phase). 
In  the  detector  in  Fig.  1 1.41,  we  multiply  the  incoming  pulse  by  the  preceding  pulse.  Hence, 
the  preceding  pulse  serves  as  a  substitute  for  the  locally  generated  pulse.  The  only  difference 
is  that  the  preceding  pulse  is  noisy  because  of  channel  noise,  and  this  tends  to  degrade  the 
performance  in  comparison  to  coherent  PSK.  When  the  output  r  is  positive,  the  present  pulse 
is  identical  to  the  previous  one,  and  when  r  is  negative,  the  present  pulse  is  the  negative  o 
the  previous  pulse.  Hence,  from  the  knowledge  of  the  first  reference  digit,  it  is  possible  to 
detect  all  the  received  digits.  Detection  is  facilitated  by  using  so-called  differential  encoding 
identical  to  what  was  discussed  in  Sec.  7.3.6  for  duobinary  signaling.  .  j 

To  derive  the  DPSK  error  probability,  we  observe  that  DPSK  by  means  of  differentia 
coding  is  essentially  an  orthogonal  signaling  scheme.  A  binary  1  is  transmitted  by  a  sequence 
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Figure  11. 41 

Differential  PSK 
detection. 


Delay 
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t=Tb 

Threshold 

H(/) 
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device 

Decision 


x.  1  c  /„  n,  nr  (-n  - p )  over  2 Tb  seconds  (no  transition).  Similarly,  a  binary  0  is 

i-  -«  »  <-P.  P,  over  27*  seconds  Cransitton,. 

E”of  the  pulse  sequences  used  for  binary  1  is  orthogonal  to  e.ther  of  the  pulse  sequences 
u  d  for  binary  0.  Because  no  local  carrier  is  generated  for  demodulate,  the  detect, on  ,s 

noncoherent,  with  an  effective  pulse  energy  equal  to  2 Ep  (twee  the  energy  of  pu  se  p)_The 
noncone  .  di  it  is  oniy  £  however,  the  same  as  in  noncoherent  FSK. 

Con^q'Sy.  the  performance  of  DPSK  is  3  dB  superior  to  that  of  noneoherent  FSK.  Hence 

from  Eq.  ( 1 1 .145),  we  can  write  Pb  for  DPSK  as 


Ph  =  \e'Eb,U 


(11.147) 


.  rFia  11  42)  is  superior  to  that  of  noncoherent  FSK  by  3  dB  and  is 

SnSy  simdar  ,o  jhemn.  PSK  for  E„/M  »  1  [Eq.  (1 1.39,1.  This  is  as  expected,  because 
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Figure  1  1.42 

Error  probability 
of  PSK,  DPSK, 
and  coherent 
and  noncoherent 
FSK. 


we  saw  earlier  that  DPSK  appears  similar  to  PSK  for  large  SNR.  Rigorous  derivation  ot 
Eq.  (1 1.147)  can  be  found  in  the  literature.7 


11.12  MATLAB  EXERCISES 

In  this  group  of  computer  exercises,  we  give  readers  an  opportunity  to  test  the  implementation 
and  the  performance  of  basic  digital  communication  systems. 

COMPUTER  EXERCISE  11.1:  BINARY  POLAR  SIGNALING  WITH  DIFFERENT  PULSES 

In  the  first  exercise,  we  validate  the  performance  analysis  of  the  binary  polar  signaling  presente 
in  Section  11.1.  Optimum  (matched  filter)  detection  are  always  used  at  the  receiver.  In  the  program 
Exll_l  .m,  three  different  pulses  are  used  for  polar  signaling: 

•  Rectangular  pulse  p{t)  =  u(t)  -  u(t  -  T). 

•  Half-sine  pulse  p(t)  =  sin  (7Tt/T)[u(t)  -  u(t  -  7”)]. 

•  Root-raised  cosine  pulse  with  roll-off  factor  r  =  0.5  (or  bandwidth  0.75 /T)  and  truncated  to  duration 
of  6  T. 


%  Matlab  Program  <Exll_l.m> 

%  This  Matlab  exercise  <Exll_l.m>  performs  simulation  of 
%  binary  baseband  polar  transmission  in  AWGN  channel. 

%  The  program  generates  polar  baseband  signals  using  3  different 
%  pulse  shapes  (root-raised  cosine  (r=0.5),  rectangular,  half-sine) 

%  and  estimate  the  bit  error  rate  (BER)  at  different  Eb/N  for  disp  a* 
clear ;clf; 

L=1000000;  %  Total  data  symbols  in  experiment  is  1 

%  To  display  the  pulse  shape,  we  oversample  the  signal 
%  by  factor  of  f_ovsamp=8 

f_ovsamp=8;  %  Oversampling  factor  vs  data  rate 

delay_rc=3 ; 

1 5  Generating  root-raised  cosine  pulseshape  (rolloff  factor  =  0-^ 
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prcos=rcosf It ( [  1  ]  ,  1,  f_ovsamp,  'sqrt',  0.5,  delay_rc); 
prcos=prcos  (1 :  end-f_ovsamp+l)  ; 
prcos=prcos/norm (prcos)  ; 
pcmatch=prcos ( end :  - 1 : 1 )  ; 

%  Generating  a  rectangular  pulse  shape 
prect=ones (1, f_ovsamp) ; 
prect=prect/norm (prect ) ; 
prmatch=prect (end : -1 : 1)  ; 

%  Generating  a  half -sine  pulse  shape 
psine=sin ( [0 : f_ovsamp-l] *pi/f_ovsamp) ; 
psine=psine/norm (psine) ; 
psmatch=psine (end: -1 : 1)  ; 

%  Generating  random  signal  data  for  polar  signaling 
s_data=2*round (rand (L, 1) ) -1; 

%  upsample  to  match  the  'fictitious  oversampling  rate' 

%  which  is  f_ovsamp/T  (T=l  is  the  symbol  duration) 

s  up=upsample (s_data, f_ovsamp) ; 


%  Identify  the  decision  delays  due  to  pulse  shaping 

%  and  matched  filters 

de 1 ayr c= 2  *  de 1 ay_r c *  f _ovsamp ; 

delayrt=f_ovsamp- 1 ; 

delaysn=f  __ovsamp- 1 ; 

%  Generate  polar  signaling  of  different  pulse-shaping 

xrcos=conv ( s_up , prcos ) ; 
xrect=conv ( s_up , prect ) ; 

xsine=conv (s_up, psine) ; 
t= (1:200) /f_ovsamp; 

subplot (311)  ftoxv 

f igwavel=plot (t,xrcos (delayrc/2 : delayrc/2+199 ) ) ; 

title ( ' (a)  Root-raised  cosine  pulse.'); 
set (f igwavel , ' Linewidth' , 2) ; 

subplot (312)  .  . 

f igwave2=plot (t.xrect (delayrt :delayrt+199) ) ; 
title ('(b)  Rectangular  pulse. ' ) 

set ( f igwave2 , ' Linewidth ' ,2) ; 

subplot (313)  1QQ>\ 

f igwave3=plot (t.xsine (delaysn:delaysn+199)  ; 

title  ('(c)  Half-sine  pulse.') 
xlabelt'  Number  of  data  symbol  peno  s 
set (figwave3, 'Linewidth'  ,2) ; 

; Lsine-length (xsine) , 

BER=  []  ; 

noiseq=randn(Lrcos, 1) ; 

%  Generating  the  channel  noise  (AWGN) 
for  i=l:10, 


Eb2N ( i ) =i ;  .  /  x 

Eb2N_num=10" (Eb2N (i )  /10 )  / 
Var_n=l/ (2*Eb2N_num) ; 
signois=sqrt (Var_n) ; 

awgnois=signois*noiseq, 

%  Add  noise  to  signals  at 

yrcos=xrcos+awgnois ; 


% (Eb/N  in  dB) 

%  Eb/N  in  numeral 
%1/SNR  is  the  noise  variance 
%  standard  deviation 
%  AWGN 

the  channel  output 
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yrect=xrect+awgnois (1 : Lrect) ; 
ysine=xsine+awgnois (1 : Lsine) ; 

%  Apply  matched  filters  first 

zl=conv (yrcos , pcmatch) ; clear  awgnois,  yrcos; 
z2=conv (yrect , prmatch) ; clear  yrect; 
z3=conv(ysine,psmatch) /clear  ysine; 

%  Sampling  the  received  signal  and  acquire  samples 
zl=zl (delayrc+1 : f_ovsamp : end) ; 
z2=z2 (delayrt+1 : f_ovsamp : end) ; 
z3=z3 (delaysn+1 : f_ovsamp : end) ; 

%  Decision  based  on  the  sign  of  the  samples 

decl=sign (zl (1 :L) ) ;dec2=sign (z2 (1 :L) ) ;dec3=sign (z3 (1 :L) ) ; 

%  Now  compare  against  the  original  data  to  compute  BER  for 
%  the  three  pulses 

BER= [BER;sum(abs ( s_data-decl ) ) / (2*L) . . . 
sum(abs (s_data-dec2) ) /  (2*L)  ... 

sum(abs (s_data-dec3 ) ) / (2*L) ] ; 

Q (i) =0 . 5*erf c (sqrt (Eb2N_num) ) ;  %Compute  the  Analytical  BER 

end 

figure (2) 
subplot (111) 

f igber=semilogy (Eb2N,Q, 'k-' , Eb2N, BER ( :  ,  1)  , 'b-*'  ,  .  .  . 

Eb2N, BER ( : ,2) , ' r-o' ,Eb2N,BER(: ,3) , 'm-v' ) ; 
legend ( 'Analytical' ,  ' Root-raised  cosine'  #  '  Rectangular' , ' Half-sine' ) 
xlabel ( ' E_b/N  (dB) ' ) /ylabel ( ' BER' ) 
set (f igber , ' Linewidth' ,2) ; 
figure (3 ) 

%  Spectrum  comparison 

[Psdl,  f  ]  =pwelch(xrcos/  [],[],[],'  twosided'  ,  f_ovsamp)  ; 

[Psd2  ,  f  ]  =pwelch(xrect/  [],[],[],'  twosided'  ,f_ovsamp)  ; 

[Psd3 ,  f  ]  =pwelch  (xsine ,  [],[],  []  ,  '  twosided'  ,f_ovsamp)  ; 
figpsdl= semi logy (f -f_ovsamp/2 ,  f f tshif t (Psdl) ) ; 
ylabel ( ' Power  spectral  density' ) ; 
xlabel (' frequency  in  unit  of  { 1/T} ' ) ; 

ttl=title ( ' (a)  PSD  using  root-raised  cosine  pulse  (rolloff  factor  r-0. 
set (ttl, ' FontSize' , 11) ; 
figure (4 ) 

figpsd2=semilogy (f-f_ovsamp/2, fftshift (Psd2) ) ; 

ylabel ( ' Power  spectral  density' ) ; 

xlabel (' frequency  in  unit  of  { 1/T} ' ) ; ; 

tt2=title ( ' (b)  PSD  using  rectangular  NRZ  pulse'); 

set ( tt2 , ' FontSize' , 11) ; 

figure (5) 

figpsd3= semi logy (f -f_ovsamp/2 , fftshift (Psd3) ) ; 
ylabel (' Power  spectral  density'); 
xlabel (' frequency  in  unit  of  { 1/T} ' ) ; 
tt3=title ( ' (c)  PSD  using  half-sine  pulse'); 
set ( tt3 , ' FontSize' , 11) ; 


This  program  first  shows  the  polar  modulated  binary  signals  in  a  snapshot  given  by  Fig.  1 1  ^ 

3  different  waveforms  are  the  direct  results  of  their  different  pulse  shapes.  Nevertheless,  their  bit  em 
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Figure  1 1 .43 

Snapshot  of  the 
modulated 
signals  from 
three  difference 
pulse  shapes: 

(a)  root-raised 
cosine  pulses, 
of  rolloff 
factor =0. 5; 

(b)  rectangular 
pulse; 

(c)  half-sine 
pulse  pulse. 


(b) 


ie' ) 


r  r=0.5 )'); 


Figure  1 1 .44 

BER  of  optimum 
(matched  filter) 
detection  of 
polar  signaling 
using  three  diffe¬ 
rence  pulse 
shapes: 

(a)  root-raised 
cosine  pulse 
of  roll-off 
factor  0.5; 

(b)  rectangular 
pulse; 

(c)  half-sine 
pulse. 


13.  The 
it  error 


Jte  (BER)  performances  are  identical,  as  shown  in  Fig.  I .  .44/Hus  confirms  the  results  from  Sec.  11.1 
iat  the  polar  signal  perfom^nce  is  ind^CTdem  o^t^e  ^  ^  ^  signaling  using  the 

The  progrania'so  provides  the^  ^  ^  see  ^  ^  ^  root.ralsetl  cosine  pulse 

iree  different  modulated g  •  -  signaling  exhibits  larger  main  lobe  but  smaller  overall 

m*  -  -  -  Tto>-  ■*— 
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Figure  1 1 .45 

Power  spectral 
density  of  the 
binary  polar 
transmission 
using  three 
difference  pulse 
shapes: 

(a)  root-raised 
cosine  pulse 
of  roll-off 
factor  0.5; 

(b)  rectangular 
NRZ  pulse; 

(c)  half-sine  pulse 
pulse. 


Frequency  in  1/7* 
(b) 


Frequency  in  1  IT 
(c) 
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the  same  BER  from  simulation,  the  three  different  polar  modulations  require  drastically  dillerent  amount 
of  channel  bandwidth.  


COMPUTER  EXERCISE  1 1 .2:  ON-OFF  BINARY  SIGNALING 

Next,  we  present  an  exercise  that  implements  and  tests  the  on-off  signaling  as  well  as  a  more  generic 
orthogonal  type  of  signaling.  Recall  that  on-off  signaling  is  a  special  form  of  orthogonal  binary  signaling. 
MATLAB  program  Exll_2  ,m  will  measure  the  receiver  BER  ot  both  signaling  schemes. 


%  MATLAB  PROGRAM  <Exll_2.m> 

%  This  Matlab  exercise  <Exll_2.m>  generate 
%  on/off  baseband  signals  using  root-raised  cosine 
%  pulseshape  (rolloff  factor  =0.5)  and  orthogonal  baseband 
%  signal  before  estimating  the  bit  error  rate  (BER)  at  different 
%  Eb/N  ratio  for  display  and  comparison 

clear;  elf  .  „ 

L=1000000 ;  %  Total  data  symbols  in  experiment  is  1  million 

%  To  display  the  pulse  shape,  we  oversample  the  signal 

%  by  factor  of  f__ovsamp=8 

f_ovsamp=16;  %  Oversampling  factor  vs  data  rate 

^Generating  root-raised  cosine  pulseshape  (rolloff  factor  =0.5) 
prcos=rcosflt([  11,1.  f_°vsamp,  'sqrt',  0.5,  delay_rc) ; 
prcos=prcos ( 1 : end- f _ovsamp+ 1 ) , 
prcos=prcos/norm (prcos) ; 
pcmatch=prcos (end: -1 : 1) ; 

%  Generating  a  rectangular  pulse  shape 
psinh=sin ( [0 : f_ovsamp-l] *pi/f_ovsamp) ; 
psinh=psinh/norm (psinh) , 
phmatch=psinh (end: -1:1) ; 

%  Generating  a  half -sine  pulse  shape 
psine=sin( [0:f_ovsamp-l] *2*pi/f_ovsamp) ; 
psine=psine/norm (psine) ; 

£°r  -°i“  ■is"’*iin9 

%_upiampllTto^natch  the  'fictitious  oversampling  rate' 

%  which  is  f_ovsamp/T  (T-l  is  the  symbol  duration) 
s  up=upsample (s_data, f_ovsamp) , 
s~cp=upsample (l-s_data, f_ovsamp) ; 

%  identify  the  decision  delays  due  to  pulse  shaping 
%  and  matched  filters 

delayrc=2*delay_rc*f_ovsamp; 

foe»er«;°p””rlign.lin3  of  different  pnise-sh.ping 

xrcos=conv(s  up,p^cos) ; 

xorth=conv ( s_up , ps inh) +conv ( s_cp , psine ) ; 
t= (1 :200) /f_ovsamp; 
figure (1) 

f igwavel-plot (t.xrcos  (delayrc/2  :delayrc/2+199) ) ; 
title ('  (a?  On/off  root-raised  cosine  pulse.  ), 
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set ( f igwavel , ' Linewidth ' , 2 ) ; 
subplot (212) 

f igwave2=plot (t , xorth (delayrt : delayrt+199) ) ; 
title  (Mb)  Orthogonal  modulation.') 
set ( f igwave2 , ' Linewidth' , 2 ) ; 

%  Find  the  signal  length 

Lrcos=length (xrcos) ;Lrect=length (xorth) ; 

BER=  []  ; 

noiseq=randn (Lrcos, 1) ; 

%  Generating  the  channel  noise  (AWGN) 
for  i=l : 12 , 

Eb2N (i) =i ; 

Eb2N_num=l(T  (Eb2N(i) /10)  ; 

Var_n=l/ (2*Eb2N_num) ; 
signois=sqrt (Var_n) ; 
awgnois=signois*noiseq; 

%  Add  noise  to  signals  at  the  channel  output 
yrcos=xrcos+awgnois/sqrt (2) ; 
yorth=xorth+awgnois (1 :Lrect) ; 


% (Eb/N  in  dB) 

%  Eb/N  in  numeral 
%1/SNR  is  the  noise  variance 
%  standard  deviation 
%  AWGN 


%  Apply  matched  filters  first 

zl=conv (yrcos , pcmatch) ; clear  awgnois,  yrcos; 

z2=conv(yorth,phmatch) ; 

z3=conv (yorth, psmatch) /clear  yorth; 

%  Sampling  the  received  signal  and  acquire  samples 
zl=zl (delayrc+1 :f_ovsamp:end) ; 
z2=z2 (delayrt+1 :f_ovsamp:end-f_ovsamp+l) ; 
z3=z3 (delayrt+1 : f_ovsamp:end-f_ovsamp+l) ; 

%  Decision  based  on  the  sign  of  the  samples 

decl=round ( (sign(zl (1:L) -0.5)+l) *.5) ;dec2=round( (sign (z2-z3) +1)  * 
%  Now  compare  against  the  original  data  to  compute  BER  for 
%  the  three  pulses 

BER= [BER;sum(abs (s_data-decl) ) /L  sum(abs (s_data-dec2) ) /L] ; 

Q (i) =0 . 5*erfc (sqrt (Eb2N_num/2) ) ;  %  Compute  the  Analytical  BER 

end 

figure (2) 
subplot (111) 

f igber=semilogy (Eb2N, Q, ' k- ' , Eb2N, BER ( : , 1) ,  'b-*' , Eb2N, BER ( : , 2) ,  'r-o') 
fleg=legend( 'Analytical' ,  'Root-raised  cosine  on/of f' Orthogonal 
signaling' ) ; 

fx=xlabel ( ' E_b/N  (dB) ' ) ;fy=ylabel ( 'BER' ) ; 
set (figber, 'Linewidth' ,2) ; set (f leg, ' FontSize' ,11) ; 
set (fx, ' FontSize' , 11) ; 
set (fy, ' FontSize' , 11) ; 

•s  We  can  plot  the  individual  pulses  used  for  the  binary  orthogonal 
%  signaling 
figure (3) 
subplot (111) ; 

pulse=plot ( (0 : f_ovsamp) /f_ovsamp, [psinh  0] , ' k- ' , . . . 

( 0 : f_ovsamp) / f_ovsamp, [psine  0],'k-o'); 
pleg=legend (' Half -sine  pulse',  'Sine  pulse'); 
ptitle=title (' Binary  orthogonal  signals'); 
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Figure  1 1 .46 

Waveforms  of 
the  two  pulses 
used  in 
orthogonal 
binary  signaling: 
solid  curver, 
half-sine  pulse; 
curve  with 
circles,  sine 
pulse. 


Figure  1 1 .47 

Measured  BER 
results  in 

comparison  with 
analytical  BER. 


set (pulse , ' Linewidth  ,2), 

set(pleg, 'Fontsize' ,10) ; 
setfptitle, 'FontSize'  ,11)  ; 


.  et  ,n.,iino  we  will  continue  to  use  the  root-raised  cosine  pulse  from  Computer  Exer- 
For the  on-off  s.gnahng.  wrwriloom  ^  ^  ^  ^  shapes  of  ,ength  r.  Figure  ,  ,.46 

cise  I U.  Forr°lfr  Rg.  U. 47  displays  the  measured  BER  for  both  signaling  schemes 
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Figure  1 1 .48 

Eye  diagram  of 
the  real 
(in-phase) 
component  of  the 
16-QAM 
transmission  at 
the  receiver 
matched  filter 
output. 


5 


—5 - ‘ - 1 - 1 - 

-1  -0.5  0  0.5  1 

Time,  Ts 


COMPUTER  EXERCISE  1 1 .3:  16-QAM  MODULATION 

In  this  exercise,  we  will  consider  a  more  complex  QAM  constellation  for  transmission.  The  M  -ary  QAM 
was  analyzed  in  Sec.  1 1 .6.6.  In  MATLAB  program  Exll_3  .  m,  we  control  the  transmission  bandwidth 
by  applying  the  root-raised  cosine  pulse  with  roll-off  factor  of  0.5  as  the  baseband  pulse  shape.  For  each 
symbol  period  7,  eight  uniform  samples  are  used  to  approximate  and  emulate  the  continuous  time  signals. 
Figure  1 1 .48  illustrates  the  open  eye  diagram  of  the  in-phase  (real)  part  of  the  matched  filter  output  prior 
to  being  sampled.  Very  little  IS1  is  observed  at  the  point  of  sampling,  validating  the  use  of  the  root-raised 
cosine  pulse  shape  in  conjunction  with  the  matched  filter  detector  for  ISl-free  transmission. 

%  Matlab  Program  <Exll_3.m> 

%  This  Matlab  exercise  <Exll_3.m>  performs  simulation  of 
%  QAM- 16  baseband  polar  transmission  in  AWGN  channel. 

%  Root-raised  cosine  pulse  of  rolloff  factor  =  0.5  is  used 
%  Matched  filter  receiver  is  designed  to  detect  the  symbols 
%  The  program  estimates  the  symbol  error  rate  (BER)  at  different  Eb/N 
clear; elf ; 

L=1000000;  %  Total  data  symbols  in  experiment  is  1  ntilli° 

%  To  display  the  pulse  shape,  we  oversample  the  signal 
%  by  factor  of  f_ovsamp=8 

f_ovsamp=8;  %  Oversampling  factor  vs  data  rate 

delay_rc=4 ; 

%  Generating  root-raised  cosine  pulseshape  (rolloff  factor  =  0.5) 
prcos=rcosf It ( [  11,1,  f_ovsamp,  'sqrt',  0.5,  delay_rc); 
prcos=prcos (1 : end-f_ovsamp+l) ; 
prcos=prcos/norm (prcos) ; 
pcmatch=prcos (end: -1 : 1) ; 

■s  Generating  random  signal  data  for  polar  signaling 
s_data=4* round (rand (L,  1) ) +2* round (rand (L# 1) ) -3+.  .  . 

+j* (4*round (rand (L,l) )+2*round (rand (L,l) ) -3)  ; 

%  upsample  to  match  the 
%  'oversampling  rate' 
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%  which  is  f_ovsamp/T  (T=l  is  the  symbol  duration) 
s_up=up sample  (s_data,  f_ovsamp)  ; 

%  Identify  the  decision  delays  due  to  pulse  shaping 

%  and  matched  filters 

de 1 ay r c= 2 * de 1 ay_r c * f _ovsamp ; 

%  Generate  QAM- 16  signaling  with  pulse- shaping 
xrcos=conv ( s_up , prcos ) ; 


%  Find  the  signal  length 
Lrcos=length (xrcos) ; 

SER=  []  ; 

noiseq=randn (Lrcos ,  1)  +  j *randn (Lrcos ,  1 ) ; 
Es=10;  %  symbol  energy 

%  Generating  the  channel  noise  (AWGN) 


for  i=l:9, 

Eb2N(i) = i *  2 ; 

Eb2N_num= 1 0  ~  ( Eb2N ( i ) / 1 0 )  ; 
Var_n=Es/ (2*Eb2N_num) ; 
signois=sqrt (Var_n/2) ; 
awgnois=signois*noiseq; 

%  Add  noise  to  signals  at  the 

yrcos=xrcos+awgnois ; 


% (Eb/N  in  dB) 

%  Eb/N  in  numeral 
%1/SNR  is  the  noise  variance 
%  standard  deviation 
%  AWGN 
:1  output 


%  Apply  matched  filters  first 

zl=conv (yrcos,pcmatch) /clear  awgnois,  yrcos; 

%  Sampling  the  received  signal  and  acquire  samples 

zl=zl (delayrc+1 : f_ovsamp : end) ; 


%  Decision  based  on  the  sign  of  the  samples 
dec l=sign (real (zl (1:L) ) ) +sign (real (zl(l:L))-2)+... 
sign (real (zl(l:L) )+2)+. . . 

j  * (sign(imag (zl (1 :L) ) ) +sign(imag (zl (l:L))-2)  +  .. . 
sicrn  (imag  (zl  (1  :L)  )  +2)  )  ; 

%  Now  compare  against  the  original  data  to  compute  BER  for 

%  the  three  pulses  % 

%BER= [BER; sum(abs (s_data-decl) ) / (2  L) ] 

SER= [SER; sum (s_data~=decl) /L] ; 

Q (i) =3*0 . 5*erfc (sqrt ( (2*Eb2N_num/5) /2) ) ; 

%Compute  the  Analytical  BER 


, Eb2N, SER, 'b-*' ) 


figure (1) 
subplot (111) 

f igber= semi logy (Eb2N,Q, ;k- 

ay{e ( f2  18  .99e-5  1]);  , x 

aX  S  L  i  ' Root -raised  cosine' ) ; 

error  probability. . , 

setlfigber, 'Linewidth' ,2) ; 

%  Constellation  plot 

figure (2) 
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Figure  1 1 .49 

Symbol  error 
probability  of 
1 6-QAM  using 
root-raised 
cosine  pulse  in 
comparison  with 
the  analytical 
result. 


axis  ( '  square'  ) 

xlabel('Real  part  of  matched  filter  output  samples') 
ylabel (' Imaginary  part  of  matched  filter  output  samples') 

Because  the  signal  uses  1 6-QAM  constellations,  instead  of  measuring  the  BER,  we  will  measure 
the  symbol  error  rate  (SER)  at  the  receiver.  Figure  1 1.49  illustrates  that  the  measured  SER  matches  the 
analytical  result  from  Sec.  1 1 .6  very  closely. 


The  success  of  the  optimum  QAM  receiver  can  also  be  shown  by  observing  the  real  part 
and  the  imaginary  part  of  the  samples  taken  at  the  matched  filter  output.  By  using  a  dot  to 
represent  each  measured  sample,  we  create  what  is  known  as  a  “scatter  plot,"  which  clearly 
demonstrates  the  reliability  of  the  decision  that  follows.  If  the  dots  in  the  scatter  plot  are  closely 
clustered  around  the  original  constellation  point,  then  the  decision  is  mostly  likely  going  t° 
be  reliable.  Conversely,  large  number  of  decision  errors  can  occur.  Figure  11.50  illustrates 
the  scatter  plot  from  the  measurement  taken  at  the  receiver  when  Eb/J\f  =  18  dB.  The  close 
clustering  of  the  measured  sample  points  is  a  strong  indication  that  the  resulting  SER  will  be 
very  low. 

COMPUTER  EXERCISE  1 1 .4:  NONCOHERENT  FSK  DETECTION 

To  test  the  results  of  a  noncoherent  binary  FSK  receiver,  we  provide  MATLAB  program  Exll_4 
which  assumes  the  orthogonality  of  the  two  frequencies  used  in  FSK.  As  expected,  the  measured  B 
results  in  Figure  1 1.51  matches  the  analytical  BER  results  very  well. 

%  MATLAB  PROGRAM  <Exll_4.m> 

%  This  program  provides  simulation  for  noncoherent  detection  of 
%  orthogonal  signaling  including  BFSK.  Noncoherent  MFSK  detection 
■6  only  needs  to  compare  the  magnitude  of  each  frequency  bin. 
L=100000;  % Number  of  data  symbols  in  the  simulation 
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Figure  1 1.50 

Scatter  plot  of 
the  matched  filter  ™ 
output  for  the 
16-QAM 
signaling  with 
root-raised 
cosine  pulse 
when 

Eb/JST  =  18  dB. 


-3  -2  -1  0 

Real  part  of  matched  filter  output  samples 


Figure  11.51 

BER  from 
noncoherent 
detection  of 
binary  FSK. 


a  Hat-a=round (rand  (L,  1) )  ; 

%"  Generating  random  phases  on  the  two  frequencies 

xbasel=[exp(j*2*pi*rand)  0] ; 

xbase0=[0  exp(j*2*pi*rand)]  ; 

f  Modulating  two  orthogonal  ^equencres 
,  .  aaha*Ybasel+(l-s  data) *xbase0 ; 

%m°Gene rating  noise  sequences  for  both  frequency  channels 

noisei=randn (L, 2) ; 
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noiseq=randn (L, 2) ; 

BER=  []  ; 

BER_az=  []  ; 

%  Generating  the  channel  noise  (AWGN) 
for  i=l : 12 , 

Eb2N (i) =i ;  % ( Eb/N  in  dB) 

Eb2N_num=10~ (Eb2N (i) /10)  ;  %  Eb/N  in  numeral 

Var_n=l/ (2*Eb2N_num) ;  %1/SNR  is  the  noise  variance 

signois=sqrt (Var_n) ;  %  standard  deviation 

awgnois=signois* (noisei+ j *noiseq) ;  %  AWGN  complex  channels 

%  Add  noise  to  signals  at  the  channel  output 
ychout=xmodsig+awgnois ; 

%  Non-coherent  detection 
ydiml=abs (ychout (1,1)) ; 
ydim2=abs (ychout (: ,2) ) ; 
dec= (ydiml>ydim2) ; 

%  Compute  BER  from  simulation 
BER=  [BER;  sum (dec~=s_data) /L]  ; 

%  Compare  against  analytical  BER. 

BER_az= [BER_az;  0 . 5* exp ( -Eb2N_num/2) ] ; 


end 

f igber=semilogy (Eb2N,  BER_az ,  '  k-  '  ,  Eb2N,  BER,  '  k-o'  )  ; 
set ( f igber , ' Linewidth ' ,2) ; 

legend ( 'Analytical  BER' ,  'Noncoherent  FSK  simulation'); 
fx=xlabel ( ' E_b/N  (dB) ' ) ; 
fy=ylabel ( ' Bit  error  rate' ) ; 

set (fx, ' FontSize' , 11) ;  set (fy, ' Fontsize' , 11) ; 


COMPUTER  EXERCISE  1 1 .5:  NONCOHERENT  DETECTION  OF  BINARY 
DIFFERENTIAL  PSK 

To  test  the  results  of  a  binary  differential  phase  shift  keying  system,  we  present  MATLAB  program 
Exll_5  •  m.  As  in  previous  cases,  the  measured  BER  results  in  Figure  1 1 .52  matches  the  analytical  BER 
results  very  well. 

%  MATLAB  PROGRAM  <Exll_5.m> 

%  This  program  provides  simulation  for  differential  detection  of 
%  binary  DPSK.  Differential  detetion  only  needs  to  compare  the 
%  successive  phases  of  the  signal  samples  at  the  receiver 
% 

clear; elf 

L=1000000 ;  %Number  of  data  symbols  in  the  simulation 

s_data=round ( rand ( L , 1 ) ) ; 

%  Generating  initial  random  phase 
initphase= [2*rand] ; 

%  differential  modulation 
s_denc=mod(cumsum( [0;s_data] ) ,2) ; 

%  define  the  phase  divisible  by  pi 
xphase=initphase+s_denc ; 
clear  s_denc; 

%  modulate  the  phase  of  the  signal 
xmodsig=exp(j*pi*Xphase) ;  clear  xphase; 


*»—  — 


11.12  MATLAB  Exercises  701 


els 


Figure  1 1 .52 

Analytical  BER 
results  from 
noncoherent 
detection  of 
binary  DPSK 
simulation  (round 
points). 


rogram 

alBER 


Lx=length(xmodsig) ; 

%  Generating  noise  sequence 

noiseq=randn (Lx, 2 ) ; 

BER=  []  ; 

BER_az=[]; 

%  Generating  the  channel  noise  ( AWGN 


EOr  i  %  (Eb/N  in  dB) 

Eb2N (l) =i;  . 

Eb2N  num=10~(Eb2N(i)/10);  %  Eb/N  in  numeral 

Var  n=l/ (2*Eb2N_num) ;  %1/SNR  is  the  noise  variance 

.  -  .  ivar  n} •  %  standard  deviation 

signoi  q  -  '  %  AWGN  complex  channels 

awgnois=signois* (noiseq 

%  Add  noise  to  signals  at  the  channel  output 


ychout=xmodsig+awgnois ; 

%  Non-coherent  detection 

yphase=angle (ychout ) ; 

clear  ychout; 
ydf dec=dif f (yphase) /pi; 
clear  yphase; 
dec= (abs (ydf dec) >0.5) ; 
clear  ydf dec; 

%  Compute  BER  from  simulation 
BER= [BER;  sum(dec~=s_data) /L] ; 

%  Compare  against  analytical  BER- 
BER  az= [BER_az ;  0.5*exp(-Eb2N_num) ] ; 


%f ind  the  channel  output  phase 
%calculate  phase  difference 
%make  hard  decisions 


end 

%  now  plot  the  results 
f igber=  semi logy ( Eb2N , BER_az , 
axis (  [1  11  .  99e-5  1] ) ; 
set (figber, 'Linewidth' ,2) ; 


Eb2N, BER, 'k-o' ) ; 


I 
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legend ( 'Analytical  BER' ,  'Binary  DPSK  simulation'); 
fx=xlabel ( ' E_b/N  (dB) ' ) ; 
fy=ylabel ( ' Bit  error  rate'); 

set (fx, ' FontSize' , 11) ;  set (fy, ' Fontsize' ,  11)  ; 
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PROBLEMS 


11.1-1  In  a  baseband  binary  transmission,  binary  digits  are  transmitted  by  using 

A  •  p(t)  0  <t  <Tb  sending  1 
-A  •  p(t)  0  <  t  <  Tb  sending  0 

The  bit  duration  is  Tb  second  and  the  pulse  shape  is 

Pit)  =  1  -  — .  0<t<Tb 

Tb 

0  and  1  are  equally  likely.  The  channel  noise  is  AWGN  with  power  spectrum  A/72. 

(a)  Find  the  optimum  receiver  filter  h(t)  for  sampling  instant  t m  =  Tb  and  sketch  hit)  •" the 
time  domain. 

(b)  Determine  the  probability  of  error  as  a  function  of  the  Eb/M  ratio. 

(c)  Compare  the  results  in  parts  (a)  and  (b)  for  the  case  when  the  pulse  shape  is 

P(')  =  1  -  ~  0  <t<Tb 
Tb 


Figure 

P.11.1-3 


Figure 

P.11.1-4 


Figure 
P.  11 .1-3 


Figure 

P.11.1-4 
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11.1- 2  Prove  the  following  forms  of  the  Cauchy-Schwarz  inequality: 

(a)  Let  x  and  y  be  real-valued  random  variables.  Then 

|E(xy)|2<£{|x|2)E)|y|2} 

with  equality  if  and  only  if  y  =  A.  •  x. 

(b)  Let  jc  andy  be  real-valued  vectors  of  the  same  size.  Then 

\xTy\2  <  II* II2  •  llyll2 

with  equality  if  and  only  ify  =  kx. 

11.1- 3  The  so-called  integrate-and-dump  filter  is  shown  in  Fig.  PI  1.1-3.  The  feedback  amplifier  is  an 

ideal  integrator.  The  switch  s}  closes  momentarily  and  then  opens  at  the  instant  /  =  Th.  thus 
dumping  all  the  charge  on  C  and  causing  the  output  to  go  to  zero.  The  switch  ,v2  samples  the 
output  immediately  before  the  dumping  action. 

(a)  Sketch  the  output  p0(t )  when  a  square  pulse  p(t)  is  applied  to  the  input  ot  this  filter. 

(b)  Sketch  the  output  po(0  of  the  filter  matched  to  the  square  pulse  p(t). 

(c)  Show  that  the  performance  of  the  integrate-and-dump  filter  is  identical  to  that  of  the  matched 
filter;  that  is,  show  that  p  in  both  cases  is  identical. 


Dump  switch 


p(t ) 

J 

-K— 

Sample  switch 

>— *• — oTSo— ► — 

T„  /— ►  1 

^  pjD 

1 1  1-4  An  alternative  to  the  optimum  filter  is  a  suboptimum  filter,  where  we  assume  a  particular  filter 
‘  form  and  adjust  its  parameters  to  maximize  p.  Such  filters  are  inferior  to  the  opt.mum  filter  but 

m  Fofa^tangilaJpXpfr)  of  height  A  and  width  Th  at  the  input  (Fig.  PI  1.1-4)  determine 
„  if  ins,ead  of  the  matched  filter,  a  one-stage  RC  filter  with  H{w)  =1/(1  +j<oRc)  is  used^ 
Assume  a  white  Gaussian  noise  of  PSD  Af/2.  Show  that  the  optimum  performance  is  achieved 

when  1  /RC  =  1-26/7),. 

Hint:  Set  dp2/dx  =  0  (*  =  Tb/RC). 


(a) 
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11.2-1  In  coherent  detection  of  a  binary  PPM,  a  half-width  pulse  po(t )  of  is  transmitted  with  different 
delays  for  binary  digit  0,  and  1  over  0  <  t  <  T^.  Note  that 


/?0(0  =  u(t)  -  u(t  -  Tfy/2) 


The  binary  PPM  transmission  is  to  simply  transmit 


po(t)  if  0  is  sent 

P0(t  -  Th/2)  if  1  is  sent 


The  channel  noise  is  additive,  white,  and  Gaussian,  with  spectrum  level  J\f /2. 

(a)  Determine  the  optimum  receiver  architecture  for  this  binary  system.  Sketch  the  optimum 
receiver  filter  response  in  time  domain. 

(b)  If  P[ 0]  =  0.4  and  P[l]  =  0.6,  find  the  optimum  threshold  and  the  resulting  receiver  bit 
error  rate. 

(c)  The  receiver  was  misinformed  and  believes  that  P[0]  =  0.5  =  P[  1].  It  hence  designed 
a  receiver  based  on  this  information.  Find  the  true  probability  of  error  when,  in  fact,  the 
actual  prior  probabilities  are  P[0)  =  0.4  and  P[  1]  =  0.6.  Compare  this  result  with  the 
result  in  part  (b). 

11.2-2  In  coherent  detection  of  binary  chirp  modulations,  the  transmission  over  0  <  t  <  is 


A  cos  (atf2  -f  #o)  if  0  is  sent 
A  cos  (cr i r2  +  0\)  if  1  is  sent 


The  channel  noise  is  additive,  white,  and  Gaussian,  with  spectrum  J\f/2.  The  binary  digits  are 
equally  likely. 

(a)  Design  the  optimum  receiver. 

(b)  Find  the  probability  of  bit  error  for  the  optimum  receiver  in  part  (a). 

In  coherent  schemes,  a  small  pilot  is  added  for  synchronization.  Because  the  pilot  does  not 
carry  information,  it  causes  degradation  in  P Consider  coherent  PSK;  use  the  following  two 
pulses  of  duration  T b  each: 


where  Am  sin  (oct  is  the  pilot.  Show  that  when  the  channel  noise  is  white  Gaussian, 


Hint:  Use  Eq.  (1 1.25b). 


11.2-4  For  a  polar  binary  communication  system,  each  error  in  the  decision  has  some  cost.  Supf 
that  when  m  =  1  is  transmitted  and  we  read  it  as  m  =  0  at  the  receiver,  a  quantitative  pen 
or  cost,  Cjo  is  assigned  to  such  an  error,  and,  similarly,  a  cost  Coi  is  assigned  when  m  - 
transmitted  and  we  read  it  as  m  =  1.  For  the  polar  case,  where  Pm(0)  =  Pm(D  5=5  S 


=  1.  For  the  polar  case,  where  Pm(0)  =: 


system,  each  error  in  the  decision  has  some  cost.  SUPP^ 
i  we  read  it  as  m  =  0  at  the  receiver,  a  quantitative  Pena^  ^ 
error,  and,  similarly,  a  cost  Coi  is  assigned  when  m  ' 

1  Pnr  fhp  1  'I  r  r>non  ,  iiUnni  P  -  Pm  (11  —  II  •* 
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that  for  white  Gaussian  channel  noise  the  optimum  threshold  that  minimizes  the  overall  cost  is 
not  0  but  is  a0 ,  given  by 


Hint:  See  Hint  for  Prob.  8.2-8. 

11.2-5  For  a  polar  binary  system  with  unequal  message  probabilities,  show  that  the  optimum  decision 
threshold  aQ  is  given  by 


_  A  P m(0)Q)i 

a°  “  7  n  pm(i)C|0 


where  Cq\  and  Cio  are  the  cost  of  the  errors  as  explained  in  Prob.  1 1 .2-4.  and  P  m(0)  and  Pm  ( 1 ) 
are  the  probabilities  of  transmitting  0  and  1.  respectively. 

Hint:  See  Hint  for  Prob.  8.2-8. 

1 1.2-6  For  4-ary  communication,  messages  are  chosen  from  any  one  of  four  message  symbols,  m ,  = 
00,  m2  =  01,  m3  =  10,  and  m4  =  11.  which  are  transmitted  by  pulses  ±p(t),  0.  and  ±3p(r). 
respectively.  A  filter  matched  to  p(t)  is  used  at  the  receiver.  Denote  the  energy  of  p(t)  as  Ep. 
The  channel  noise  is  AWGN  with  spectrum  A/72. 

(a)  If  r  is  the  matched  filter  output  at  tm,  plot  pr(r|m/)  (00,  01. 1 0.  and  1 1 ).  for  the  four  message 
symbols  and  if  all  message  symbols  are  equally  likely. 

(b)  To  minimize  the  probability  of  detection  error  in  part  (a),  determine  the  optimum  decision 
thresholds  and  the  corresponding  error  probability  Pe  as  a  function  of  the  average  symbol 
energy  to  noise  ratio. 

11  2-7  Binary  data  is  transmitted  by  using  a  pulse  pU )  for  0  and  a  pulse  ypU)  tor  1  Let  y  > >  1  Find 
that  the  optimum  receiver  for  this  case  is  a  filter  matched  to  pU)  with  a  detection  threshold 
as  shown  in  Fig.  PI  1 .2-7.  Determine  the  error  probability  Ph  of  this  receiver  as  a  function  of 
Eb/N  if  0  and  1  are  equiprobable. 


Figure 

P.11.2-7 


Decision: 

0  if  r  <  threshold 
1  if  r  >  threshold 


a  raised-cosine  roll-off  pulse  p(t)  with  roll-off  factor  0.2  is  used  for 
ion.  The  ideal,  low-pass  channel  has  a  bandwidth  of/o  =  5000  Hz. 


11.2-8  In  a  binary  transmission,  a  r; 
baseband  polar  transmission. 


(a)  If  the  channel  noise  is  additive,  white,  and  Gaussian,  with  spectrum  AT/  2,  find  the  optimum 
receiver  filter  and  sketch  its  frequency  response. 

(b)  If  the  channel  noise  is  Gaussian  with  spectrum 


find  the  optimum  receiver  filter  and  sketch  its  frequency  response. 
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11.3-1  In  an  FSK  system.  RF  binary  signals  are  transmitted  as 


0  :  \/2sin  (nt/T^c os[nv  —  (Aco/2)]t  0  <t  <Tb 

1  :  v^sin  (nt/Tb)  cos  [coc  +  (Aa>/2)]r  0  <t<Tb 

The  channel  noise  is  additive,  white,  and  Gaussian.  Let  the  binary  inputs  be  equally  likely. 

(a)  Derive  the  optimum  coherent  receiver  and  the  optimum  threshold. 

(b)  Find  the  minimum  probability  of  bit  error. 

(c)  Is  it  possible  to  find  the  optimum  A co  to  minimize  the  probability  of  bit  error? 

11.4-1  Consider  four  signals  in  the  time  interval  (0,  T ): 


p0(t)  -  u(t)  -  u(t  -  T) 

p\ (t )  =  sin  (2 nt/T)[u(t)  -  «(/  -  7’)] 

P2(t)  =  sin  (nt/T)[u(t)  -  «(/  -  7)] 

p3(r)  =  cos(^r/7)[«(r)  -  «(/  -  T )] 

Apply  the  Gram-Schmidt  procedure  and  find  a  set  of  orthonormal  basis  signals  for  this  signal 
space.  What  is  the  dimension  of  this  signal  space? 

11.4-2  The  basis  signals  of  a  three-dimensional  signal  space  are  given  by  (p\(t)  =  p(U>  VlW  = 
P(t  ~  T0),  and  v>3  (r)  =  p(l  -  2 T0),  where 


PU)  =  J^sin  [«(0  -  «(*  -  To)\ 


(a)  Sketch  the  waveforms  of  the  signals  represented  by  (1,  1,  1 ),  ( — 2,  0,  1),  (1/3,  2,  —j)’ 
and  (—5,  —1,  2)  in  this  space. 

(b)  Find  the  energy  of  each  signal  in  part  (a). 


11.4-3  Repeat  Prob.  11.4-2  if 


<Pl(0  = 


1 

vz 


,  ,  1  2  n  I  2  2  jr 

tP2(0  =./  — cos  — /  <p3(t)  =  ./  —  cos  — r 


To  T0 


T„ 


0  <t  <T0 


11.4-4  For  the  three  basis  signals  given  in  Prob.  1 1 .4-3,  assume  that  a  signal  is  written  as 


x(t)=  1  -F  2  sin3  ^2^ 

(a)  Find  the  best  approximation  of  x(t)  using  the  three  basis  signals  in  terms  of  minimum  erR,r 
energy.  What  is  the  minimum  approximation  error  energy? 

(b)  Adding  another  basis  signal 


<p4(/)  = 


7T 


0<t<T() 


and  find  the  reduction  of  minimum  approximation  error  energy. 
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11.4- 5  Assume  that  p(t)  is  as  in  Prob.  11.4-2  and 

<Pk(t)  =  Pl*  “  (*  —  1)7„]  k  =  1,2,3. 4,5 

(a)  Sketch  the  signals  represented  by  (—1,  2,  3,  1,  4),  (2,  1,  —4,  —4,  2),  (3,  —2,  3,  4,  D.and 
(-2,  4,  2,  2,  0)  in  this  space. 

(b)  Find  the  energy  of  each  signal. 

(c)  Find  the  angle  between  all  pairs  of  the  signals. 

Hint :  Recall  that  the  inner  product  between  vectors  a  and  b  is  related  to  the  angle  0  between 
the  two  vectors  via  <  a,  b  >=  ||fl||  •  1 1^1 1  cos  (0). 

11.5- 1  Assume  that  p(t)  is  as  in  Prob.  1 1 .4-2  and 

sk(t)  =  p[t  -  (k  -  \)T()]  *  =  1,2,3, 4,5 

When  s*( t)  is  transmitted,  the  received  signal  under  noise  nM(f)  is 
y(0  =  tyO)  +  nw(0  0</<57„ 

Assume  also  that  the  noise  nw(t)  is  white  Gaussian  with  spectrum  M /2. 

(a)  Define  a  set  of  basis  functions  for  y(/)  such  that 

£{|y(0-£ym(')l2}=o 

(b)  Characterize  the  random  variable  y ,•  when  $*(f)  is  transmitted. 

(c)  Determine  the  joint  probability  density  function  of  random  variable  {y, . y5)  when 

sji  (t)  is  transmitted. 

11.5- 2  For  a  certain  stationary  Gaussian  random  process  x(f),  it  is  given  that  «x(r)  =  e"r  .  Determine 

the  joint  PDF  of  RVs  x(/).  x(r  +  0.5).  x(r  +  1 ).  and  x(r  +  2). 

11.5- 3  A  Gaussian  noise  is  characterized  by  its  mean  and  its  autocorrelation  function.  A  stationary 

Gaussian  noise  x(0  has  zero  mean  and  autocorrelation  function  R% (r). 

(a)  If  x(f)  is  the  input  to  a  linear  time-invariant  system  with  impulse  response  h(t).  determine 
the  mean  and  the  autocorrelation  function  of  the  linear  system  output  y(/). 

(b)  If  x(r)  is  the  input  to  a  linear  time-vaiying  system  whose  output  is 

/oo 

h(t,  z)x(z)dx 

-OO 

show  what  kind  of  output  process  this  generates  and  determine  the  mean  and  the  autocorrelation 
function  of  the  linear  system  output  y(/). 

11.5-4  Determine  the  output  PSD  of  the  linear  system  in  Prob.  11.5-3a. 

11.5-5  Determine  the  output  PSD  of  the  linear  system  in  Prob.  11.5-3b. 

11.6-1  Consider  the  preprocessing  of  Fig.  1 1.17.  The  channel  noise  n*(0  is  white  Gaussian. 

,a)  Find  the  signal  energy  of  r(f)  and  q (/>  over  the  finite  time  interval  |0,  TM }. 

,b)  Prove  that  although  r(r)  and  q(r)  are  not  equal,  both  contain  all  the  useful  signal  content. 


I" 
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(c)  Show  that  the  joint  probability  density  function  of  (qj .  q2  ..  .qw).  under  the  condition  that 
Sfr  ( t )  is  transmitted,  can  be  written  as 

p*{q)=wfiwexp(-"q~Sk]]2/Af) 

11.6-2  Consider  an  additive  white  noise  channel.  After  signal  projection,  the  received  N  x  1  signal 
vector  is  given  by 


q  =  Si  4-  n, 

when  message  m/  is  transmitted,  n  has  joint  probability  density  function 


Find  the  (MAP)  detector  that  can  minimize  the  probability  of  detection  error. 

Follow  the  derivations  of  optimum  detector  for  AWGN,  derive  the  optimum  receiver 
structure  for  this  non-Gaussian  white  noise  channel. 

Show  how  the  decision  regions  are  different  between  Gaussian  and  non-Gaussian  noises 
in  a  two-dimensional  (N  =  2)  signal  space. 

11.6- 3  A  binary  source  emits  data  at  a  rate  of  400,000  bit/s.  Multiamplitude  shift  keying  (PAM)  with 

M  =  2,  16,  and  32  is  considered.  In  each  case,  determine  the  signal  power  required  at  the 
receiver  input  and  the  minimum  transmission  bandwidth  required  if  Sn(&0  =  10  s  and  the  bit 
error  rate  P is  required  to  be  less  than  10-6. 

11.6- 4  Repeat  Prob.  11.6-3  for  M  -ary  PSK. 

11.6- 5  A  source  emits  M  equiprobable  messages,  which  are  assigned  signals  sj ,  $2*  •••  *  sM,ass^own 

in  Fig.  PI  1 .6-5.  Determine  the  optimum  receiver  and  the  corresponding  error  probability  PeM 
for  an  AWGN  channel  as  a  function  of  E^/M. 


(a) 

(b) 

(c) 


Figure 

P.11.6-5 

rk  r 

.  ^1-  ..  .  i 

— _ , 

T  i 

f  T  a — n 

A 

V 

SM/2 

— v— 

SM 

1 1 .6- 6  A  source  emits  eight  equiprobable  messages,  which  are  assigned  QAM  signals  s\ «  s>2»  •  *  *  *  ^ 

as  shown  in  Fig.  PI  1.6-6, 

(a)  Find  the  optimum  receiver  for  an  AWGN  channel. 

(b)  Determine  the  decision  regions  and  the  error  probability  Pe\i  of  the  optimum  receiver 
a  function  of  £/,. 

11.6- 7  Prove  that  tor  £/,/ J\f  1  and  M  2,  the  error  probability  approximation  ot  Eq  (  ^ 

for  MPSK  holds. 


Figure 

P.11.6-6 


Figure 

P.11.6-12 
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Figure 
P.  11 .6-6 


figure 

P.11.6-12 
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r 

•*2 
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•*6 
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•*8 

11.6- 8  Use  the  approximation  of  Eq.  ( 1 1 . 1 09b)  for  1 6  PSK  to  compare  the  symbol  error  probability  of 

16-QAM  and  16-PSK.  Show  approximately  how  many  decibels  of  E^/Af  (SNR)  loss  16-PSK 
incurs  versus  16-QAM  (by  ignoring  the  constant  difference  in  front  of  the  Q  function). 

11.6- 9  Compare  the  symbol  error  probabilities  of  16-PAM,  16-PSK,  and  16-QAM.  Sketch  them  as 

functions  of  E^/Af. 

11.6- 10  Show  that  for  MPSK,  the  optimum  receiver  of  the  form  in  Fig.  1 1.19a  is  equivalent  to  a  phase 

comparator.  Assume  all  messages  equiprobable  and  an  AWGN  channel. 

11.6- 11  A  ternary  signaling  has  three  signals  for  transmission: 

m0  :  0  m\  :  2 pit)  rnj  :  —2 pit) 

(a)  If  P(m0)  =  P(tn\ )  =  P(rri2)  =  1/3,  determine  the  optimum  decision  regions  and  Pe\i  of 
the  optimum  receiver  as  a  function  of  E.  Assume  an  AWGN  channel. 

(b)  Find  P€m  as  a  function  of  E/Af. 

(c)  Repeat  parts  (a)  and  (b)  if  P(m0)  =  1/2  and  P(m\ )  =  Pirn 2)  =  0.25. 

11  6-12  A  16-ary  signal  configuration  is  shown  in  Fig.  PI  1.6-12.  Write  the  expression  (do  not  evaluate 
various  integrals)  for  the  PeM  of  the  optimum  receiver,  assuming  all  symbols  to  be  equiprobable. 
Assume  an  AWGN  channel. 


11.6-13  A  five-signal  configuration  in  a  two-dimensional  space  is  shown  in  Fig.  PI  1.6-13. 

(a)  Choose  the  <p\  (t)  =  W^cos  a>ct  and  <n(t)  =  s/VZ,  sin  and  sketch  the  waveforms 
of  the  five  signals. 

(b)  In  the  signal  space,  sketch  the  optimum  decision  regions,  assuming  an  AWGN  channel. 


I '  I 


710  PERFORMANCE  ANALYSIS  OF  DIGITAL  COMMUNICATION  SYSTEMS 

(c)  Determine  the  error  probability  PeM  as  a  function  of  E  of  the  optimum  receiver. 


Figure 

P.11.6-13 


-+- 

d 

2 


11.6-14  A  16-point  QAM  signal  configuration  is  shown  in  Fig.  PI  1 .6-14.  Assuming  that  all  symbols  are 
equiprobable,  determine  the  error  probability  PeM  as  a  function  of  E ^  of  the  optimum  receiver 
for  an  AWGN  channel.  Compare  the  performance  of  this  scheme  with  the  result  of  rectangular 
16-point  QAM  in  Sec.  1 1-6.6. 


Figure 

P.11.6-14 


11.7-1  The  vertices  of  an  N -dimensional  hypercube  are  a  set  of  2N  signals 


N 


sk (0  = 


;= i 


-1. 


where  [<p\  (f),  (p2(t),  . . . ,  ^v(O)  is  a  set  of  N  orthonormal  signals,  and  ajy  is  either  1  or 
Note  that  all  the  N  signals  are  at  a  distance  of  sfNd/2  from  the  origin  and  form  the  vertices 
the  A -dimensional  cube. 

(a)  Sketch  the  signal  configuration  in  the  signal  space  for  N  =  1 , 2,  and  3. 

(b)  For  each  configuration  in  part  (a),  sketch  one  possible  set  of  waveforms. 

(c)  If  all  the  2n  symbols  are  equiprobable,  find  the  optimum  receiver  and  ^annel- 


probability  P€m  of  the  optimum  receiver  as  a  function  of  E assuming  an 


11.7-2  An  orthogonal  signal  set  is  given  by 

sk{t)  =  >/E(pk(t) 


k=  1,  2 . N 
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A  biorthogonal  signal  set  is  formed  from  the  orthogonal  set  by  augmenting  it  with  the  negative 
of  each  signal.  Thus,  we  add  to  the  orthogonal  set  another  set 

5_*(f)  =  -y/E  <pk(t) 

This  gives  2 N  signals  in  an  N -dimensional  space.  Assuming  all  signals  to  be  equiprobable 
and  an  AWGN  channel,  obtain  the  error  probability  of  the  optimum  receiver.  How  does  the 
bandwidth  of  the  biorthogonal  set  compare  with  that  of  the  orthogonal  set? 

11.8- 1  (a)  What  is  the  minimum  energy  equivalent  signal  set  of  a  binary  on-off  signal  set? 

(b)  What  is  the  minimum  energy  equivalent  signal  set  of  a  binary  FSK  signal  set? 

(c)  Use  geometrical  signal  space  concepts  to  explain  why  the  binary  on-off  and  the  binary 
orthogonal  sets  have  identical  error  probabilities  and  why  the  binary  polar  energy  requirements 
are  3  dB  lower  than  those  of  the  on-off  or  the  orthogonal  set. 

11.8- 2  A  source  emits  four  equiprobable  messages,  m  i ,  m3,  and  mq,  encoded  by  signals  s\  (f ),  52(0* 

53(0*  and  54(f),  respectively,  where 

5 1  (/ )  =  20>/2  sin  jr-/ 

52(0  =  0 

53(0  =  lOv^  cos  J^t 
54(f)  =  — 10>/2  cos  j2-t 

Each  of  these  signal  durations  is  0  <t  <TM  and  is  zero  outside  this  interval.  The  signals  are 
transmitted  over  AWGN  channels. 

(a)  Represent  these  signals  in  a  signal  space. 

(b)  Determine  the  decision  regions. 

(c)  Obtain  an  equivalent  minimum  energy  signal  set. 

(d)  Determine  the  optimum  receiver. 

11.8- 3  A  quaternary  signaling  scheme  uses  four  waveforms, 

5i  (f)  =  4^i(f) 

52(f)  =  2<p\  (0  +  2  <P2(0 
53(f)  =  -2<p\{t)  -2(p2(t) 

54(f)  =  -4  <?2(0 

where  q>\ (r)  and  <p2(0  are  orthonormal  basis  signals.  All  the  signals  are  equiprobable.  and  the 
channel  noise  is  white  Gaussian  with  PSD  Sn(tu)  =  10 

(a)  Represent  these  signals  in  the  signal  space,  and  determine  the  optimum  decision  regions. 

(b)  Compute  the  error  probability  of  the  optimum  receiver. 

(c)  Find  the  minimum  energy  equivalent  signal  set. 

(d)  Determine  the  amount  of  average  energy  reduction  of  the  minimum  energy  equivalent 
signal  set  is  transmitted. 

11.8- 4  An  M  =  4  orthogonal  signaling  signaling  uses  JE  •  -VE  ■  41  ■  <P3 (/),  and 

71  •  <p4(t)  in  its  transmission. 
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(a)  Find  the  minimum  energy  equivalent  signal  set. 

(b)  Sketch  the  minimum  energy  equivalent  signal  set  in  three  dimensional  space. 

(c)  Determine  the  amount  of  average  energy  reduction  by  using  the  minimum  energy  equivalent 
signal  set. 

11.8- 5  A  ternary  signaling  scheme  (Af  =  3)  uses  the  three  waveforms 

si  (t)  =  [u(t)  -  u(t  -  To/3)] 

52W  =  u{t)  -  u(t  -  7o) 

53(0  =  -[U(t  -  27/3)  -  u(t  -  7b)] 

The  transmission  rate  is  l/7b  =  200  x  10^  symbols  per  second.  All  three  messages  are 
equiprobable,  and  the  channel  noise  is  white  Gaussian  with  PSD  Sn(<*0  =  2  x  10 

(a)  Determine  the  decision  regions  of  the  optimum  receiver. 

(b)  Determine  the  minimum  energy  signal  set  and  sketch  the  waveforms. 

(c)  Compute  the  mean  energies  of  the  signal  set  and  its  minimum  energy  equivalent  set,  found 
in  part  (b). 

11.8- 6  Repeat  Prob.  11.8-5  if  P(m{)  =  0.5,  P(m2)  =  0.25,  and  P(m3)  =  0.25. 

11.8- 7  A  binary  signaling  scheme  uses  the  two  waveforms 

0.001  \  .  v  A  /f  -  0.001  ^ 

*l<,,  =  n(~o^r)  “d  *(,)— Ar5om/ 

(See  Chapter  3  for  the  definitions  of  these  signals.)  The  signaling  rate  is  1000  pulses  per 
second.  Both  signals  are  equally  likely,  and  the  channel  noise  is  white  Gaussian  with 
S =  2.  x  10“4. 

(a)  Determine  the  minimum  energy  equivalent  signal  set. 

(b)  Determine  the  error  probability  of  the  optimum  receiver. 

(c)  Use  a  suitable  orthogonal  signal  space  to  represent  these  signals  as  vectors. 

Hint:  Use  Gram-Schmidt  orthogonal ization  to  determine  the  appropriate  basis  signals  i0 
and  (p2(t). 

11.10- 1  In  a  binary  transmission  with  messages  mo  and  m\ ,  the  costs  are  defined  as 

Qx)  =  Cil  =  l  and  Q)i  =  Cio  =  4 

The  two  messages  are  equally  likely.  Determine  the  optimum  Bayes  receiver. 

11.10- 2  In  a  binary  transmission  with  messages  mo  and  mj ,  the  cost  are  defined  as 

0)0  =  0,  =0  and  Q)i  =  C10  =  C 

The  probability  of  mo  is  1/3  and  the  probability  of  mj  is  2/3. 

(a)  Determine  the  optimum  Bayes  receiver. 

(b)  Determine  the  minimum  probability  of  error  receiver. 
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(c)  Determine  the  maximum  likelihood  receiver. 

(d)  Compare  the  probability  of  error  between  the  two  receivers  in  parts  (b)  and  (c). 

11.11-1  Plot  and  compare  the  probabilities  of  error  for  the  noncoherent  detection  of  binary  ASK.  binary 
FSK,  and  binary  DPSK. 


O  SPREAD  SPECTRUM 
Z  COMMUNICATIONS 


In  traditional  digital  communication  systems,  the  design  of  baseband  pulse-shaping  and 
modulation  techniques  aims  to  minimize  the  amount  of  bandwidth  consumed  by  the  mod¬ 
ulated  signal  during  transmission.  This  principal  objective  is  clearly  motivated  by  the  desire 
to  achieve  good  spectral  efficiency  and  thus  to  conserve  bandwidth  resource.  Nevertheless, 
a  narrowband  digital  communication  system  exhibits  two  major  weaknesses.  First,  its  con¬ 
centrated  spectrum  makes  it  an  easy  target  for  detection  and  interception  by  unintended  users 
(e.g.,  battlefield  enemies  and  unauthorized  eavesdroppers).  Second,  its  narrow  band,  having 
very  little  redundancy,  is  more  susceptible  to  jamming,  since  even  a  partial  band  jamming  can 
ruin  the  signal  reception. 

Spread  spectrum  technologies  were  initially  developed  for  the  military  and  intelligence 
communities  to  overcome  the  two  aforementioned  shortcomings  against  interception  and  jam¬ 
ming.  The  basic  idea  was  to  expand  each  user  signal  to  occupy  a  much  broader  spectrum  than 
necessary.  For  fixed  transmission  power,  a  broader  spectrum  means  both  lower  signal  po^vr 
level  and  higher  spectral  redundancy.  The  low  signal  power  level  makes  the  communication 
signals  difficult  to  detect  and  intercept,  whereas  high  spectral  redundancy  makes  the  signa 
more  resistant  to  partial  band  jamming,  whether  intentional  or  unintentional. 

There  are  two  dominant  spread  spectrum  technologies:  frequency  hopping  spread  spectrum 
(FHSS)  and  direct  sequence  spread  spectrum  (DSSS).  In  this  chapter,  we  provide  detaie 
descriptions  on  both  systems. 


12.1  FREQUENCY  HOPPING  SPREAD  SPECTRUM 
(FHSS)  SYSTEMS 


The  concept  of  frequency  hopping  spread  spectrum  (FHSS)  is  in  fact  quite  simple  and  easy 
to  understand.  Each  user  can  still  use  its  conventional  modulation.  The  only  difference 1  ^ 

now  the  carrier  frequency  can  vary  over  regular  intervals.  When  each  user  can  vary  its c 
frequency  according  to  a  predetermined,  pseudorandom  pattern,  its  evasive  signal  ettec 
occupies  a  broader  spectrum  and  becomes  harder  to  intercept  and  jam.  tw0 

The  implementation  of  an  FHSS  system  is  shown  in  Fig.  12.1.  If  we  first  ignore  t 
frequency  converters,  this  system  is  no  different  from  a  simple  digital  communication  sy 
with  an  FSK  modulator  and  a  demodulator.  The  only  difference  in  this  FHSS  system 
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Figure  12.1 

Frequency 
hopping  spread 
spectrum  system. 


FHSS  FSK  transmitter  _ Receiver 


the  carrier  frequency  hopping  controlled  at  the  transmitter  by  the  pseudonoise  (PN)  generator. 
To  track  the  hopping  carrier  frequency,  the  receiver  must  utilize  the  same  PN  generator  in 
synchronization  with  the  transmitter  PN  generator. 

We  note  that  most  FHSS  signals  adopt  binary  or  Af-ary  FSK  modulations  instead  of  the 
more  efficient  PAM,  PSK,  or  QAM.  The  motivation  for  choosing  FSK  stems  from  its  ability 
to  utilize  the  less  complex  noncoherent  detection.  In  contrast,  coherent  detection  is  generally 
needed  for  PAM,  PSK.  and  QAM  modulations.  Due  to  the  PN  hopping  pattern,  coherent 
detection  would  require  the  receiver  to  maintain  phase  coherence  with  the  transmitter  at  every 
one  of  the  frequencies  used  in  the  hopping  pattern.  Such  requirement  would  be  difficult  to  satisfy 
during  frequency  hopping.  On  the  other  hand,  FSK  detection  can  be  noncoherent  without  the 
need  for  carrier  phase  coherence  and  can  be  easily  incorporated  into  FHSS  systems. 

The  frequency  upconverter,  as  discussed  in  Example  4.2  of  Chapter  4,  can  be  a  mixer  or 
a  multiplier  followed  by  a  bandpass  filter.  Denote  Ts  as  the  symbol  period.  Then  the  M- ary 
FSK  modulation  signal  can  be  written  as 

5fsk  (/)  =  A  cos  (oW  +  <t>m)  ntTs  <t  <(m+\)T,  ( 1 2. 1  a) 


in 


which  the  M- ary  FSK  angular  frequencies  are  specified  by 


<Dm 


I  3  M  ~  1  A 

=  ov  ±  -A CO,  O)c±-A(o . (Dc  ±  — 2~ 


(12.1b) 


The  frequency  synthesizer  output  is  constant  tor  a  period  olTc 
denote  the  frequency  synthesizer  output  as  <oh  in  a  given  chip. 


often  known  as  a  “chip.”  If  we 
then  the  FHSS  signal  is 


Sm(f)  =  A  COS  [((Dh  +  (Dm) I  +  (pm I 


(12.2) 


.  .  ,  T  Jhe  frequency  hopping  pattern  is  controlled  by  the  PN 

for  .he  particular  ch  P  ™  1  «  J  an  lde„,ical  PN  g eneruror  enables 

genera, or  and  typical I,  looks  hke band  <i.e..  ,he  band  ,he 

FSK  slgna,  on,,  has  bandwidrh  H,  ,hen  ,he  FHSS 
signal  will  occupy  a  bandwidth  L  times  larger 


This  factor  Lis  known  as  the ^sprea  g  ^  corresponding  symbol  rate  \sR<  -  \/Ts  and 
For  symbol  period  T  and  chip  p  ^  offreqUency  hopping  in  FHSS.  If  T,  >  T„ 
the  hopping  rate  is  Rc  -  1/  c-  '  .  <  -  jt  u  known  as  fast  FHSS.  and  there  are 

then  the  FH  is  known  as  s  ow  FT  -  ds  under  fast  hopping,  each  data  symbol 

multiple  hops  within  each  data  symbol. 
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Figure  12.2 

Typical  Bluetooth 
frequency 
hopping  pattern. 


t 

Time  slots  (chips) 


Figure  12.3 

Effects  of 

(a)  narrowband 
jamming, 

(b)  FHSS  under 
broadband 
jamming,  and 

(c)  partial  band 
jamming. 


is  spread  across  multiple  frequency  bands  because  of  the  fast  hopping,  and  must  be  detecte 
by  detection  over  these  frequency  bands. 

One  major  advantage  of  FHSS  lies  in  its  ability  to  combat  jamming.  Suppose  a  jamming 
source  has  a  finite  level  of  jamming  power  Pj.  Against  a  narrowband  signal  with  an 
width  Bs>  the  jamming  source  can  transmit  within  Bs  at  all  time,  creating  an  interference 
PSD  level  of  Pj /Bs.  Hence,  the  signal-to-interference  ratio  (SIR)  for  the  narrowband  ( 
transmission  is 


( _  Eb  _  (12.3a) 

V  /  /nb  ”  Pj/Bs  ~  Pj  ' 

On  the  other  hand,  against  FHSS  signal  with  total  bandwidth  of  Bc,  the  jamming  source  must 
divide  its  limited  power  and  will  generate  a  much  lower  level  of  interference  PSD  with  ave  r 
value  Pj  /Bc.  As  a  result,  at  any  given  time,  the  signal  bandwidth  is  still  Bs  and  the  SIR  1 


(12.3b) 


Therefore,  with  a  spreading  factor  of  L,  an  FH  signal  is  L  times  more  resistant  to  a  jamming 
signal  with  finite  power  than  a  narrowband  transmission.  Figure  12.3a  and  b  illustra 
ditterent  effects  of  the  finite  jamming  power  on  narrowband  and  FHSS  signals.  sjanal 
On  the  other  hand,  the  jammer  may  decide  to  concentrate  all  its  power  Pj  in  a  narrov  ^ 
bandwidth  against  FHSS.  This  will  achieve  partial  band  jamming.  If  the  frequency  <W  ^ 
is  slow,  such  that  Tc  =  Ts ,  then  on  average  one  out  of  every  L  user  symbols  will  err 
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the  strong  interference,  as  shown  in  Fig.  12.3c,  Consider  BFSK.  We  can  assume  a  very  strong 
interference  such  that  the  bits  transmitted  in  the  jammed  frequency  band  have  the  worst  BER 
of  0.5.  Then,  after  averaging  of  the  L  bands,  the  total  BER  of  this  partially  jammed  FHSS 
system  will  be 


Pb  = 


L-  1 
L 


(12.4) 


Thus,  the  partially  jammed  FHSS  signal  detection  has  rather  high  BER  under  slow  hopping.  By 
employing  a  strong  enough  forward  error  correction  (FEC)  codes,  to  be  discussed  in  Chapter 
15,  such  data  errors  can  be  corrected  by  the  receiver. 


Example  12.1  Consider  the  case  of  a  fast  hopping  system  in  which  Tc  <K  Ts.  There  are  L  frequency  bands  for 
this  FHSS  system.  Assume  that  a  jamming  source  jams  one  of  the  L  bands.  Let  the  number  of 
hops  per  Ts  be  less  than  L  and  no  frequency  is  repeated  in  each  T*.  Derive  the  BER  performance 
of  a  fast  hopping  BFSK  system  under  this  partial  jamming. 


With  fast  hopping,  each  user  symbol  hops  over 

Lh  —  Ts/Tc  Lli  <  L 


narrow  bands.  Hence  on  average,  a  user  symbol  will  encounter  partial  jamming  with  a 
probability  of  Lh/L.  When  a  BFSK  symbol  does  not  encounter  partial  jamming  during 
hopping  its  BER  remains  unchanged.  If  a  BFSK  symbol  does  encounter  partial  band  jam¬ 
ming  we  can  approximate  its  BER  performance  by  discarding  the  energy  in  the  jammed 
band.  In  other  words,  we  can  approximate  the  BFSK  symbol  performance  under  jamming 
by  letting  its  useful  signal  energy  be 


Thus,  on  average,  the  BFSK  performance  under  fast  hopping  consists  of  statistical  average 
of  the  two  types  of  BFSK  bits: 


Pb  =  j  exP 


-) 


In  particular,  when  L 


1  fast  hopping  FHSS  clearly  achieves  much  better  BER  as 


In  other  words,  by  using  fas,  hopping.  the  BER  performance  of  FHSS  under  partial 
jamming  approaches  the  BER  without  jammtng. 
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1 2  2  MULTIPLE  FHSS  USER  SYSTEMS 
AND  PERFORMANCE 

Clearly,  FHSS  systems  provide  better  security  against  potential  enemy  jammers  or  interceptors. 
Without  full  knowledge  of  the  hopping  pattern  that  has  been  established,  adversaries  cannot 
follow,  eavesdrop  on,  or  jam  an  FHSS  user  transmission.  On  the  other  hand,  if  an  FHSS  system 
has  only  one  transmitter,  then  its  use  of  the  much  larger  bandwidth  Bc  would  be  too  wasteful. 
To  improve  the  frequency  efficiency  of  FHSS  systems,  multiple  users  may  be  admitted  over 
the  same  frequency  band  Bc  with  little  performance  loss. 

As  shown  in  Fig.  12.4,  each  of  the  M  users  is  assigned  a  unique  PN  hopping  code  that 
controls  its  frequency  hopping  pattern  in  FHSS.  The  codes  can  be  chosen  so  that  the  users 
never  or  rarely  collide  in  the  spectrum  with  one  another.  With  multiple  users  accessing  the 
same  L  bands,  spectral  efficiency  can  be  made  equal  to  the  original  FSK  signal  without  any 
loss  of  FHSS  security  advantages.  Thus,  multiple  user  access  becomes  possible  by  assigning 
these  distinct  hopping  (spreading)  codes  to  different  users,  leading  to  code  division  multiple 

access  (CDMA).  . 

Generally,  any  overlapping  of  two  or  more  user  PN  sequences  would  lead  to  signal  collision 

in  frequency  bands  where  the  PN  sequence  values  happen  to  be  identical  during  certain  chips. 
Theoretically,  well-designed  hopping  codes  can  prevent  such  user  collisions.  However,  in 
practice,  the  lack  of  a  common  synchronization  clock  observable  by  all  users  means  that  eac 
user  exercises  frequency  hopping  independently.  Also,  sometimes  there  are  more  than  L  active 
users  gaining  access  to  the  FHSS  system.  Both  cases  lead  to  user  collision.  For  slow  and  fast 
FHSS  systems  alike,  such  collision,  would  lead  to  significant  increases  in  user  detection  errors. 

Performance  of  FHSS  with  Multiple  User  Access 

For  any  particular  FHSS  CDMA  user,  the  collision  problem  would  typically  be  limited  to  is 
partial  band.  In  fact,  the  effect  of  such  collisions  is  similar  to  the  situation  of  parti  an 
jamming,  as  analyzed  next.  . 

Recall  that  the  performance  analysis  of  FSK  systems  has  been  discussed  in  aP 
11  (Section  11.7)  under  AWGN  channels.  It  has  been  shown  that  the  probability  of  sym 
detection  error  for  noncoherent  M- ary  FSK  signals  is 


PeM  =  1  -  P cM  = 


(— l)m+l  e~mEb  log2M/AAm+l) 
m  +  1 


(12.5) 


For  slow  FHSS  systems,  each  data  symbol  is  transmitted  using  a  fixed  frequency 
Therefore,  the  detection  error  probability  of  slow  FHSS  system  is  identical  to  Eq. 


Figure  12.4 

CDMA  in  FHSS 
in  which  each  of 
the  M  users  is 
assigned  a 
unique  PN  code. 
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In  particular,  the  BER  of  the  binary  FSK  system  is  shown  to  be  (see  Eq.  ( 1 1 .145)  in  Sec.  1 1 . 1 1 1 


Pb  =  Le~E„/2V 


However,  it  two  users  transmit  simultaneously  in  the  same  frequency  band,  a  collision  or 
"hit  occurs.  In  this  case  we  will  assume  that  the  probability  of  error  is  O.5.*  Thus  the  overa 
probability  of  bit  error  can  be  modeled  as 


p„  =  ^-£*'w(  i-p,)+-2n 


(12.6) 


where  Ph  is  the  probability  of  a  hit,  which  we  must  determine.  Consider  random  hopping.  If 
there  are  L  frequency  slots,  there  is  a  \/L  probability  that  a  given  interferer  will  be  present  in 
the  desired  user’s  slot.  If  there  are  M  -  I  interferers  or  other  users,  the  probability  that  at  least 
one  is  present  in  the  desired  frequency  slot  is 


(12.7) 


assuming  L  is  large.  Substituting  this  into  Eq.  ( 1 2.6)  gives 


(12.8) 


If  M  =  1 ,  the  probability  of  error  reduces  to  the  BER  of  BFSK.  If  M  ^  1 ,  by  letting  Ei,/M  to 
approach  infinity,  we  see  that  under  random  hopping, 


which  illustrates  the  irreducible  floor  of  the  detected  bit  error  rate  due  to  multiple  access 
interference  (MAI).  It  is  therefore  important  to  design  hopping  patterns  to  reduce  Ph  with 
multiple  users. 

Asynchronous  FHSS 

The  previous  analysis  assumes  that  all  users  hop  their  carrier  frequencies  in  synchronization. 
This  is  known  as  slotted  frequency  hopping.  Such  kind  of  time  slotting  is  easy  to  maintain 
if  distances  between  all  transmitter-receiver  pairs  are  essentially  the  same.  This  may  not  be 
a  realistic  scenario  for  many  FHSS  systems.  Even  when  synchronization  can  be  achieved 
between  individual  user  clocks,  different  transmission  paths  will  not  arrive  synchronously  due 
to  the  various  propagation  delays.  A  simple  development  for  asynchronous  performance  can 
be  shown  following  the  approach  of  Geronoitis  and  Pursley,1  which  shows  that  the  probability 
of  a  hit  in  the  asynchronous  case  is 


(12.10) 


* 


This  is  actually  pessimistic,  since  studies  have  shown  that  this  value  can  be  lower. 
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where  Nh  is  the  number  of  bits  per  hop.  Comparing  Eqs.  ( 1 2. 1 0)  and  ( 1 2.7)  we  see  that,  for 
the  asynchronous  case,  the  probability  of  a  hit  is  increased,  as  expected.  By  using  Eq.  (12.10) 
in  Eq.  (12.6),  we  obtain  the  probability  of  error  for  the  asynchronous  case  as 


(12.11) 


As  in  the  case  of  partial  band  jamming,  the  BER  of  the  FHSS  users  decreases  as  the  spreading 
factor  increases.  Additionally,  by  incorporating  a  sufficiently  strong  FEC  at  the  transmitter 
code,  the  FHSS  CDMA  users  can  accommodate  most  of  the  collisions. 


Example  1  2.2  Consider  an  AWGN  channel  with  noise  level  Af  =  10-11.  A  user  signal  is  a  binary  FSK 
modulation  of  data  rate  16  kbit/s  that  occupies  a  bandwidth  of  20  kHz.  The  received  signal 
power  is  -20 dBm.  An  enemy  has  a  jamming  source  that  can  jam  either  a  narrowband  or  a 
broadband  signal.  The  jamming  power  is  finite  such  that  the  total  received  jamming  signal 
power  is  at  most  -26  dBm.  Use  a  spreading  factor  L  =  20  to  determine  the  approximate 
improvement  of  signal-to-noise  ratio  for  the  FHSS  system  under  jamming. 


Since  Ps  =  -20  dBm  =  10~5  W  and  Th  =  1/16.000,  the  energy  per  bit  equals 

1 


Eb  —  Ps'Tb  — 


1.6  x  109 


On  the  other  hand,  the  noise  level  is  M  =  1 0“ 1 1 .  Let  the  jamming  signal  have  Gaussian 
distribution.  The  jamming  power  level  equals  Pj  =  —26  dBm  =4x10  W. 

When  jamming  occurs  over  the  narrow  band  of  20kHz,  the  power  level  ot  t  e 
interference  is 

Jn  = - — - =  2  x  10_l° 

20,000  Hz 


Thus,  the  resulting  signal-to-noise  ratio  is 

Eb  (1.6  x  lO9)"1 


Jn  +  Af  2  X  10-'°+  10~10 


;  4.74  dB 


If  the  jamming  must  cover  the  entire  spread  spectrum  L  times  wider,  then  the  power  le 
of  the  interference  becomes  20  times  weaker: 


Jn  = 


Pj 


400,000  Hz 

Thus,  the  resulting  signal-to-noise  ratio  in  this  case  is 

Eb  _  (1.6  x  109)"1 
Jn+Af  ~  10-"  +  10-"  ~ 


=  1  x  10-11 


14.95  dB 


The  improvement  of  SNR  is  approximately  lOdB. 
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12.3  APPLICATIONS  OF  FHSS 

FHSS  has  been  adopted  in  several  practical  applications.  The  most  notable  ones  among  them 
are  the  wireless  local  area  network  (WLAN)  standard  for  Wi-Fi,  known  as  the  IEEE  802.1 12 
and  the  wireless  personal  area  network  (WPAN)  standard  of  Bluetooth. 

From  IEEE  802.11  to  Bluetooth 

IEEE  802.1 1  was  the  first  Wi-Fi  standard  initially  released  in  1997.  With  data  rate  limited  to 
2  Mbit/s,  802.1 1  only  had  very  limited  deployment  before  1999,  when  the  release  and  much 
broader  adoption  of  IEEE  802.11a  and  802.1  lb  removed  the  FHSS  option.  Now  virtually 
obsolete,  IEEE  802.1 1  was  miraculously  revived  in  the  highly  successful  commercial  product 
sold  as  Bluetooth ?  Bluetooth  differs  from  Wi-Fi  in  that  Wi-Fi  systems  are  required  to  provide 
higher  throughput  and  covers  greater  distances.4  Wi-Fi  can  also  be  more  costly  and  consumes 
more  power. 

Bluetooth,  on  the  other  hand,  is  an  ultra-short-range  communication  system  used  in  elec¬ 
tronic  products  such  as  cellphones,  computers,  automobiles,  modems,  headsets,  and  appliances. 
Replacing  line-of-sight  infrared,  Bluetooth  can  be  used  when  two  or  more  devices  are  in  prox¬ 
imity  to  each  other.  It  does  not  require  high  bandwidth.  Because  Bluetooth  is  basically  the 
same  as  the  IEEE  802. 1 1  frequency  hopping  (FH)  option,  we  only  need  to  describe  its  details. 

The  protocol  operates  in  the  license-free  industrial,  scientific,  and  medical  (ISM)  band  of 
2.4  to  2.4835  GHz.  To  avoid  interfering  with  other  devices  and  networks  in  the  ISM  band,  the 
Bluetooth  protocol  divides  the  band  into  79  channels  of  1  MHz  bandwidth  and  executes  (slow) 
frequency  hopping  at  a  rate  of  up  to  1600  Hz.  Two  Bluetooth  devices  synchronize  frequency 
hopping  by  communicating  in  a  master-slave  mode  relationship.  A  network  group  of  up  to 
eight  devices  form  a  piconet.  which  has  one  master.  A  slave  node  of  one  piconet  can  be  the 
master  of  another  piconet.  Relationships  between  master  and  slave  nodes  in  piconets  are  shown 
in  Fig.  12.5.  A  master  Bluetooth  device  can  communicate  with  up  to  seven  active  devices.  At 
any  time.  The  master  device  can  bring  into  active  status  up  to  255  further  inactive,  or  parked, 
devices.  One  special  feature  of  Bluetooth  is  its  ability  to  implement  adaptive  frequency  hopping 
(AFH).  This  adaptivity  is  built  in  to  allow  Bluetooth  devices  to  avoid  crowded  frequencies  in 
the  hopping  sequence. 

The  modulation  of  the  (basic  rate)  Bluetooth  signal  is  shown  in  Fig.  1 2.6.  The  binary  signal 
is  transmitted  by  means  of  Gaussian  pulse  shaping  on  the  FSK  modulation  signal.  As  shown 


Figure  12.5 

An  area  with  the 
coverage  of 
three  piconets: 
ro,  master 
nodes;  s,  slave 
nodes;  s/m, 
slave/master.  A 
^ode  can  be 
doth  a  master 
of  one  piconet 
(no.  1)  and  a 
^ave  of  another 
(no.  3). 


Piconet  no.2 
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Figure  12.6 

FHSS  modulation 
in  802.1 1  and 
Bluetooth. 


IMDLC  I  X.  I 

Major  Specifications  of  802.1 1  FHSS  and  Bluetooth. 


802.1 1  FHSS  Bluetooth  (basic  rate) 

Frequency  band 

ISM  (2.4-2.4835  GHz) 

Duplex  format 

TDD 

Single-channel  bandwidth 

1  MHz 

Number  of  nonoverlapping  channels 

79 

BTS  product 

0.5 

Minimum  hopping  distance 

6 

Modulation 

GFSK-2  and  GFSK-4  GFSK-2 

Data  rate 

1  Mbit/s  and  2  Mbit/s  723.1  kbit/s 

Hopping  rate 

2.5-160  Hz  1600  Hz 

in  Fig.  12.6,  a  simple  binary  FSK  replaces  the  Gaussian  low-pass  filter  with  a  direct  path.  The 
inclusion  of  the  Gaussian  low-pass  filter  generates  what  is  known  as  the  Gaussian  FSK  (or 
GFSK)  signal.  GFSK  is  a  continuous  phase  FSK.  It  achieves  better  bandwidth  etficiency  by 
enforcing  phase  continuity.  Better  spectral  efficiency  is  also  achieved  through  partial  response 
signaling  (PRS)  in  GFSK.  The  Gaussian  filter  response  stretches  each  bit  over  multiple  symbo 
periods. 

More  specifically,  the  Gaussian  LPF  impulse  response  is  ideally  given  by 


h(t)  = 


or  = 


\ftn2 
2  ttB 


where  B  is  the  3  dB  bandwidth  of  the  Gaussian  low-pass  filter.  Because  this  response  is 
noncausal,  the  practical  implementation  truncates  the  filter  response  to  4 Ts  seconds.  This^ay 
each  bit  of  information  is  extended  over  a  window  3  times  broader  than  the  bit  duration  $• 
Note  that  the  selection  of  B  is  determined  by  the  symbol  rate  1  /Ts.  In  802. 1 1  and 
B  =  0.5/7*  is  selected.  The  FM  modulation  index  must  be  between  0.28  and  0.35.  The  ^ 
symbol  rate  is  always  1  MHz;  binary  FSK  and  four-level  FSK  can  be  implemented  as  G 
and  GFSK-4,  achieving  data  throughput  of  1  and  2  Mbit/s,  respectively.  Table  1 2. 1  summarize 
the  key  parameters  and  differences  in  IEEE  802.1 1  and  the  Bluetooth  (basic  rate).  j 

We  note  that  our  discussions  on  Bluetooth  have  focused  on  the  (basic  rate)  versions 
and  1 .2.  More  recently,  version  2  of  Bluetooth  has  been  released.4  Version  2.0  implementatio  ^ 
feature  Bluetooth  Enhanced  Data  Rate  (EDR)  and  reach  2.1  Mbit/s.  Technically,  versl°n^s 
devices  retain  the  FHSS  feature  but  resort  to  the  more  efficient  (differential)  PSK  modu  ati 


SINCGARS 

SINCGARS  stands  for  single  channel  ground  and  airborne  radio  system.  It  represents a  ^ 
ily  of  VHF-FM  combat  radios  used  by  the  U.S.  military.  First  produced  by  ITT  m  ^ 
SINCGARS  transmits  voice  with  FM  and  data  with  binary  CPFSK  at  16  kbit/s,  occup>^-^. 
bandwidth  of  25  kHz.  There  can  be  as  many  as  2320  channels  within  the  operational 
30  to  87.975  MHz. 
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To  combat  jamming,  SINCGARS  radios  can  implement  frequency  hopping  at  the  rather 
slow  rate  of  100  Hz.  Because  the  hopping  rate  is  quite  slow,  SINCGARS  is  no  longer  effective 
against  modem  jamming  devices.  For  this  reason,  SINCGARS  is  being  replaced  by  the  newer 
and  more  versatile  JTRS  (joint  tactical  radio  system). 


From  Hollywood  to  CDMA 

Like  many  good  ideas,  the  concept  of frequency  hopping  also  had  multiple  claims  of  inventors. 
One  such  patent  that  gained  little  attention  was  awarded  to  Willem  Broertjes  of  Amsterdam, 
Netherlands,  in  August  1932  (U.S.  Patent  no.  1 ,869,659).”'  However,  the  most  intriguing  patent 
on  frequency  hopping  came  from  one  of  Hollywood’s  well-known  actresses  during  World 
War  II,  Hedy  Lamarr.  In  1 942  she  and  her  coinventor  George  Antheil  (an  eccentric  composer) 
were  awarded  U.S.  patent  no.  2,292,387  for  their  “Secret  Communications  System.”  The  patent 
was  designed  to  make  radio-guided  torpedoes  harder  to  detect  or  to  jam.  Largely  because  of  the 
Hollywood  connection,  Hedy  Lamarr  became  a  legendary  figure  in  the  wireless  communication 
community,  often  credited  as  the  inventor  of  CDMA,  whereas  other  less  glamorous  figures 
such  as  Willem  Broertjes  have  been  largely  forgotten. 

Hedy  Lamarr  was  a  major  movie  star  of  her  time.6  Born  Hedwig  Eva  Maria  Kiesler  in 
Vienna,  Austria,  she  first  gained  fame  in  the  1933  Austrian  film  Ecstasy  for  some  shots  that 
were  highly  unconventional  in  those  days.  In  1937,  escaping  the  Nazis  and  her  first  husband  (a 
Nazi  arms  dealer),  she  went  to  London,  where  she  met  Louis  Burt  Mayer,  cofounder  and  boss 
of  the  MGM  studio.  Mayer  helped  the  Austrian  actress’s  Hollywood  career  by  giving  her  a 
movie  contract  and  a  new  name — Hedy  Lamarr.  Lamarr  starred  with  famous  colleagues  such 
as  Clark  Gable,  Spencer  Tracy,  and  Judy  Garland,  appearing  in  more  than  a  dozen  films  during 
her  film  career. 

Clearly  gifted  scientifically,  Hedy  Lamarr  worked  with  George  Antheil,  a  classical  com¬ 
poser,  to  help  the  war  effort.  They  originated  an  idea  of  a  sophisticated  antijamming  device  for 
use  in  radio-controlled  torpedoes.  In  August  1942,  under  her  married  name  at  the  time,  Hedy 
Kiesler  Markey,  Hedy  Lamarr  was  awarded  U.S.  Patent  no.  2,292,387  (Fig.  1 2.7),  together  with 
George  Antheil.  They  donated  the  patent  as  their  contribution  to  the  war  effort.  Drawing  inspi¬ 
ration  from  the  composer’s  piano,  their  invention  of  frequency  hopping  uses  88  frequencies, 
one  for  each  note  on  a  piano  keyboard. 

However,  the  invention  would  not  be  implemented  during  World  War  D.  It  was  simply 
too  difficult  to  pack  vacuum  tube  electronics  into  a  torpedo.  The  idea  of  frequency  hopping, 
nevertheless,  became  reality  20  years  later  during  the  1962  Cuban  missile  crisis,  when  the 
system  was  installed  on  ships  sent  to  block  communications  to  and  from  Cuba.  Ironically, 
by  then  the  Lamarr-Antheil  patent  had  expired.  The  idea  of  frequency  hopping,  or  more 
broadly  the  idea  of  spread  spectrum,  has  since  been  extensively  used  in  military  and  civilian 
communications,  including  cellular  phones,  wireless  LAN,  Bluetooth,  and  numerous  other 


wireless  communications  systems. 

Only  in  recent  years  has  Hedy  Lamarr  started  receiving  a  new  kind  of  recognition  as  a 
celebrity  inventor  In  1997  Hedy  Lamarr  and  George  Antheil  received  the  Electronic  Frontier 
Foundation  (EFF)  Pioneer  Award.  Furthermore,  in  August  1 997,  Lamarr  was  honored  with  the 
prized  BULBIE  Gnass  Spirit  of  Achievement  Bronze  Award  (the  “Oscar"  of  inventing).  If  she 
had  won  an  Academy  Award  for  her  film  works  in  film,  she  would  have  been  the  only  person 
to  receive  two  entirely  different  “Oscar”  awards!  Still,  inventors  around  the  world  are  truly 
delighted  to  welcome  a  famous  movie  celebrity  into  their  ranks^ 

Inventor  Hedy  Kiesler  Markey  died  in  2000  at  the  age  of  86. 
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Figure  12.7 

Figure  1  from  the 
Lamarr-Antheil 
patent  (From 
U.S.  Patent  and 
Trademark 
Office.) 
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1 2.4  DIRECT  SEQUENCE  SPREAD  SPECTRUM 

FHSS  systems  exhibit  some  important  advantages,  including  low-complexity  transceivers  and 
resistance  to  jamming.  However,  the  difficulty  of  carrier  synchronization  under  frequency 
hopping  means  that  only  noncoherent  demodulations  for  FSK  and  DPSK  are  actually  practica 
As  shown  in  the  analysis  from  Sec.  11.11,  FSK  and  DPSK  tend  to  have  poorer  BER  performance 
(power  efficiency)  and  poorer  bandwidth  efficiency  compared  with  QAM  systems,  w  11 
require  coherent  detection.  Furthermore,  its  susceptibility  to  collision  makes  FHSS  a  less 
effective  technology  for  CDMA.  As  modern  communication  systems  have  demonstrated,  direct 
sequence  spread  spectrum  (DSSS)  systems  are  much  more  efficient  in  bandwidth  and  p°^er 
utilization.7  Today,  DSSS  has  become  the  dominant  CDMA  technology  in  advanced  wire  ess 
communication  systems.  It  is  not  an  exaggeration  to  state  that  DSSS  and  CDMA  are  a  m 
synonymous. 


Optimum  Detection  of  DSSS  PSK  Hh 

Direct  sequence  spread  spectrum  is  a  technology  that  is  more  suitable  for  integration  ^ 
bandwidth-efficient  linear  modulations  such  as  QAM/PS K.  Although  there  are  several  dit  er^ 
ways  to  view  DSSS,  its  key  operation  of  spectrum  spreading  is  achieved  by  a  PN  sequeIJ  ^ 
also  known  as  the  PN  code  or  PN  chip.  The  PN  sequence  is  mostly  binary,  consisting  o  ^ 
and  Os,  which  are  represented  by  polar  signaling  of  +1  and  -1.  To  minimize  interfere^ 
and  to  facilitate  chip  synchronization,  the  PN  sequence  has  some  nice  autocorrelation 
cross-correlation  properties.  . 

Direct  sequence  spread  spectrum  (DSSS)  expands  the  traditional  narrowband  signa 
utilizing  a  spreading  signal  c(t).  As  shown  in  Fig.  12.8,  the  original  data  signal  is  l'ne 


Figure  12.8 

DSSS  system. 


Figure  12.9 

(a)  PN  sequence 
autocorrelation 
function. 

(b)  Six-stage 
generator  of  a 
maximum  length 
PN  sequence. 
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Output 


modulated  into  a  QAM  signal  *qam(0-  Instead  of  transmitting  this  signal  directly  over  its 
required  bandwidth.  DSSS  modifies  the  QAM  signal  by  multiplying  the  spreading  chip  sig¬ 
nal  c(r)  with  the  QAM  narrowband  signal.  Although  the  signal  carrier  frequency  remains 
unchanged  at  coc,  the  new  signal  after  spreading  becomes 


•sds  (0  =  •sqam  0  )c(t )  (12.12) 

Hence,  the  transmitted  signal  susto  »s  a  Product  of  two  siSnals  whose  sPread  band" 
width  is  equal  to  the  bandwidth  sum  of  of  the  QAM  signal  sQmU)  and  the  spreading 

signal  c(t). 


PN  Sequence  Generation  .  ..  .  , 

A  good  PN  sequence  c(t)  is  characterized  by  an  autocorrelation  that  is  similar  to  that  of 

a  white  noise  This  means  that  the  autocorrelation  function  of  a  PN  sequence  should  be 
high  near  r  =  0  and  low  for  all  r  *0.  as  shown  in  Fig.  12.9a.  Moreover,  in  CDMA  appli¬ 
cations  several  users  share  the  same  band  using  different  PN  sequences.  Hence,  it  is  necessary 
that  the  cross-correlation  among  different  pairs  of  PN  sequences  be  small  to  reduce  mutual 

i  ntprfprpncc 

A  PN  code  is  periodic.  A  digital  shift  register  circuit  with  output  feedback  can  gen¬ 
erate  a  sequence  with  long  period  and  low  susceptibility  to  structural  identification  by  an 
outsider  The  most  widely  known  binary  PN  sequences  are  the  maximum  length  shift  reg¬ 
ister  sequences  (^-sequences).  Such  a  sequence,  which  can  be  generated  by  an  m-stage 
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shift  register  with  suitable  feedback  connection,  has  a  length  L  =  2m  -  1  bits,  the  maxi¬ 
mum  period  for  such  a  finite  state  machine.  Figure  12.9b  shows  a  shift  register  encoder  for 
m  =  6  and  L  =  63.  For  such  “short”  PN  sequences,  the  autocorrelation  function  is  nearly  an 
impulse  and  is  periodic 


r  =  0,  ±LTC, 
r  #  0,  ±LTC, 


(12.13) 


As  a  matter  of  terminology,  a  DSSS  spreading  code  is  a  short  code  if  the  PN  sequence 
period  equals  the  data  symbol  period  Ts.  A  DSSS  spreading  code  is  a  long  code  if  the  PN 
sequence  period  is  a  (typically  large)  multiple  of  the  data  symbol  period. 

Single-User  DSSS  Analysis 

The  simplest  analysis  of  DSSS  system  can  be  based  on  Fig.  1 2.8.  To  achieve  spread  spectrum, 
the  chip  signal  c(r)  typically  varies  much  faster  than  the  QAM  symbols.  As  shown  in  Fig.  12.8, 
there  are  multiple  chips  of  ±1  within  each  symbol  duration  of  Ts.  Denote  the  spreading 
factor 


L  =  Ts/Tc  Tc  =  chip  period 


Then  the  spread  signal  spectrum  is  essentially  L  times  broader  than  the  original  modulation 
spectrum 


Bc  —  (L  +  1  )BS  %  L  •  Bs 


Note  that  the  spreading  signal  c(t )  =  ±  1  at  any  given  instant.  Given  the  polar  nature  of  the 
binary  chip  signal,  the  receiver,  under  an  AWGN  channel,  can  easily  “despread"  the  received 
signal 


easily  “despread”  the  received 


y(t)  =  •vdsCO  +  n(r)  =  .sqam(0c(0  +  n(0 


(12.14) 


by  multiplying  the  chip  signal  with  the  received  signal 


r(r)  =  c(t)y(t) 

=  *qam  (t)c2(t)  +  n(t)c(t) 
=  •vqamO)  +  n(r)c(r) 


(12.15) 

(12.16) 


x(0 


Thus,  this  multiplication  allows  the  receiver  to  successfully  “despread”  the  spread  spectni"1 
signal.  The  analysis  of  the  DSSS  receiver  depends  on  the  characteristics  of  the  noise  * ^ 
Because  c(r)  is  deterministic,  and  n(r)  is  Gaussian  with  zero  mean,  x(t)  remains  GaUS^p 
with  zero  mean.  As  a  result,  the  receiver  performance  analysis  requires  finding  only  the 


of  x(f). 


of  the  noise  *(/) 
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To  determine  the  power  spectral  density  of  the  “despread"  noise  x(f)  =  n(/)c(r),  we  can 
start  from  the  definition  of  PSD  (Sec.  9.3): 


Sx(/)  =  lim 

T—>oo 

1 

rs 

x 

=  lim 

T->o o 

'  1  fT/2  fT/2 

7  J-T/2J-T/2 

=  lim 

T-*oc  ‘ 

J  (.r/2  fT/2 

-  /  x(ri)x(r2)e  j2;r/('2  ,')dt\dt2 

'  J-T/2  J-T/2 

\  rT/2  fT/2 

=  lim  —  /  /  c(t\)c(t2)nU\Mt2)e~j2nJ(,2~'l) 

t^oo  /  J-t/2  J-t/2 

(12.17a) 


■  [■ T/2  fT/2 

=  lim  -  /  /  c{t\)c(t2)Ra(t2-t\)e->27'fu'-")dt\dl2  (12.17b) 

T'-’-o o  I  J-t/2  J-T/2 


Recall  that 


/OO 

Sn(v)ei23TV{,2~,,>  dv 

-OO 


We  therefore  have 


-fi”V-vWi-h)dt[dt2dv 


I  poo  pU^ 

Sx(/)=  lim  I  /  /  /  c(tl)c(t2)Sn(v)e-j2jT«-vn,2-,')dt]dt2dv  (12.18a) 

T—*oo  T  J- oo  y_ r/2  ^-r/2 

foo  1  fT/2  [T/2 

=  LSM^fLj-rncMcM‘~ 
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-00 


•oo  |Ct-(/  -  v)|2 

t(v)  lim  - - - 

r->oo  / 


/oo 

5n(v)5t.(/-v)r/u 

-oo 


(12.18b) 


The  last  equality  comes  from  the  definition  of  PSD  for  c(/).  Equation  ( 1 2. 1 8)  illustrates  the 
dependency  of  the  detector  noise  PSD  on  the  chip  signal  c(l).  As  long  as  the  PN  sequence  is 
almost  orthogonal  such  that  it  satisfies  Eq.  (12.13),  then 


Rc(r)  *  LTC  ~  *  ‘ LTc) 


(12.19a) 


SAf)  *  LT,  ■  ±  £ Hf  ~  W)  =  E«- 

LI , 


(12.19b) 
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and 


SX(/)  =  £S  n(f-k/LTc)  (12.20) 

k 

In  other  words,  as  long  as  the  chip  sequence  is  approximately  orthogonal,  the  noise  at  the  QAM 
detector  remains  a  white  Gaussian  with  zero  mean.  For  practical  reasons,  the  white  noise  n(t) 
is  filtered  at  the  receiver  to  be  band-limited  to  1  /2TC.  As  a  result,  the  noise  spectrum  after  the 
despreader  still  is 


SAf)  =  y  (lI21> 

In  other  words,  the  spectral  level  also  remains  unchanged.  Thus,  the  performance  analysis 
carried  out  for  coherent  QAM  and  PSK  detections  in  Chapter  1 1  can  be  applied  directly. 

In  Sec.  11.6  we  showed  that  for  a  channel  with  (white)  noise  of  PSD  A/72,  the  error 
probability  of  optimum  receiver  for  polar  signaling  is  given  by 

p*  =  e(/f) 

where  £/,  is  the  energy  per  bit  (energy  of  one  pulse).  This  result  demonstrates  that  the  error 
probability  of  an  optimum  receiver  is  unchanged  regardless  of  whether  or  not  we  use  DSSS. 
While  this  result  appears  to  be  somewhat  surprising,  in  fact,  it  is  quite  consistent  with  the  AWGN 
analysis.  For  single  user ,  the  only  change  in  DSSS  lies  in  the  spreading  of  transmissions  over  a 
broader  spectrum  by  effectively  using  a  new  pulse  shape  c(f).  Hence,  the  modulation  remains 
QAM  whereas  the  channel  remains  AWGN.  Consequently,  the  coherent  detection  analysis  ol 
Sec.  1 1.6  is  fully  applicable  to  DSSS  signals. 


1 2.5  RESILIENT  FEATURES  OF  DSSS 

As  in  FHSS,  DSSS  systems  provide  better  security  against  potential  jamming  or  interception 
by  spreading  the  overall  signal  energy  over  a  bandwidth  L  times  broader.  First,  its  low  power 
level  is  difficult  for  interceptors  to  detect.  Furthermore,  without  the  precise  knowledge  ot  t  « 
user  spreading  code  [or  c(r)],  adversaries  cannot  despread  and  recover  the  baseband  QA^ 
signal  effectively.  In  addition,  partial  band  jamming  signals  interfere  with  only  a  portion  o 
the  signal  energy.  They  do  not  block  out  the  entire  signal  spectrum  and  are  hence  not  effective 
against  DSSS  signals. 

To  analyze  the  effect  of  partial  band  jamming,  consider  an  interference  i(t)  that  impmges 
on  the  receiver  to  yield 


y(t)  =  sqamUMO  +  i(t) 

Let  the  interference  bandwidth  be  Bj.  After  despreading,  the  output  signal  plus  interference 
becomes 


y(t)c(t)  =  sqam(0  +  i(t)c{t) 


(12.23) 
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figure  12.10 

Narrowband 
interference 
"litigation  by 
the  DSSS 
^spreader. 


It  is  important  to  observe  that  the  interference  term  has  a  new  frequency  response  because  of 
despreading  by  c(t) 


tflW  =  i(t)c(t)  <=>  /(/)  *  C(f) 


(12.24) 


which  has  approximate  bandwidth  Bc  +  B ,  =  LBS  -f  Bj. 

DSSS  Analysis  against  Narrowband  Jammers 

If  the  interference  has  the  same  bandwidth  as  the  QAM  signal  Bs ,  then  the  “despread”  inter¬ 
ference  ia(t)  will  now  have  bandwidth  equal  to  (L  +  1)#5.  In  other  words,  the  narrowband 
interference  i(t)  will  in  fact  be  spread  L  times  larger  by  the  “despreading”  signal  c(t). 

If  the  narrowband  interference  has  total  power  P,  and  bandwidth  Bs ,  then  the  original 
interference  spectral  level  before  despreading  is 


Si(f)  =  ^  /  €  (fe  -  0.5#,,  fc  +  0.5#, ) 

#v 


After  despreading,  the  spectrum  of  the  interference  ia(t)  becomes 


Because  of  the  despreading  operation,  the  narrowband  interference  is  only  l/(L  +  I)  the 
original  spectral  strength.  Note  that  the  desired  QAM  signal  still  has  its  original  bandwidth 
(Wc  -  ji  bs,  coc  + 1 r#,).  Hence,  against  narrowband  interferences,  despreading  can  reduce  the 
signal-to-interference  ratio  (SIR)  by  a  factor  of 


(12.25) 


Pi/(L+  1)B, 


This  result  illustrates  that  DSSS  is  very  effective  against  narrowband  (partial  band)  jamming 
signals^  It  effectively  improves  the  SIR  by  the  spreading  factor.  The  “spreading”  effect  of  the 
despreader  on  a  narrowband  interference  signal  is  illustrated  in  Fig.  12.10. 

The  ability  of  DSSS  to  combat  narrowband  jamming  also  means  that  a  narrowband  com¬ 
munication  signal  can  coexist  with  DSSS  signals.  The  SIR  analysis  and  Fig.  12.10  already 
established  the  resistance  of  DSSS  signals  to  narrowband  interferers.  Conversely,  if  a  nar¬ 
rowband  signal  must  be  demodulated  in  the  presence  of  a  DSSS  signal,  then  the  narrowband 


figure  12.10 
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Figure  12.11 

Equivalent 
baseband 
diagram  of 
DSSS  system. 


signal  can  also  be  extracted  with  little  interference  from  the  DSSS  signal  by  replacing  the 
despreader  with  a  narrow  bandpass  filter.  In  this  case,  the  roles  of  signal  and  interference  are 
in  fact  reversed. 


DSSS  Analysis  against  Broadband  Jammers 

In  many  cases,  interferences  come  from  broadband  sources  that  are  not  generated  from  the 
DSSS  spreading  approach.  Against  such  interferences,  the  despreading  operation  only  mildly 
broadens  and  weakens  the  interference  spectrum. 

Let  the  interference  be  broadband  with  the  same  bandwidth  LBS  as  the  spread  signal. 
Based  on  Eq.  (12.24),  the  interference  after  despreading  would  be  which  has  bandwidth 
of  2 LBS.  In  other  words,  broadband  interference  /(/)  will  in  fact  be  expanded  to  a  spectrum 
nearly  twice  as  wide  and  half  as  strong  in  intensity.  From  this  discussion,  we  can  see  that  a 
DSSS  signal  is  most  effective  against  narrowband  interferences  and  not  as  effective  against 
broadband  interferences. 


1 2.6  CODE  DIVISION  MULTIPLE-ACCESS  (CDMA) 

OF  DSSS 

The  RF  diagram  of  a  DSSS  system  can  be  equivalently  represented  by  the  baseband  diagram  ot 
Fig.  12. 1 1 ,  which  provides  a  new  perspective  on  the  DSSS  system  that  is  amenable  to  analysis. 
Let  the  (complex-valued)  QAM  data  symbol  be 

Sk  =  ak  +jbk  (k  -  1)7,  <  /  <  kTs  (l2-26) 


Then  it  is  clear  from  the  PN  chip  sequence  that  the  baseband  signal  after  spreading  is 


st  ■  c(t)  =  ( ak  +jbk)  ■  c(t)  (k  —  \)TS  <  t  <  kTs 


(12.27) 


In  other  words,  the  symbol  sk  is  using 


c(t)  (k-\)Ts<t  <  kTs 

as  its  pulse  shape  for  transmission.  Consequently,  at  the  receiver,  the  optimum  receiver  w°ul<j 
require  c(t )  to  be  used  as  a  correlator  receiver  (or,  equivalently,  a  matched  filter).  As  evi 
from  the  diagram  of  Fig.  12. 1 1,  the  despreader  serves  precisely  the  function  of  the.optl^er 
matched  filter  (or  correlator  receiver).  Such  a  receiver  is  known  as  a  conventional  single 
optimum  receiver. 


12.6  Code  Division  Multiple-Access  (CDMA)  of  DSSS 
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Figure  12.12 

A  code  division 
multiple-access 
(CDMA)  system 
based  on  DSSS. 


We  have  shown  that  DSSS  systems  enjoy  advantages  against  the  threat  of  narrowband 
jamming  and  attempts  at  interception.  However,  if  a  DSSS  system  has  only  one  signal  to 
transmit,  then  its  use  of  the  larger  bandwidth  Bc  would  be  too  wasteful.  Just  as  in  FHSS,  CDMA 
of  DSSS  can  be  achieved  by  letting  multiple  users,  each  given  a  distinct  PN  spreading  signal 
access  the  broad  bandwidth  of  LBS  simultaneously.  Such  a  multiple-access  system  with 
M  users  based  on  CDMA  is  shown  in  Fig.  12.12.  Each  user  can  apply  a  single-user  optimum 
receiver. 

Because  these  CDMA  users  will  be  transmitting  without  time  division  or  frequency  divi¬ 
sion,  multiple-access  interference  (MAI)  exists  at  each  of  the  receivers.  To  analyze  a  DSSS 
system  with  M  multiple-access  users,  we  compute  the  interference  at  the  output  of  a  given 
receiver  caused  by  the  remaining  M  -  1  users.  It  is  simpler  to  focus  on  the  time  interval 
[(£  -  1  )TS,  kTs ]  and  the  kth  symbol  of  all  M  users.  In  Fig.  12.12,  we  have  made  the  multiple 
assumptions  for  analytical  simplicity.  Here  we  state  them  explicitly: 

•  The  ith  user  transmits  one  symbol  over  the  interval  \(k  -  1  )TV,  kTs  |. 

•  There  is  no  relative  delay  among  A/  users,  and  each  receiver  receives  the  Ath  symbol  of  all 
Af  users  within  [(A  —  1)7^,  kTs]. 

(0  2 

•  All  user  symbols  have  unit  power;  that  is,  £{|^  I")  =  !• 

•  The  /th  user’s  transmission  power  is  P,. 

•  The  ith  user  channel  has  a  scalar  gain  of  g,. 

•  The  channel  is  AWGN  with  noise  n(/). 

The  first  two  assumptions  indicate  that  all  M  users  are  synchronous.  While  asynchronous 
CDMA  systems  are  commonplace  in  practice,  their  analysis  is  a  straightforward  but  nontrivial 
generalization  of  the  synchronous  case. 

Because  all  users  share  the  same  bandwidth,  every  receiver  will  have  equal  access  to  the 
same  channel  output  signal 


M 


y(0  =  E^V^s*)c-'(r)  +  n(0 


(12.28a) 


*  In  asynchronous  CDMA  analysis,  the  analysis  window  mus.be  enlarged  .0  translate  it  into  a  nearly  equivalent 
synchronous  CDMA  case  with  many  more  equivalent  users. 
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After  application  of  the  matched  filter  (despreading),  the  ith  receiver  output  at  the  sampling 
instant  t  =  kTs  is 


JO 


rkTs 

=  /  Ci(t)y(t)dt 

J  (k—\)Ts 

M 


M  pkTs  fkTs 

=  Y,gj>/Pjsk  Ci(t)Cj(t)dt+  a 

“  J(k-l)Ts 


(t)n(t)dt 


M 


=  J2gj^jRij(k)Sk  +n*k) 

j=  1 


(12.28b) 


For  notational  convenience,  we  have  defined  the  (time-varying)  cross-correlation  coefficient 
between  two  spreading  codes  as 


rkTs 

Rij(k)  =  I  Ci(t)Cj(t)dt 

J(k-\)T, 

and  the  ith  receiver  noise  sample  as 

rkTs 

m(k)  =  I  Cj(t)n(t)dt  (12.28d) 

J(k-l)T, 

It  is  important  to  note  that  the  noise  samples  of  Eq.  ( 1 2.28d)  are  Gaussian  with  mean 


(12.28c) 


_  rkTs  _ 

n  i(k)  =  /  Cj(t)n(t)dt  =  0 

J(k-l)Ts 


The  cross-correlation  between  two  noise  samples  can  be  found  as 


rkTs 

rlTs 

n,(*)n/(£)  =  / 

/  Cj(t j  )cj(t2)n(t\ )n(f2 )  dt \  dt2 

J(k-\)Ts  • 

ht-\)Ts 

rkTs 

rtTs 

=  / 

/  Cj(ti)cj(t2)Rn(t2  -  t\)dt\  dt2 

J(k-\)TS  . 

hi-\)Ts 

rkTs 

feTs  A f 

= 

/  Cj(t\)cj(t2)—8(t2  -  t\)dt\  dt2 

J(k-])Ts . 

J(e-i)Ts  2 

J^rr, 

fkT, 

=  - 

a r 

i\  /  Ci{tX)Cj{tX)dtx 

J(k—l)Ts 

=  - Rij(k)8[k  -  i] 

(12.29a) 

(12.29b) 


Equation  (12.29)  shows  that  the  noise  samples  at  the  DSSS  CDMA  receiver  are  teniporal  > 
white.  This  means  that  the  Gaussian  noise  samples  at  different  sampling  time  instants  are 
independent  of  one  another.  Therefore,  the  optimum  detection  of  [s^)  can  be  based  on  t  e 
samples  {rj'1 }  at  time  t  =  kT. 

For  short-code  CDMA,  (c,(r))  are  periodic  and  the  period  equals  Ts.  In  other  words,  t^e 
PN  spreading  signals  {c,(r)|  are  identical  over  each  period  [(k  -  1  )TS,  kTs].  Therefore. 
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short  code  CDMA  systems,  the  cross-correlation  coefficient  between  two  spreading  codes  is  a 
constant 


Rij(k)  =  Rij 


(12.30) 


Note  that  the  decision  variable  of  the  ith  receiver  is 

M 

rf  =  gi/P.RiAk)^  +  Y,  gjy/PjRij(k)sf  +n  ,(*)  ( 1 2.3 1 ) 


'f 

The  term  '  ’  is  an  additional  term  resulting  from  the  multiple-access  interference  of  the  M  —  1 
interfering  signals.  When  the  spreading  codes  are  selected  to  satisfy  the  orthogonality  condition 


RiAk)  —  0  i  A  j 


then  the  CDMA  multiple-access  interference  is  zero,  and  each  CDMA  user  obtains  performance 
identical  to  that  of  the  single  DSSS  user  or  a  single  baseband  QAM  user. 

There  are  various  ways  to  generate  orthogonal  spreading  codes.  Walsh-Hadamard  codes 
are  the  best-known  orthogonal  spreading  codes.  Given  a  code  length  of  L  identical  to  the 
spreading  factor,  there  are  a  total  of  L  orthogonal  Walsh-Hadamard  codes.  A  simple  example 
of  the  Walsh-Hadamard  code  for  L  =  8  is  given  here.  Each  row  in  the  matrix  of  Eq.  ( 1 2.32)  is 
a  spreading  code  of  length  8: 
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+1 

-1 

+1 

+1 

-1 

(12.32) 


At  the  next  level.  Walsh-Hadamard  code  has  length  16,  which  can  be  obtained  from  WK  via 


W2t 


K*-' 


W*-*  1 


In  fact,  starting  from  W,  =  [  1  1  with  k  =  0.  this  recursion  can  be  used  to  generate  length 
L  =  2k  Walsh-Hadamard  codes. 

Gaussian  Approximation  of  Nonorthogonal  MAI 

In  practical  applications,  many  user  spreading  codes  are  not  fully  orthogem.  As  a  result,  the 
effect  of  MAI  on  user  detection  performance  may  be  serious.  To  analyse  the  effect  of  MAI  on  a 
single-user  receiver,  we  need  to  study  the  MAI  probability  distribution.  The  exact  probability 
analysis  of  h  is  difficult.  An  alternative  is  to  use  a  good  approximation.  When  M  ,s  large,  one 
may  invoke  the  central  limit  theorem  to  approximate  the  MAI  as  a  Gaussian  random  variable. 
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Recall  that  the  QAM  symbols  are  independent  with  zero  mean  and  unit  variance,  that  is, 


=  0 


W  =  0 


i  /y 


AD 


=  1 


Hence,  we  can  approximate  the  MAI  as  Gaussian  with  mean 

M  _ 

c<»  _  | 


=  EWw*)** 


(12.33) 


and  variance 


M 


r<0 


=  £bf 


(12.34) 


The  effect  of  this  MAI  approximation  is  a  strengthened  channel  noise.  Effectively,  the 
performance  of  detection  based  on  decision  variable  r[‘ 1  is  degraded  by  the  additional  Gaussian 
MAI.  Based  on  single-user  analysis,  the  new  equivalent  SNR  is  degraded  and  becomes 

2  Eb 


Eb  (l£, I2  Pi  !*,,/(*) I2)  1  \gj\2  Pj  |/?,v(*)|2 

For  the  special  case  of  BPSK  or  polar  signaling,  the  BER  of  the  ith  CDMA  user  is  approximately 

/ 


2  Eb 


Eh  (lg,|2  Pi  \RiAk)\2Y'  Eh  \gj\2  Pj 


(12.35) 


Observe  that  when  a  single  user  is  present  (Af  =  1 ),  Eq.  ( 1 2.35)  becomes  the  well-known 
polar  BER  result  of 


'•-«(*) 

as  expected.  The  same  result  is  also  true  when  all  spreading  codes  are  mutually  orthogonal 
such  that  Rij(k)  =  0 ,i  £  j.  . 

In  the  extreme  case  of  noise-free  systems,  when  the  signal-to-noise  ratio  is  very 
(Eb/Af  — ►  oo),  we  obtain 


lim  Ph  =  Q 

Eb/N^  oo 


(  Ig.l2  Pi  |i?,y(*)|2  \ 

\E%i\gj\2  Pj  \RiM2) 


This  shows  the  presence  of  an  irreducible  error  floor  for  the  MAI  limited  case.  This  noise  fl°° 
vanishes  when  the  spreading  codes  are  mutually  orthogonal  such  that  Rij(k)  =  0  if 4 
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The  Near-Far  Problem 

The  Gaussian  approximation  of  the  MAI  has  limitations  when  used  to  predict  system  perfor¬ 
mance.  While  the  central  limit  theorem  implies  that  /f  will  tend  toward  a  Gaussian  distribution 
near  the  center  of  its  distribution,  convergence  may  require  very  large  number  of  CDMA 
users  M .  In  a  typical  CDMA  system,  the  user  number  M  is  only  in  the  order  of  64  to  128. 
When  M  is  not  sufficiently  large,  the  Gaussian  approximation  of  the  MAI  may  be  highly 
inaccurate,  particularly  in  a  near-far  environment. 

The  so-called  near-far  environment  describes  the  following  scenario. 

•  The  desired  transmitter  is  much  farther  away  from  its  receivers  than  some  interfering 
transmitters. 

•  The  spreading  codes  are  not  mutually  orthogonal;  that  is,  Rjj(k)  /  0  when  i  j. 

If  we  assume  identical  user  transmission  power  in  all  cases,  (i.e..  P,  =  P„),  in  the  near-far 
environment  the  desired  signal  channel  gain  g,  is  much  smaller  than  some  interferers’  channel 
gains.  In  other  words,  there  may  exist  some  user  set  J  such  that 

gi<gj  jeJ  (12.36) 


As  a  result,  Eq.  (12.31)  becomes 

if  =  VFogiRu(k)s^  +  V^£  SjRijik)*?  +  V^£  +  M*) 

jej  [  j*J 

=  VKgiRiAk)*?  +  v^£  gjRijiVs?  +  n  j(k)  ( 1 2.37) 

fey 

where  we  have  defined  an  equivalent  noise  term 

n'(k)  =  s[F„  £ gjRij(k)s f  +  n,(*)  (1 2.38) 

si3 

that  is  approximately  Gaussian. 

In  a  near-far  environment,  it  becomes  likely  that  the  smaller  signal  channel  gain  and  the 
nonzero  cross-correlation  result  in  the  domination  of  the  (far)  signal  component 

g,7?„/(k).vf 


by  the  strong  (near)  interference 

£sy*v(*>*f 

jzJ 

The  Gaussian  approximation  analysis  of  the  BER  in  Eq.  (12.35)  no  longer  applies. 


Sample  12  3  Consider  a  CDMA  system  with  two  users  (M  =2).  Both  signal  transmission  powers  are  10  mW. 

The  receiver  for  user  1  can  receive  signals  from  both  user  signals.  To  this  receiver,  the  two 

signal  channel  gains  are 
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The  spreading  gain  equals  L  =  128  such  that 


R\,i(k)  =  128  fli.2(*)  =  — 1 


The  sampled  noise  i\\(k)  is  Gaussian  with  zero  mean  and  variance  of  10  6.  Determine  the 
BER  for  the  desired  user  1  signal. 

The  receiver  decision  variable  at  time  k  is 

r*  =  VlO-2  •  10~4  •  128-4°  +  v^lO-2  •  1°~’  •  (-D '  42>  +  niW 

=  1(T2  [0.1284°  ~42)  +  100n  ,(*)j 

For  equally  likely  data  symbols  ±1,  the  BER  of  user  1  is 

Pb  =  0.5  P  [r*  >  0|4°  =  -l]  +  0.5  •  P  [r*  <  0|s[' '  =  l] 

=  P  [rjt  <  0|4°  =  l] 

=  P  j^0.128  —  4°  +  100ni  (k)  <  o] 

Because  of  the  equally  likely  data  symbol  P  1  =  ±1  j  =  0.5,  we  can  utilize  the  total 
probability  theorem  to  obtain 

Pb  =  0.5 P  [o.  1 28  -  s[2)  +  100ni(*)  <  0|s‘2)  =  l] 

+  0.5P  [0.128  -5j‘2)  +  100m (k)  <  0|42)  =  -l] 

=  0.5P  [0.128-  1  +  100n, (k)  <  0)  +  0.5P  [0.128+  1  +  lOOni  (*)  <  0] 

=  0.5 P  [lOOmOt)  <  0.872]  +  0.5P  [100m(it)  <  -1.128] 

=  0.5[1  -  Q  (8.72)]  +  0.50  ( 1 1  -28) 

*=0.5 

Thus,  the  BER  of  the  desired  signal  is  essentially  0.5,  which  means  that  the  desired  user 
is  totally  dominated  by  the  interference  in  this  particular  near-far  environment. 


Power  Control  in  CDMA  he 

Because  the  near-far  problem  is  a  direct  result  of  difference  in  user  signal  powers  at 
receiver,  one  effective  approach  to  overcome  the  near-far  effect  is  to  increase  the  P°w^ 
the  “far”  users  while  decrease  the  power  of  the  “near”  users.  This  power  balancing  appr 
is  known  in  CDMA  as  power  control. 

Power  control  assumes  that  all  receivers  are  collocated.  For  example,  cellular  comm 
cations  take  place  by  connecting  a  number  of  mobile  phones  within  each  cell  to  a  base  sta 
that  serves  the  cell.  All  mobile  phone  transmissions  within  the  cell  are  received  and  ete 
at  the  base  station.  The  transmission  from  a  mobile  unit  to  the  base  station  is  known  as 
uplink  or  reverse  link ,  as  opposed  to  downlink  or  forw  ard  link  when  the  base  station  trans 
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to  a  mobile  user.  It  is  clear  that  the  near-tar  effect  does  not  occur  during  downlink.  In  fact, 
because  multiple  user  transmissions  can  be  perfectly  synchronized,  downlink  CDMA  can  be 
easily  made  synchronous  to  maintain  orthogonality.  Also  at  each  mobile  receiver,  all  signal 
transmissions  have  equal  channel  gain  because  all  originate  from  the  same  base  station.  Neither 
near-tar  condition  can  be  satisfied.  For  this  reason,  CDMA  mobile  users  in  downlink  do  not 
require  power  control  or  other  means  to  combat  strong  MAI. 

When  CDMA  is  used  on  the  uplink  to  enable  multiple  mobile  users  to  transmit  their  signals 
to  the  base  station,  the  near-far  problem  will  often  occur.  By  adopting  power  control,  the  base 
station  can  send  instructions  to  the  mobile  phones  to  increase  or  to  decrease  their  transmission 
powers.  The  goal  is  for  all  user  signals  to  arrive  at  the  base  station  receivers  with  similar 
power  levels  despite  their  different  channel  gains.  In  other  words,  a  constant  value  of  \gi\2Pi 
is  achieved  because  power  control  via  receiver  feedback  provides  instructions  to  the  mobile 
transmitters. 

One  of  the  major  second-generation  cellular  standards,  cdmaOne  (also  known  as  IS-95), 
pioneered  by  Qualcomm,  is  a  DSSS  CDMA  system.  It  applies  power  control  to  overcome  the 
near-far  problem  at  base  station  receivers. 

Power  control  takes  two  forms:  open  loop  and  closed  loop.  Under  open-loop  power  control, 
a  mobile  station  adjusts  its  power  based  on  the  strength  of  the  signal  it  receives  from  the  base 
station.  This  presumes  that  a  reciprocal  relationship  exists  between  forward  and  reverse  links, 
an  assumption  that  may  not  hold  if  the  links  operate  in  different  frequency  bands.  As  a  result, 
closed-loop  power  control  is  often  required  because  the  base  station  can  order  the  mobile 
station  to  change  its  transmitted  power. 

Near-Far  Resistance 

An  important  concept  of  near-far  resistance  was  defined  by  S.  Verdu.10  The  main  objective 
is  to  determine  whether  a  CDMA  receiver  can  overcome  the  MAI  by  simply  increasing  the 
signal-to-noise  ratio  Eb/M.  A  receiver  is  defined  as  near-far  resistant  if,  for  every  user  in  the 
CDMA  system,  there  exists  a  nonzero  y  such  that  no  matter  how  strong  the  interferences  are, 
the  probability  of  bit  error  as  a  function  of  Eb/fl  satisfies 


PuUEh/Af) 


This  means  that  a  near-far  resistant  receiver  should  have  no  BER  floor  as  A  -*■  0.  Our 
analysis  of  the  conventional  matched  filter  receiver  based  even  on  Gaussian  approximation  has 
demonstrated  the  lack  of  near-far  resistance  by  the  conventional  single-user  receiver.  Although 
power  control  alleviates  the  near-far  effect,  it  does  not  make  the  conventional  receiver  near-far 
resistant  To  achieve  near-far  resistance,  we  will  need  to  apply  multiuser  detection  receivers  to 
jointly  detect  all  user  symbols  instead  of  approximating  the  sum  of  interferences  as  additional 


Gaussian  noise. 
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For  MUD,  the  general  assumption  is  that  the  receiver  has  access  to  all  M  signal  samples 
of  Eq.  (12.31).  In  addition,  the  receiver  has  knowledge  of  the  following  information: 

1 .  User  signal  strengths  gi\/P ;. 

2.  Spreading  sequence  cross-correlation 

3.  Statistics  of  the  noise  samples  n,  (k). 

To  explain  the  different  MUD  receivers,  it  is  more  convenient  to  write  Eq.  (12.31)  in 
vector  form: 
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(12.39a) 


We  can  define  the  vectors 


r  k  = 

Sk  = 

r  4°  1 

s<2) 

sk 

nk  = 

ni  (k) 
n2  (k) 

JM) 

L  Tk  J 

AM) 

L  sk  J 

_  n M(k)  _ 

(12.39b) 


We  can  also  define  matrices 


Rk  = 


D  = 


R\.\(k) 

R\,2  {k)  ■ 

R\.M(k) 

Ru(k) 

R22(k)  ■ 

••  R\.M(k) 

RM,m 

Rm,  2(k)  ■ 

R\t.M(k) 

g\\fp\ 

82y/Pl 

(12.39c) 


(I2.39d) 


gM\fP\i 

Then  the  M  output  signal  samples  available  for  MUD  can  be  written  as 

r*  =  Rk  ■  D  ■  sk  +  tv 

Notice  that  the  noise  vector  iv  is  Gaussian  with  zero  mean  and  covariance  matrix  [Eq- 


(12.39c) 


n*  K)r  = 


(12.40) 
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The  goal  of  MUD  receivers  is  to  determine  the  unknown  user  data  vector  s *  based  on  the 
received  signal  vector  value  r*  =  r*.  Based  on  the  system  model  of  Eq.  (12.39),  different  joint 
MUD  receivers  can  be  derived  according  different  criteria. 

To  simplify  our  notation  in  MUD  discussions,  we  denote  A*  as  the  conjugate  of  matrix 
A  and  A  as  the  transpose  of  matrix  A.  Moreover,  we  denote  the  conjugate  transpose  of 
matrix  A  as 


The  conjugate  transpose  of  a  matrix  is  also  known  as  its  Hermitian. 

Optimum  MUD:  Maximum  Likelihood  Receiver 

The  optimum  MUD  based  on  the  signal  model  of  Eq.  (12.39)  is  the  maximum  likelihood 
detector  (MLD)  under  the  assumption  of  equally  likely  input  symbols.  As  discussed  in  Sec.  1 1 .6, 
the  optimum  receiver  with  minimum  probability  of  symbol  error  is  the  MAP  receiver 


sk  =  argmax/?(s*|r*) 


(12.41a) 


If  all  possible  values  of  s*  are  equally  likely,  then  the  MAP  detector  reduces  to  the  maximum 
likelihood  detector  (or  MLD) 


Sk  =  argmax/j(r*|s*) 


(12.41b) 


Because  the  noise  vector  nr  is  jointly  Gaussian  with  zero  mean  and  covariance  matrix  0.5 NRk , 
we  have 


(12.42) 


The  MLD  receiver  can  be  implemented  as 


maxp  (rt  |s*)  <  >  min  (rk  —  Rk  Dsk)^  (rk  RkI)sk) 

St  sk 

«=>  min  Rk~l/2  (fk  ~  Rk  I^Sk) I 


(12.43) 


The  maximum  likelihood  MUD  receiver  is  illustrated  in  Fig.  1 2. 1 3. 

Thus,  the  maximum  likelihood  MUD  receiver  must  calculate  and  compare  the  values  of 


2 


Rk-'l2{rk-RkDsk) 


for  all  possible  choices  of  the  unknown  user  symbol  vector  s* .  It  each  user  uses  1 6-QAM  to 
modulate  its  data,  the  complexity  of  this  optimum  MUD  receiver  requires  16"  evaluations 
of  Eq  ( 12  43)  It  is  evident  that  the  optimum  maximum  likelihood  MUD  has  a  rather  high 
complexity  Indeed,  the  computational  complexity  increases  exponentially  with  the  number  of 
CDMA  users  10  This  is  the  price  paid  for  this  optimum  and  near-far  resistant  CDMA  receiver. 
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Figure  12.13 

Maximum 
likelihood 
multiuser 
detection  (MUD) 
receiver. 


Figure  12.14 

Decorrelator 
MUD  receiver. 


Decorrelator  Receiver 

The  high  complexity  of  the  maximum  likelihood  MUD  receiver  reduces  its  attractiveness 
in  practical  applications.  To  bring  down  the  computational  cost,  several  low-complexity  an 
suboptimum  MUD  receivers  have  been  proposed.  The  decorrelator  MUD  is  a  linear  methc 
that  simply  uses  matrix  multiplication  to  remove  the  MAI  among  different  users.  Base  on 
Eq.  ( 1 2.39),  the  MAI  among  different  users  is  caused  by  the  nondiagonal  correlation  matrix  *• 
Thus,  the  MAI  effect  can  be  removed  by  premultiplying  r*  with  the  pseudoinverse  of  k t0 
“decorrelate”  the  user  signals. 


R : 


rk  =  Dsk  +RkX  ■  nk 


(12.44) 


This  decorrelating  operation  leaves  only  the  noise  term  Rk  1  nj  that  can  affect  the  user  signal 
A  QAM  hard-decision  device  can  be  applied  to  detect  the  user  symbols 

sk  =  dec(fl~‘  r*)  (1'4'> 

Figure  1 2. 14  is  the  block  diagram  of  a  decorrelator  MUD  receiver.  Since  the  major°?^| 
tion  of  a  decorrelating  MUD  receiver  lies  in  the  matrix  multiplication  of  Rk  1 .  the  compute  i 
complexity  increases  only  in  the  order  of  0(M2).  The  decorrelator  receiver  is  near-tar  res 
as  detailed  by  Lupas  and  Verdti.1 1 


Djt 


In 


Minimum  Mean  Square  Error  (MSE)  Receiver 

The  drawback  of  the  decorrelator  MUD  receiver  lies  in  the  noise  transformation  by  Rk 
fact,  when  the  correlation  matrix  Rk  is  ill  conditioned,  the  noise  transformation  has  the  nec  ^  ^ 
effect  of  noise  amplification.  To  mitigate  this  risk,  a  different  and  more  robust  M  ^ 
to  minimize  the  mean  square  error  by  applying  a  good  linear  MUD  receiver  by  tin  1 


optimum  matrix  Gk: 
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min  £{||s*  -Gr*||2}  (12.46) 

This  G  still  represents  a  linear  detector.  Once  G  has  been  determined,  the  MUD  receiver  simply 
takes  a  hard  decision  on  the  linearly  transformed  signal,  that  is, 

h  =  dec  (G  r*)  (12.47) 

The  optimum  matrix  G  can  be  determined  by  applying  the  principle  of  orthogonality 
[Eq.  (8.84),  Sec.  8.5].  The  principle  of  orthogonality  requires  that  the  error  vector 

sk  -  G  rk 

be  orthogonal  to  the  received  signal  vector  r* .  In  other  words. 


(sk-G  r*)if=0  (12.48) 

Thus,  the  optimum  receiver  matrix  G  can  be  found  as 

G=sk  if  [r*  if  j  '  (12.49) 

Because  the  noise  vector  n*  and  the  signal  vectors*  are  independent. 


=  0 


is  their  cross-correlation. 

In  addition,  we  have  earlier  established  equalities 

sksk  =  / MxM  n*  if  =  ~  Rk 

where  we  use  1m  xm  to  denote  the  M  x  M  identity  matrix.  Hence,  we  have 

/vf  =  RkDDHRHk  +  jRk  ( 1 2.50a) 

7^  =  DHRHk  (12.50b) 

The  optimum  linear  receiver  matrix  is  therefore 

Gk  =DHRHk  {RkDDHRHk  +  y Rk  )  (12.51) 

It  is  clear  that  when  the  channel  noise  is  zero  (i.e.,7V  =  0).  then  the  optimum  matrix  given  by 
Eq.  ( 1 2.5 1 )  degenerates  into 

Gk=DHRHk  (RkDD"R?y'  =  ( RkD)~ 1 

which  is  essentially  the  decorrelator  receiver. 

The  MMSE  linear  MUD  receiver  is  shown  in  Fig.  12.15.  Similar  to  the  decorrelator 
receiver,  the  major  computational  requirement  comes  from  the  matrix  multiplication  of  Gk. 
The  MMSE  linear  receiver  is  also  near-far  resistant.1 1 
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Figure  12.15 

Minimum  mean 
square  error 
MUD  receiver. 


Decision  Feedback  Receiver 

We  note  that  both  the  decorrelator  and  the  MMSE  MUD  receivers  apply  linear  matrix  pro¬ 
cessing.  Hence,  they  are  known  as  linear  receivers  with  low  complexity.  On  the  other  hand, 
the  optimum  MUD  receiver  is  nonlinear  but  requires  much  higher  complexity.  There  is  also  a 
very  popular  suboptimum  receiver  that  is  nonlinear.  This  method  is  based  on  the  concept  of 
successive  interference  cancellation,  known  as  the  decision  feedback  MUD  receiver. 

The  main  motivation  behind  the  decision  feedback  MUD  receiver  lies  in  the  fact  that  in 
a  near-far  environment,  not  all  users  suffer  equally.  In  a  near-far  environment,  the  stronger 
signals  are  actually  winners,  whereas  the  weaker  signals  are  losers.  In  fact,  when  a  particular 
user  has  a  strength  \fP~igi  that  is  stronger  than  those  of  all  other  users,  its  conventional  matched 
filter  receiver  can  in  fact  deliver  better  performance  than  is  possible  in  an  environment  of  equal 
strength.  Hence,  it  would  make  sense  to  rank  the  received  users  in  the  order  of  their  individual 
strength  measured  by  {Pig}).  The  strongest  user  QAM  symbols  can  then  be  detected  first,  using 
only  the  conventional  matched  filter  receivers  designed  for  single  users.  Once  the  strongest 
user  symbols  is  known,  its  interference  effects  on  the  remaining  user  signals  can  be  canceled. 
By  canceling  the  strongest  user  symbol  from  the  received  signal  vectors,  there  are  only  M  -  1 
unknown  user  symbols  for  detection.  Among  them,  the  next  strongest  user  signal  can  be 
detected  more  accurately  after  the  strongest  interference  has  been  removed.  Hence,  its  effect 
can  also  subsequently  be  canceled  from  received  signals,  to  benefit  the  M  —  2  remaining  user 
symbols,  and  so  on.  Finally,  the  weakest  user  signal  will  be  detected  last,  after  all  the  MAI  has 
been  canceled. 

Clearly,  the  decision  feedback  MUD  receiver  relies  on  the  successive  interference  can 
cellation  of  stronger  user  interferences  for  the  benefit  of  weaker  user  signal  detection.  For  this 
reason,  the  decision  feedback  MUD  receiver  is  also  known  as  the  successive  interference  can¬ 
cellation  (SIC)  receiver.  The  block  diagram  of  the  decision  feedback  MUD  receiver  appeal 
in  Fig.  12.16.  Based  on  Eq.  (12.31),  the  following  steps  summarize  the  SIC  receiver. 


Decision  Feedback  MUD 


Step  1.  Rank  all  user  signal  strengths  {Pig}).  Without  loss  of  generality,  we  assume 


that 


8\  >  Pig\  >  >  PM-\gh-\  >  Pm  Zm 


Let 


y\,]  =  rk] 
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Figure  12.16 

Decision  feed¬ 
back  MUD 
receiver  based 
on  successive 
interference 
cancellation 
(assuming  that 
all  M  users  are 
ranked  in  the 
order  of 
descending 
gains). 


<2) 


and 


£=  1 

Step  2.  Detect  the  Ith  (strongest)  user  symbol  via 

= dec  (>r) 

Step  3.  Cancel  the  first  (strongest)  user  interference  from  the  received  signals 

>{+i  =  -  gt\fPtRi,t(k)s^ ’  i  =  e  +  i . m 

Step  4.  Let  i  =  t  +  1  and  repeat  step  2  until  l  =  M . 


A  decision  feedback  MUD  receiver  requires  very  little  computation,  since  the  interference 
cancellation  step  requires  only  0(M~)  complexity.  It  is  a  very  sensible  and  low-complexity 
receiver.  Given  correct  symbol  detection,  strong  interference  cancellation  from  received  weak 
signals  completely  eliminates  the  near-far  problem.  The  key  drawback  or  weakness  of  the 
decision  feedback  receiver  lies  in  the  effect  of  error  propagation.  Error  propagation  takes  place 
when,  in  step  2,  a  user  symbol  is  detected  incorrectly.  As  a  result,  this  erroneous  symbol 
used  in  the  interference  cancellation  of  step  3  may  in  fact  strengthen  the  MAI.  This  leads  to 
the  probability  of  more  decision  errors  of  the  subsequent  user  symbol,  '  ",  which  in  turn 
can  cause  more  decision  errors.  Analysis  on  the  effect  of  error  propagation  can  be  found  in 
Refs.  14  and  15. 


12.8  MODERN  PRACTICAL  DSSS  CDMA  SYSTEMS 

Since  the  1990s.  many  important  commercial  applications  have  emerged  for  spread  spectrum, 
including  cellular  telephones,  personal  communications,  and  position  location.  Here  we  discuss 
several  popular  applications  of  CDMA  technology  to  illustrate  the  benefits  of  spread  spectrum. 

1 2.8. 1  CDMA  in  Cellular  Phone  Networks 

Cellular  Networks 

The  cellular  network  divides  a  service  area  into  smaller  geographical  cells  ( Fig.  1 2. 1 7).  Each 
cell  has  a  base  station  tower  to  connect  with  mobile  users  it  serves.  All  base  stations  are  wired 
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Telephone  central 
office 


to  the  mobile  telephone  switching  office  (MTSO),  which  in  turn  is  wired  to  the  telephone 
central  office.  A  caller  communicates  via  radio  channel  to  its  base  station,  which  sends  the 
signal  to  the  MTSO.  The  MTSO  connects  to  the  receiver  either  via  the  land-based  telephone 
system  or  via  another  base  station.  As  the  caller  moves  from  one  cell  to  another,  a  handoff 
process  takes  place.  During  handoff,  the  MTSO  automatically  switches  the  user  to  an  available 
channel  in  the  new  cell  while  the  call  is  in  progress.  The  handoff  is  so  rapid  that  users  usually 
do  not  notice  it. 

The  true  ingenuity  of  the  cellular  network  lies  in  its  ability  to  reuse  the  same  frequency 
band  in  multiple  cells.  Without  cells,  high-powered  transmitters  can  be  used  to  cover  an  entire 
city.  But  this  would  allow  a  frequency  channel  to  be  used  only  by  one  user  in  the  city  at  any 
moment.  This  posed  serious  limitations  on  the  number  of  channels  and  simultaneous  users. 
The  limitation  is  overcome  in  the  cellular  scheme  by  reusing  the  same  frequencies  in  all  the 
cells  except  those  immediately  adjacent.  This  is  possible  because  the  transmitted  powers  are 
kept  small  enough  to  prevent  the  signals  from  one  cell  from  reaching  beyond  the  immediately 
adjacent  cells.  We  can  accommodate  any  number  of  users  by  increasing  the  number  of  cells  as 
we  reduce  the  cell  size  and  the  power  levels  correspondingly. 

The  1G  (first-generation)  analog  cellular  schemes  use  audio  signal  to  modulate  an 
signal  with  transmission  bandwidth  30  kHz.  This  wideband  FM  signal  results  in  a  good  SN 
but  is  highly  inefficient  in  bandwidth  usage  and  frequency  reuse.  The  2G  (second-generation 
cellular  systems  are  all  digital.  Among  them,  the  GSM  and  cdmaOne  are  two  of  the  most  wide  y 
deployed  cellular  systems.  GSM  adopts  a  TDMA  technology  through  which  eight  users  share 
a  200  kHz  channel.  The  competing  technology  of  cdmaOne  (known  earlier  as  IS-95)  is  a 
system. 


Why  CDMA  in  Cellular  Systems?  j 

Although  spread  spectrum  is  inherently  well  suited  against  narrowband  interferences  ai 
affords  a  number  of  advantages  in  the  areas  of  networking  and  handoff,  the  key  charact 
istic  underlying  the  broad  application  of  CDMA  for  wireless  cellular  systems  is  the  Potent1^ 
for  improved  spectral  utilization.  The  capacity  for  improvement  has  two  key  sources.  ^ 
the  use  of  CDMA  allows  improved  frequency  reuse.  Narrowband  systems  cannot  use  ^  *a 
transmission  frequency  in  adjacent  cells  because  of  the  potential  for  interference.  CDM 
inherent  resistance  to  interference.  Although  users  using  different  spreading  codes  fr0™  a  . 
cent  cells  will  contribute  to  the  total  interference  level,  their  contribution  will  be  siguinc'  - 
less  than  the  interference  from  the  same  cell  users.  This  leads  to  a  much  improved  trequ 
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Figure  12.18 

RF  bandwidth 
requirements  for 
IS-95  uplink  and 
downlink. 


Base  station 


Figure  12.19 

Forward  link 
modulation  and 
Walsh  code 
spreading  of 
cdmaOne 
(IS-95). 


reuse  efficiency.  In  addition,  CDMA  provides  better  overall  capacity  when  the  data  traffic  load 
is  dynamic.  This  is  because  users  in  a  lightly  loaded  CDMA  system  would  have  a  lower  inter¬ 
ference  level  and  better  performance,  whereas  TDM  A  users  with  fixed  channel  bandwidth  do 
not  enjoy  such  benefit. 

CDMA  Cellular  System:  cdmaOne  (IS-95) 

The  first  commercially  successful  CDMA  system  in  cellular  applications  was  developed  by  the 
Electronic  Industries  Association  (EIA)  as  interim  standard-95  (IS-95).  Now  under  the  official 
name  of  cdmaOne.  it  employs  DSSS  by  adopting  1 ,2288-Mchip/s  spreading  sequences  on 
both  uplink  and  downlink.  The  uplink  and  downlink  transmissions  both  occupy  1.25  MHz  of 
RF  bandwidth,  as  illustrated  in  Fig.  12.18. 

The  QCELP  (Qualcomm  code-excited  linear  prediction)  vocoder  is  used  for  voice  encod¬ 
ing.  Since  the  voice  coder  exploits  gaps  and  pauses  in  speech,  the  data  rate  is  variable  from 
1 .2  to  9.6  kbit/s.  To  keep  the  symbol  rate  constant,  whenever  the  bit  rate  falls  below  the  peak 
bit  rate  of  9.6  kbit/s,  repetition  code  is  used  to  fill  the  gaps.  For  example,  if  the  output  of 
the  voice  coder  (and  subsequently  the  convolutional  coder)  falls  to  2.4  kbit/s,  the  output  is 
repeated  three  more  times  before  it  reaches  the  interleaver.  The  transmitter  of  cdmaOne  takes 
advantage  of  this  repetition  time  by  reducing  the  output  power  during  three  out  of  the  four 
identical  symbols  by  at  least  20  dB.  In  this  way,  the  multiple-access  interference  is  diminished. 
This  “voice  activity  gating”  reduces  MAI  and  increases  overall  system  capacity. 

The  modulation  of  cdmaOne  uses  QPSK  on  the  downlink,  and  the  uplink  uses  a  variant  of 
QPSK  known  as  the  offset  QPSK  (or  OQPSK).  There  are  other  important  differences  between 
the  forward  and  reverse  links.  Figure  12.19  outlines  the  basic  operations  of  spreading  and 
modulation  on  the  forward  link.  After  a  rate  1/2  convolutional  error  correction  code,  the  voice 
data  becomes  19.2  kbit/s.  Interleaving  then  shuffles  the  data  to  alleviate  burst  error  effects, 
and  long-code  scrambling  provides  some  nominal  privacy  protection.  The  data  rate  remains 
at  19.2  kbit/s  before  being  spread  by  a  length  64  Walsh-Hadamard  short-code  to  result  in  a 
sequence  of  rate  1.2288  Mbit/s.  Because  forward  link  uses  synchronous  transmissions,  in  the 
absence  of  channel  distortions,  there  can  be  as  many  as  64  orthogonal  data  channels,  each  using 
a  distinct  Walsh-Hadamard  code.  Both  the  in-phase  (I)  and  the  quadrature  (Q)  components  of 
the  QPSK  modulations  carry  the  same  data  over  the  1 .25  MHz  bandwidth,  although  different 
masking  codes  are  applied  to  I  and  Q. 

The  performance  of  the  reverse  link  is  of  greater  concern  for  two  reasons.  First,  as  dis¬ 
cussed  earlier  the  reverse  link  is  subject  to  near-far  effects.  Second,  since  all  transmissions 
on  the  forward  link  originate  at  the  same  base  station,  it  uses  the  orthogonal  Walsh-Hadamard 
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spreading  codes  to  generate  synchronous  signals  with  zero  cross-correlation.  Reverse-link  does 
not  enjoy  this  luxury.  For  this  reason,  more  powerful  error  correction  (rate  1/3)  is  employed 
on  the  reverse  link.  Still,  like  the  forward  link,  the  raw  QCELP  vocoder  bit  rate  is  9.6  kbit/s, 
which  is  eventually  spread  to  1.2288  Mchip/s  over  a  1.25  MHz  bandwidth. 

As  mentioned  earlier,  the  near-far  problem  needs  to  be  addressed  when  spread  spectrum  is 
utilized  in  mobile  communications.  To  combat  this  problem,  1S-95  uses  power  control.  On  the 
forward  link  there  is  a  subchannel  for  power  control  purposes.  Every  1 .25  ms,  the  base  station 
receiver  estimates  the  signal  strength  of  the  mobile  unit.  It  it  is  too  high,  the  base  transmits  a 
1  on  the  subchannel.  If  it  is  too  low,  it  transmits  a  0.  In  this  way,  the  mobile  station  adjusts  its 
power  based  on  the  800  bit/s  power  control  signal  to  reduce  interference  to  other  users. 


3G  Cellular  Services16  l 

In  the  new  millennium,  wireless  service  providers  are  shifting  their  voice-centric  2G  cellular 
systems  to  the  next-generation  (3G)  wireless  systems,  which  are  capable  of  supporting  high¬ 
speed  data  transmission  and  internet  connection.  For  this  reason,  the  International  Mobile 
Telecommunications-2000  standard  (IMT-2000)  is  the  global  standard  for  third-generation 
wireless  communications.  IMT-2000  provides  a  framework  for  worldwide  wireless  access  of 
fixed  and  mobile  wireless  access  systems.  The  goal  is  to  provide  wireless  cellular  coverage  up 
to  144  kbit/s  for  high-speed  mobile,  384  kbit/s  for  pedestrian,  and  2.048  Mbit/s  for  indoor  users. 
Among  the  3G  standards,  there  are  three  major  wireless  technologies  based  on  CDMA  DSSS. 
namely,  the  two  competing  versions  of  wideband  CDMA  from  the  3rd  Generation  Partnership 
Project  (3GPP)  and  the  3rd  Generation  Partnership  Project  2  (3GPP2),  plus  the  TD-SCDMA 
from  the  3GPP  for  China. 

Because  3G  cellular  systems  continue  to  use  the  existing  cellular  band,  a  high  data  rate 
for  one  user  means  a  reduction  of  service  for  other  active  CDMA  users  within  the  same  cell. 
Otherwise,  given  the  limited  bandwidth,  it  is  impossible  to  serve  the  same  number  of  active 
users  as  in  cdmaOne  while  supporting  data  rate  as  high  as  2.048  Mbit/s.  Thus,  the  data  rate 
to  and  from  the  mobile  unit  must  be  variable  according  to  the  data  traffic  intensity  within  the 
cell.  Since  most  data  traffic  patterns  (including  internet  usage)  tend  to  be  bursty,  variable  rate 
data  service  offered  by  3G  cellular  is  suitable  for  such  applications. 

Unlike  FDMA  and  TDMA,  CDMA  provides  a  perfect  environment  for  variable  data  rate 
and  requires  very  simple  modifications.  While  FDMA  and  TDMA  would  require  grouping 
multiple  frequency  bands  or  time  slots  dynamically  to  support  variable  rate,  CDMA  needs  to 
change  only  the  spreading  gain.  In  other  words,  at  higher  data  rates,  a  CDMA  transmitter  can 
use  a  lower  spreading  factor.  In  this  mode,  its  MAI  to  other  users  is  high,  and  fewer  such  users 
can  be  accommodated.  At  lower  data  rates,  the  transmitter  uses  a  larger  spreading  factor, 
allowing  more  users  to  transmit. 

In  3GPP2s  CDMA2000  standard,  there  are  two  ratio  transmission  modes:  lxRTTuti  \i^ 
ing  one  1 .25  MHz  band  and  3xRTT  that  aggregates  three  1 .25  MHz  bands.  On  1  xRTT  forW^ 
link,  the  maximum  data  rate  is  307.2  kbit/s  with  a  spreading  gain  of  4.  Thus,  the  chip  ra 
is  still  1.2288  Mchip/s.  A  more  recent  3GPP2  release  is  called  CDMA  2000  lx  EV-DO  r^ 
sion  A,  where  EV-DO  stands  for  “evolution  data-optimized.”  It  can  support  a  peak  data 
of  3.1  Mbit/s  on  the  forward  link  of  1.25  MHz  bandwidth.  It  does  so  by  applying  a  *P ^ 
coding  and  adaptive  modulations,  including  QPSK,  8-PSK,  and  16-QAM.  At  the  pea  ra 
the  spreading  gain  is  1  (i.e.,  no  spreading).  a?oOO. 

At  the  same  time,  the  WCDMA  by  3GPP  applies  similar  ideas.  Unlike  CDM  , 
WCDMA  has  a  standard  bandwidth  of  5  MHz.  When  spreading  is  used,  the  chip  rate  is  ^ 
Mchip/s.  On  downlink,  the  variable  spreading  factor  of  3GPP  WCDMA  ranges  from  5 
With  QPSK  modulation,  this  provides  a  variable  data  rate  from  16  kbit/s  to  2.048  c 
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Similar  to  CDMA2000, 3GPP  WCDMA  also  has  a  counterpart  to  EV-DO  known  as  high-speed 
packet  access  (HSPA).  On  downlink,  the  recent  HSPA  release  (Release  6)  achieves  the  peak 
rate  of  14.4  Mbit/s.  However,  existing  deployments  can  support  a  peak  rate  of  only  7.2  Mbit/s. 
Still,  at  this  high  rate,  most  data  users  would  be  quite  satisfied,  with  the  exception  perhaps  of 
high-definition  TV  viewers. 

Power  Control  vs.  MUD 

It  is  interesting  to  note  that  despite  intense  academic  research  interest  in  multiuser  CDMA 
receivers  (in  the  1980s  and  1990s),  all  cellular  CDMA  systems  described  here  rely  on  power 
control  to  combat  the  near-far  problem.  The  reason  lies  in  the  fact  that  power  control  is  quite 
simple  to  implement  and  has  proven  to  be  very  effective.  On  the  other  hand,  MUD  receivers 
require  more  computational  complexity.  To  be  effective,  MUD  receivers  also  require  too  much 
channel  and  signal  information  about  all  active  users.  Moreover,  MUD  receivers  alone  cannot 
completely  overcome  the  disparity  of  performance  in  a  near-far  environment. 

12.8.2  CDMA  in  the  Global  Positioning  System  (GPS) 

What  Is  GPS? 

The  Global  Positioning  System  (GPS)  is  the  only  fully  functional  global  satellite  navigation 
system.  Utilizing  a  constellation  of  at  least  24  satellites  in  medium  Earth  orbit  to  transmit 
precise  RF  signals,  the  system  enables  a  GPS  receiver  to  determine  its  location,  speed,  and 
direction. 

A  GPS  receiver  calculates  its  position  based  on  its  distances  to  three  or  more  GPS  satellites. 
Measuring  the  time  delay  between  transmission  and  reception  of  each  GPS  microwave  signal 
gives  the  distance  to  each  satellite,  since  the  signal  travels  at  a  known  speed.  The  signals  also 
carry  information  about  the  satellites’  location.  By  determining  the  position  of,  and  distance  to, 
at  least  three  satellites,  the  receiver  can  compute  its  position  using  triangularization.  Receivers 
typically  do  not  have  perfectly  accurate  clocks  and  therefore  track  one  or  more  additional 
satellites  to  correct  the  receiver’s  clock  error. 

Each  GPS  satellite  continuously  broadcasts  its  (navigation)  message  via  BPSK  at  the  rate 
of  50  bit/s.  This  message  is  transmitted  by  means  of  two  CDMA  spreading  codes:  one  for  the 
coarse/acquisition  (C/A)  mode  and  one  for  the  precise  (P)  mode  (encrypted  lor  military  use). 
The  C/A  spreading  code  is  a  PN  sequence  with  period  of  1023  chips  sent  at  1.023  Mchip/s. 
The  spreading  gain  is  L  =  20,460.  Most  commercial  users  access  only  the  C/A  mode.* 

Originally  developed  for  the  military,  GPS  is  now  finding  many  uses  in  civilian  life  such  as 
marine,  aviation,  and  automotive  navigation,  as  well  as  surveying  and  geological  studies.  GPS 
allows  a  person  to  determine  the  time  and  the  person’s  precise  location  (latitude,  longitude,  and 
altitude)  anywhere  on  earth  with  an  accuracy  of  inches.  The  person  can  also  find  the  velocity 
with  which  he  or  she  is  moving.  GPS  receivers  have  become  small  and  inexpensive  enough  to 
be  carried  by  just  about  everyone  in  cars  and  boats.  Handheld  GPS  receivers  arc  plentiful  and 
have  even  been  incorporated  into  popular  cellular  phone  units. 

How  Does  GPS  Work? 

A  GPS  receiver  operates  by  measuring  its  distance  from  a  group  of  satellites  in  space,  which 
are  acting  as  precise  reference  points.  Since  the  GPS  system  consists  of  24  satellites,  there  will 
always  be  more  than  four  orbiting  bodies  visible  from  anywhere  on  Earth.  The  24  satellites 


*  The  P  spreading  code  rate  is  10.23  Mchip/s  with  a  spreading  gain  of  L  =  204.600.  The  Pcode  period  is 
6.1871  x  I012  bits  long.  In  fact.  a.  the  chip  rate  of  10.23  Mchip/s.  the  code  period  ,s  one  week  long! 
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are  located  in  six  orbital  planes  at  a  height  of  22,200  km.  Each  satellite  circles  the  earth  in 
12  hours.  The  satellites  are  constantly  monitored  by  the  U.S.  Department  of  Defense,  which 
knows  their  exact  locations  and  speeds  at  every  moment.  This  information  is  relayed  back  to 
the  satellites.  All  the  satellites  have  atomic  clocks  of  unbelievable  precision  on  board  and  are 
synchronized  to  generate  the  same  PN  code  at  the  same  time.  The  satellites  are  continuously 
transmitting  this  PN  code  and  the  information  about  their  locations  and  time.  A  GPS  receiver 
on  the  ground  is  also  generating  the  same  PN  code,  although  not  in  synchronism  with  that  of 
the  satellites.  This  is  because  of  the  necessity  to  make  GPS  receivers  inexpensive.  Hence,  the 
timing  of  the  PN  code  generated  by  the  receiver  will  be  off  by  an  amount  of  a  seconds  (timing 
bias)  from  that  of  the  PN  code  of  the  satellites. 

To  begin,  let  us  assume  that  the  timing  bias  a  =  0.  By  measuring  the  time  delay  between 
its  own  PN  code  and  that  received  from  one  satellite,  the  receiver  can  compute  its  distance 
d  from  that  satellite.  This  information  places  the  receiver  anywhere  on  a  sphere  of  radius  d 
centered  at  the  satellite  location  (which  is  known),  as  shown  in  Fig.  1 2.20a.  Simultaneous 
measurements  from  three  satellites  place  the  receiver  on  the  three  spheres  centered  at  the  three 
known  satellite  locations.  The  intersection  of  two  spheres  is  a  circle  (Fig.  12.20b).  and  the 
intersection  of  this  circle  with  the  third  sphere  narrows  down  the  location  to  just  two  points,  as 
shown  in  Fig.  12.20c.  One  of  these  points  is  the  correct  location.  But  which  one  ?  Fortunately, 
one  of  the  two  points  would  give  a  ridiculous  answer.  The  incorrect  point  may  not  be  on  Earth, 
or  it  may  indicate  an  impossibly  high  receiver  velocity.  The  computer  in  a  GPS  receiver  has 
various  techniques  for  distinguishing  the  correct  point  from  the  incorrect  one. 

In  practice,  the  timing  bias  a  is  not  zero.  To  solve  this  problem,  we  need  a  distance 
measurement  from  a  fourth  satellite.  A  user  locates  his  or  her  position  by  receiving  the  signal 
from  four  of  the  possible  24  satellites,  as  shown  in  Fig.  12.20d.  There  are  tour  unknowns,  the 
coordinates  in  the  three-dimensional  space  of  the  user  along  with  a  timing  bias  in  the  users 
receiver.  These  four  unknowns  can  be  solved  by  using  four  range  equations  to  each  ot  the  tour 
satellites. 

Since  DSSS  signals  consist  of  a  sequence  of  extremely  short  pulses,  it  is  possible  to 
measure  their  arrival  times  accurately.  The  GPS  system  can  result  in  accuracies  ot  10  meters 
anywhere  on  Earth.  The  use  of  differential  GPS  can  provide  accuracy  within  centimeters-  11 
this  case  we  use  one  terrestrial  location  whose  position  is  known  exactly.  Comparison  o 
known  coordinates  with  those  read  by  a  GPS  receiver  (for  the  same  location)  gives  us  the  error 
(bias)  of  the  GPS  system,  which  can  be  used  to  correct  the  errors  of  GPS  measurements  ot  ot  or 
locations.  This  is  based  on  the  fact  that  satellite  orbits  are  so  high  that  any  errors  measure 
by  one  receiver  will  be  almost  exactly  the  same  for  any  other  receiver  in  the  same  o  ^ 
Differential  GPS  is  currently  used  in  such  diverse  applications  as  surveying,  laying  Pelr0  e 
pipelines,  aviation  systems,  marine  navigation  systems,  and  preparing  highly  accurate 
of  everything  from  underground  electric  cabling  to  power  poles. 


Why  Spread  Spectrum  in  GPS?  (rp|)1 

The  use  of  spread  spectrum  in  the  GPS  system  accomplishes  three  tasks.  First,  the  signa  * 
the  satellites  can  be  kept  from  unauthorized  use.  Second,  and  more  important  in  a  pra^  ^ 
sense,  the  inherent  processing  gain  of  spread  spectrum  allows  reasonable  power  leve  s 
used.  Since  the  cost  of  a  satellite  is  proportional  to  its  weight,  it  is  desirable  to  reduce  t  e  P 
required  as  much  as  possible.  In  addition,  since  each  satellite  must  see  an  entire  hennsp  ^ 
very  little  antenna  gain  is  possible.  For  high  accuracy,  short  pulses  are  required  to 
fine  resolution.  This  results  in  high  spectrum  occupancy  and  a  received  signal  that  is 
decibels  below  the  noise  floor.  Since  range  information  needs  to  be  calculated  only  a  0  ^ 

every  second,  the  data  bandwidth  need  be  only  about  100  Hz.  This  is  a  natural  ll,a 
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Figure  12.20 

(a)  Receiver 
location  from 
one  satellite 
measurement. 

(b)  Location 
narrowed  down 
by  two  satellite 
measurements. 

(c)  Location 
narrowed  down 
by  three  satellite 
measurements. 

(d)  Practical 
global  pos¬ 
itioning  system 
using  four 
satellites. 

(e)  Block  dia¬ 
gram  of  a  GPS 
receiver. 


The  user  is  somewhere 
on  this  sphere 


(a) 


Somewhere  on 
this  circle 


(b) 


At  one  of  these 
two  points 


(c) 


(e) 


spread  spectrum.  Despreading  the  received  signal  in  the  recetver.  m  turn,  yields  a  s.gn.f, can. 
processing  gain,  thus  allowing  good  reception  at  reasonable  power  levels.  The  th.rd  reason  for 
sold  spe^rum  is  that  each  satellite  can  use  the  same  frequency  band,  yet  there  ,s  no  mutual 

interference  owing  to  the  near  orthogonality  of  each  users  signal. 

Each  satellite  circles  the  earth  in  1 2  hours  and  emits  two  PN  sequences  modulated  m  phase 
quadrature  at  two  frequencies.  Two  frequencies  are  needed  to  correct  for  the  delay  introduced 

by  the  ionosphere. 


12  8  3  IEEE  802.1 1  b  Standard  for  Wireless  LAN 

IEEE  802  1  lb  is  a  commercial  standard  developed  for  wireless  local  area  networks  (WLAN, 
to  provide  high-speed  wireless  connection  to  (typically)  laptop  computers. 
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Like  its  predecessor  IEEE  802. 1 1 ,  IEEE  802. 1 1  b  operates  in  the  license-free  ISM  band  of 
2.4  to  2.4835  GHz.  Similar  to  cellular  networks,  all  laptop  computers  within  a  small  coverage 
area  form  1  -to- 1  communication  links  with  an  “access  point."  The  access  point  is  typically 
connected  to  the  Internet  via  a  high-speed  connection  that  can  deliver  the  traffics  to  and  from 
laptop  computers.  In  this  way,  the  access  point  serves  as  a  bridge  between  the  computers  and 
the  Internet. 

The  ISM  band  is  populated  with  signals  from  many  unlicensed  wireless  devices  such  as 
microwave  ovens,  baby  monitors,  cordless  phones,  and  wireless  controllers.  Hence,  to  transmit 
WLAN  data,  interference  resistance  against  these  unlicensed  transmission  is  essential.  For  this 
reason,  spread  spectrum  is  a  very  effective  technology. 

The  simple  FSK  used  in  the  FHSS  IEEE  802.1 1  provides  up  to  2  Mbit/s  data  rate  and 
is  simple  to  implement.  Still,  the  link  data  rate  is  quite  low.  Because  the  laptop  is  a  relative 
powerful  device  capable  of  supplying  moderate  levels  of  power  and  computation,  it  can  support 
more  complex  and  faster  modulation.  IEEE  802.11b  eliminates  the  FHSS  option  and  fully 
adopts  the  DSSS  transmission.  It  pushes  the  data  rate  up  to  11  Mbit/s,  which  is  reasonably 
satisfactory  to  most  computer  connections. 

Internationally,  there  are  14  DSSS  channels  defined  over  the  ISM  band,  although  not  all 
channel  are  available  in  every  country.  In  North  America,  there  are  1 1  (overlapping)  channels 
of  bandwidth  22  MHz.  The  channel  spacing  is  5  MHz.  Table  12.2  illustrates  the  11  DSSS 
channels. 

The  chip  rate  of  IEEE  802.1  lb  is  11  MHz,  and  the  spread  spectrum  transmission  band¬ 
width  is  approximately  25  MHz.  The  802.1  lb  data  rate  can  be  1,  2,  5.5,  and  1 1  Mbit/s.  For 
1  and  2  Mbit/s  data  rates,  differential  BPSK  and  differential  QPSK  are  used,  respectively. 
At  high  data  rates  of  5.5  and  1 1  Mbit/s,  a  more  sophisticated  complementary  code  keying 
(CCK)  was  developed.  The  link  data  rate  is  established  based  on  how  good  the  channel 
condition  is.  The  different  spreading  gains  for  the  802.11b  DSSS  modulation  are  given  in 
Table  12.3. 

Note  that  each  access  point  may  serve  multiple  links.  Additionally,  there  may  be  more 
than  one  access  point  at  a  given  area.  To  avoid  spectral  overlap,  different  network  links  must 
be  separated  by  a  minimum  of  five  channel  numbers.  For  example,  channel  1,  channel  6, 
and  channel  1 1  can  coexist  without  mutual  interference.  Often,  a  neighborhood  may  be  very 


TABLE  12.2 

2.4 GHz  ISM  Channel  Assignment  in  IEEE  802.1  lb 


Channel 

1  2 

3 

4 

5  6 

7 

8 

9 

10 

11 

Center /, 
GHz 

2.412  2.417 

2.422 

2.427 

2.432  2.437 

2.442 

2.447 

2.452 

2.457 

2.462 

TABLE  12.3 

Modulation  Format  and  the  Spreading  Factor 
in  IEEE  802. 1  1  b  Transmission 


Chip  rate 

11  MHz 

Data  rate 

Modulation 
Spreading  gain 

I  Mbit/s 

Differential  BPSK 

II 

2  Mbit/s 

Differential  QPSK 
11 

5.5  Mbit/s 
CCK 

2 

1 1  Mbit/s 
CCK 

1 
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Figure  12.21 

A  wireless  LAN 
with  one  access 
point  and  four 
computer  nodes. 


congested  with  multiple  network  coverage.  Thus,  spectral  overlapping  becomes  unavoidable. 
When  different  networks  utilize  spectrally  overlapping  channels,  signal  collisions  may  take 
place.  Data  collisions  are  not  resolved  by  radio  transmitters  and  receivers  (physical  layer). 
Rather,  network  protocols  are  developed  to  force  all  competing  networks  and  users  to  back  off 
(i.e.,  to  wait  for  a  timer  to  expire  before  transmitting  a  finite  data  packet).  In  802. 1 1  WLAN, 
the  timer  is  set  to  a  random  value  based  on  a  traffic-dependent  uniform  distribution.  This 
backoff  protocol  to  resolve  data  collisions  in  WLAN  is  known  as  the  distributed  coordinator 

function  (DCF). 

To  allow  multiple  links  to  share  the  same  channel,  DCF  forces  each  link  to  vacate  the 
channel  for  a  random  period  of  time.  This  means  that  the  maximum  data  rate  of  1 1  Mbit/s 
cannot  be  achieved  by  any  of  the  competing  users.  As  shown  in  Fig.  1 2.2 1 ,  the  two  computers 
both  using  channel  1 1  to  connect  to  the  access  point  must  resort  to  DCF  to  reduce  their  access 
time  and  effectively  lower  their  effective  data  rate.  In  this  case,  perfect  coordination  would 
be  able  to  allocate  1 1  Mbit/s  equally  between  the  two  users.  This  idealistic  situation  is  really 
impossible  under  the  distributed  protocol  of  DCF.  Under  DCF,  the  maximum  throughput  of 

either  user  would  be  much  lower  than  5.5  Mbit/s.  ... 

IEEE  802  1 1  b  is  without  a  question  one  of  the  most  successful  of  the  wireless  standards  that 
are  responsible  for  opening  up  the  commercial  WLAN  market.  Nevertheless,  to  further  improve 
the  spectral  efficiency  and  to  increase  the  possible  data  rate,  a  new  modulation  scheme  known 
as  orthogonal  frequency  division  multiplexing  (OFDM)  was  incorporated  into  the  follow-up 
standard's  of  IEEE  802.1  la  and  IEEE  802.1  lg.*  The  principles  and  analysis  of  OFDM  will  be 

discussed  next  in  Chapter  13. 
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, _ ontf-rcxercise  we  provide  some  opportunities  for  readers  to  learn  firsthand 

In  this  section  o  co  p  behavjor  0f  spread  spectrum  communications.  We  consider  the 

about  the  implemen  a  ^  spectrum  (FHSS),  direct  sequence  spread  spectrum  (DSSS) 

cases  of  frequency  PP  ^  systems  We  test  the  narrowband  jamming  effect  on  spread 

O"  —  COMA  systems. 


- u  I  CM  hind  as  in  IEEE  802. 1 1  b  and  must  be  backward  compatible.  Thus,  IEEE 

*  IEEE  802.1  lg  operates  mthe^mel  *  mechanisms.  IEEE  802.1  la,  however,  operates  in  the  5  GH/.  band 

802.1  lg  includes  both  the  CDMA  anu 
and  uses  OFDM  exclusively. 
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COMPUTER  EXERCISE  12.1:  FHSS  FSK  COMMUNICATION 
UNDER  PARTIAL  BAND  JAMMING 


Figure  12.22 

Performance  of 
FHSS 

noncoherent 
detection  under 
partial  band 
jamming. 


The  first  MATLAB  program,  Exl2_l  .m,  implements  an  FHSS  communication  system  that  utilizes  FSK 
and  noncoherent  detection  receivers.  By  providing  an  input  value  of  1  (with  jamming)  and  0  (without 
jamming),  we  can  illustrate  the  effect  of  FHSS  against  partial  band  jamming  signals. 


TABLE  12.4 

Parameters  Used  in  Computer  Exercise  12.1 


Number  of  users 
Spreading  factor 
(number  of  FSK  bands) 

Number  of  hops  per  symbol  per  bit 

Modulation 

Detection 

Partial  band  jamming 


Lh=\ 

BFSK 

Noncoherent 
1  fixed  FSK  band 


In  Exl2_l.m,  the  parameters  of  the  FHSS  system  are  given  in  Table  12.4.  When  partial  band 
jamming  is  turned  on,  a  fixed  but  randomly  selected  FSK  channel  is  blanked  out  by  jamming.  Under 
additive  white  Gaussian  channel  noise,  the  effect  of  partial  band  jamming  on  the  FHSS  user  is  shown  in 
Fig.  12.22.  Clearly,  we  can  see  that  without  jamming,  the  FHSS  performance  matches  that  of  the  FSK 
analysis  in  Sec.  12.1  and  Chapter  1 1.  When  partial  jamming  is  turned  on.  the  BER  of  the  FHSS  system 
has  a  floor  of  1  / (2 L)  as  shown  in  Eq.  ( 1 2.4).  As  L  increase  from  4  to  8,  and  to  1 6,  the  performance  clearly 
improves. 

%  MATLAB  PROGRAM  <Exl2_l.m> 

%  This  program  provides  simulation  for  FHSS  signaling  using 
%  non-coherent  detection  of  FSK. 

%  The  jammer  will  jam  1  of  the  L  frequency  bands  and 


Figure  12.22 
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partial  band 
jamming. 


L  =  8,  under  jamming 


L  =  4,  under  jamming 
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%  can  be  turned  on  or  off  by  inputting  jamming=l  or  0 
%  Non-coherent  MFSK  detection 

%  only  needs  to  compare  the  magnitude  of  each  frequency  bin. 
% 


clear; elf 

n=10000 ;  %Number  of  data  symbols  in  the  simulation 

L=8;  %  Number  of  frequency  bands 

Lh=l;  %  Number  of  hops  per  symbol  (bit) 

m=l;  %  Number  of  users 

%  Generating  information  bits 

s_dat a= round (rand (n,m) )  ; 

%  Turn  partial  band  jamming  on  or  off 


j amming= input  (' jamming=?  (Enter  1  for  Yes,  0  for  No)  ); 
%  Generating  random  phases  on  the  two  frequencies 
xbasel= [ exp ( j  *2  *pi*rand (Lh*n , 1 ) ) ] ; 
xbaseO= [exp ( j  *2*pi*rand (Lh*n, 1 ) ) ] ; 


%  Modulating  two  orthogonal  frequencies 

xmodsig= [kron (s_data, ones (Lh,  1) )  .  *xbasel  kron ( (l-s_data) , ones (Lh, 1) ) ,‘xbaseO] 
clear  xbaseO  xbasel; 

%  Generating  a  random  hopping  sequence  nLh  long 
Phop=round(rand(Lh*n, 1) * (L-l) ) +1;  %  PN  hopping  pattern; 

Xsiga=sparse ( 1 : Lh*n, Phop, xmodsig ( : ,  1 )  )  ; 

Xsigb=sparse(l:Lh*n,Phop,xmodsig( : ,2) )  ; 

%  Generating  noise  sequences  for  both  frequency  channels 


noisel=randn(Lh*n, 1) + j *randn (Lh*n, 1) ; 
noise2=randn (Lh*n, 1) + j *randn (Lh*n, 1) ; 


Nsiga=sparse (1 : Lh*n, Phop, noisel )  ; 
Nsigb=sparse (1 :Lh*n, Phop, noise2 ) ; 
clear  noisel  noise2  xmodsig, 


BER= [ ] ; 

BER _ a  z  — [ ] ; 

%  Add  a  jammed  channel 


(randomly  picked) 


if  (jamming) 

nch=round ( rand* (L-l) ) +1? 

Xsiga ( : , nch) =Xsiga ( : , nch) *0 ; 

Xsigb ( : , nch) =Xsigb ( : , nch) *0 ; 

Nsiga ( : , nch) =Nsiga ( : , nch) *0 ; 

Nsigb ( : , nch) =Nsigb ( : , nch) *0 ; 

Tcenerating  the  channel  noise  (AWGN) 
for  i=l:10, 

Eb2N(i)=i; 

Eb2N_num=10''  (Eb2N(i)  /10)  , 
Var_n=l/ (2*Eb2N_num) ; 
signois=sqrt (Var_n) ; 

ychl=Xsiga+signois*Nsiga; 

ych2=Xsigb+signois*Nsigb; 

%  Non-coherent  detection 


% (Eb/N  in  dB) 

%  Eb/N  in  numeral 
%1/SNR  is  the  noise  variance 
%  standard  deviation 
%  AWGN  complex  channels 
%  AWGN  channels 


for  kk=0:n-l, 

Yvecl= [ ] ;Yvec2=[]  ; 


for  kk2=l : Lh, 

Yvecl= [Yvecl 
Yvec2= [Yvec2 


ychl (kk*Lh+kk2 , phop (kk*Lh+kk2 ) ) ] ; 
ych2 (kk*Lh+kk2 , Phop (kk*Lh+kk2 ) ) ] ; 
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end 

ydiml=Yvecl*Yvecl ' ; 
ydim2=Yvec2*Yvec2 ' ; 
dec (kk+1) = (ydiml>ydim2) ; 

end 

clear  ychl  ych2 ; 

%  Compute  BER  from  simulation 
BER= [BER;  sum (dec ' ~=s_data ) /n] ; 

%  Compare  against  analytical  BER. 

BER_az= [BER_az ;  0 . 5*exp ( -Eb2N_num/2 ) ] ; 

end 

f igber=semilogy (Eb2N, BER_az , 'k- ' , Eb2N, BER, #  k-o ' ) ; 
set ( f igber , ' Linewidth' ,  2 )  ; 

legend ( 'Analytical  BER' ,  'FHSS  simulation' ) ; 
fx=xlabel ( ' E_b/N  (dB) ' ) ? 
fy=ylabel ( ' Bit  error  rate'); 

set ( fx,  ' FontSize ' , 11 ) ;  set ( fy , ' Fontsize ' , 11 ) ; 


COMPUTER  EXERCISE  12.2:  DSSS  TRANSMISSION  OF  QPSK 

In  this  exercise,  we  performance  a  DSSS  baseband  system  test  under  narrowband  jamming.  For  spreading 
in  this  case,  we  apply  the  Barker  code  of  length  1 1 


pcode  =[111-1-1-11  -1  -1  1  -1] 

for  spreading  because  of  its  nice  spectrum  spreading  property  as  a  short  code.  We  assume  that  the  channel 
noises  are  additive  white  Gaussian.  M  ATLAB  program  Exl2_2b .  m  provides  the  results  of  a  DSSS  user 
with  QPSK  modulation  under  a  narrowband  jamming. 


%  MATLAB  PROGRAM  <Exl2_2b.m> 

%  This  program  provides  simulation  for  DS-CDMA  signaling  using 
%  coherent  QAM  detection. 

%  To  illustrate  the  CDMA  spreading  effect,  a  single  user  is  spread  by 
%  PN  sequence  of  different  lengths.  Jamming  is  added  as  a  narrowband; 

%  Changing  spreading  gain  Lc; 

%  data  length  in  simulation;  Must  be  divisible  by  8 
%  spreading  factor  vs  data  rate 
%  can  also  use  the  shorter  Lc=7 
%  Generate  QPSK  modulation  symbols 

data_sym=2 * round ( rand ( Ldata ,  1 ) ) -1+ j * ( 2  *round ( rand ( Ldata , 1 ) ) -1 )  > 
j am— 1 data=2  * round ( rand ( Ldata ,  1 ) ) -1  + j  * ( 2  *round ( rand ( Ldata , 1) ) "1 )  • 

%  Generating  a  spreading  code 


clear; elf 
Ldata=20000 ; 
Lc=ll ; 


pcode= [1  1  1 


-l  -l  1-1-11 


-1J 


%  Now  spread 
x_in=kron (data_sym, pcode) ; 

%  Signal  power  of  the  channel  inpui 
%  Jamming  power  is  relative 

SIR=1°;  %  SIR  in  dB 

Pj=2*Lc/ (10" (SIR/10) ) ; 


is  2*Lc 


%  Generate  noise  (AWGN) 

noiseq=randn(Ldata*Lc, 1) +j*randn(Ldata*Lc, 1) ;  %  Power  is  2 
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%  Add  jamming  sinusoid  sampling  frequency  is  fc  =  Lc 
jam_mod=kron ( jam_data,  ones  (Lc,  1) )  ;  clear  jam_data; 

j  ammer=  sqrt ( P j / 2 ) * jam_mod . *exp (j*2*pi*0.12* (1: Ldata*Lc ) )  .  '  ;  %f j / fc=0 . 12  . 
clear  jam_mod; 

[P, x] =pwelch(x_in,  []  ,  []  ,  [4096] ,Lc, ' twoside' ) ; 
figure (1) ; 

semi logy (x-Lc/2 , f f tshif t (P) )  ; 
axis ( [-Lc/2  Lc/2  l.e-2  l.e2]); 
grid; 

xf ont=xlabel ( ' frequency  (in  unit  of  1/T_s) ' ) ; 
yf ont=ylabel ( 'CDMA  signal  PSD' ) ; 

set (xfont, ' FontSize ' # 1 1 ) ;set (yfont, 'FontSize' , 11) ? 

[P,  x]  =pwelch  ( jammer+x_in,  []  ,  []  ,  [4096] ,Lc,  'twoside'  )  ; 

figure (2 ) ; semi logy (x-Lc/2 ,  f f tshif t ( P)  )  ; 

grid; 

axis ([-Lc/2  Lc/2  l.e-2  l.e2]); 
xf ont=xlabel ( ' frequency  (in  unit  of  1/T_s)'); 
yfont=ylabel ( 'CDMA  signal  +  narrowband  jammer  PSD'); 
set (xfont, 'FontSize' , 11) ;set (yfont, 'FontSize' , 11) ; 


BER= [ ] ; 
BER_az= [ ] ; 


for  i=l:10, 

Eb2N (i) = (i-1) ; 

Eb2N_num= 1 0 " ( Eb2N ( i ) / 1 0 ) ; 
Var_n=Lc/ (2*Eb2N_num) ; 
signois=sqrt (Var_n) ; 

awgnois=signois*noiseq; 

%  Add  noise  to  signals  at  the 

y__out=x_in+awgnois+ jammer ; 

Y__out=reshape  (y_out ,  Lc ,  Ldata ) 


% (Eb/N  in  dB) 

%  Eb/N  in  numeral 
% 1 / SNR  is  the  noise  variance 
%  standard  deviation 
%  AWGN 

channel  output 


clear  y_out  awgnois; 


%  Despread  first 

z__out=Y_out*pcode; 

%  Decision  based  on  the  sign  of  the  samples 
«  -^i-nirpallz  out) )+j*sign(imag(z_out) ) ; 
decl  sig  original  data  to  compute  BER 

%  Now  compare  against  the  original 

BER= [BER; su»< [real (data_sy»)  -real (decl)^ . . . 

BER  j“[BER!«X5-«r“%,rt(Eb2N.„™, )  [ ;  %analytlc.l 

end 

figure (3 )  BER_az,'k-',Eb2N,BER,'k-o'); 

legendrNo1  jamming '/'Narrowband  jamming  (-10  dB)'); 
set ( f igber , 'LineWidth' , 2) ; 
xf ont=xlabel ( ' E_b/N  (dB)'); 

,  ,  this  case  is  L  =  1 1 .  the  DSSS  signal  occupies  a  bandwidth  approx- 
Because  the  spreading  factor  in  narrowband  QPSK  jamming  signal  with  a 

imately  II  times  wider  ratio  ,SIR,  c„  be  adjusted  In  Fig.  ,2.23. 

carrier  frequency  offset  of  1.32/7\  The  s.gna 
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Figure  12.23 

(a)  Power 
spectral  densities 
of  DSSS  signal 
using  Barker 
code  of  length 
1 1  for 
spreading: 

(a)  without 
narrowband 
jamming; 

(b)  with 
narrowband 
jamming  at 
SIR  =  1  OdB. 


Frequency,  \/Ts 
(a) 


Frequency,  1/7^ 
(b) 


Figure  12.24 

Bit  error 
probabilities  of 
DSSS  with  QPSK 
modulation 
under 

narrowband 

jamming. 


DSSS  (CDMA)  with  spreading  gain  =1 1 


we  can  witness  power  spectral  densities  before  and  after  the  addition  of  the  jamming  signal  when 
SIR  =  10  dB.  Despreading  at  the  receiver  enables  us  to  find  the  resulting  BER  of  the  QPSK  signa 
under  different  jamming  levels  (Fig.  12.24).  As  the  jamming  signal  becomes  stronger  and  stronger,  vse 
will  need  to  apply  larger  spreading  factors  to  mitigate  the  degrading  effect  on  the  BER- _ _ . — 


COMPUTER  EXERCISE  1 2.3:  MULTIUSER  DS-CDMA  SYSTEM 

To  implement  DS-CDMA  systems,  we  must  select  multiple  spreading  codes  with  good  cross-correlatio 
and  autocorrelation  properties.  Gold  sequences  are  a  very  well-known  class  of  such  good  spaa  >n 
codes.  Note  that  the  Gold  sequences  are  not  mutually  orthogonal.  They  have  some  nonzero  but  sma 
cross-correlations  that  can  degrade  the  multiuser  detection  performance.  We  select  four  Gold  sequent 
to  spread  four  QPSK  users  of  equal  transmission  power.  No  near-far  effect  is  considered  in  this  examp 
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The  first  MATLAB  program,  gold31code  .m,  assigns  four  Gold  sequences  of  length  31  to  the 
four  QPSK  modulated  user  signals: 


%  MATLAB  PROGRAM  <gold3 lcode .m> 

%  to  generate  a  table  of  4  Gold  sequence 


%  with 

length 

31  each. 

GPN= [ 1 

1 

1 

-1 

-1 

1 

-1 

1 

-1 

-1 

1 

1 

1 

1 

-1 

-1 

-1 

-1 

-1 

-1 

1 

1 

1 

1 

1 

1 

-1 

-1 

-1 

-1 

-1 

1 

-1 

1 

-1 

-1 

1 

-1 

-1 

1 

-1 

-1 

1 

1 

1 

1 

-1 

1 

1 

-1 

-1 

-1 

-1 

1 

1 

1 

1 

-1 

1 

1 

-1 

1 

1 

-1 

-1 

-1 

1 

-1 

-1 

1 

1 

-1 

1 

1 

1 

-1 

1 

-1 

1 

1 

1 

-1 

1 

-1 

1 

1 

-1 

-1 

1 

1 

1 

1 

1 

1 

-1 

-1 

-1 

-1 

-1 

-1 

1 

1 

-1 

1 

1 

-1 

-1 

-1 

-1 

1 

-1 

1 

1 

1 

-1 

-1 

1 

1 

1 

1 

1 

1 

1 

i: 

The  main  MATLAB  program.  Exl2_3  .m,  completes  the  spreading  of  the  four  user  signals.  The 
four  spread  CDMA  signals  are  summed  together  a.  the  receiver  before  detection.  Each  of  the  four  users 
n  P  i  ,h.  conventional  despreader  (matched  filter)  at  the  receiver  before  making  the  symbol-by- 
Wlll  apply  ,he  BER  of  all  four  users  in  Fig.  12.25  under  additive  white 

symbol  decisi  .  P  sing|e-user  BER  in  AWGN  channel  as  a  reference.  All  four  users  have 

S3ber Z  2m* . -  “R  -  <*  > BER  * ,te 

nonorthogonal  spreading  codes. 

%  MATLAB  PROGRAM  <Exl 23 ^m>  for  multiuser  DS-CDMA  signaling  using 

%  This  program  provides  simui 

%  coherent  QPSK  for  4  users. 

% 

%clear;clf  .  ta  length  in  simulation;  Must  be  divisible  by  8 

Ldata=10°0°;  %  Spreading  factor  vs  data  rate 

%User  number  =  4; 
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%  Generate  QPSK  modulation  symbols 

data_sym=2*round  ( rand  ( Ldata ,  4 )  )  -1+ j  *  ( 2  *round  ( rand  ( Ldata ,  4 )  )  -1 ) ; 


%  Select  4  spreading  codes  (Gold  Codes  of  Length  11) 

gold31code; 

pcode=GPN; 

%  Spreading  codes  are  now  in  matrix  pcode  of  31x4 
PowerMat=diag ( sqrt ( [ 1  1  1  1] ) ) ; 
pcodew=pcode  *  PowerMa t ; 

%  Now  spread 

x_m=kron(data_sym(  :  ,1)  /pcodew(  :  ,1)  )  +kron  (data_sym (  :  ,2)  , pcodew(  s , 2) )  ♦••• 
kron  ( data_sym  (:  ,3)  ,  pcodew  (:  ,3)  )  +kron  ( data_sym  (  :  ,  4 )  ,  pcodew  ( : ,  4 ) )  ; 


%  Signal  power  of  the  channel  input  is  2*Lc 
%  Generate  noise  ( AWGN ) 

noiseq=randn (Ldata* Lc / 1 )  +  j  *randn (Ldata*Lc ,  1 )  ;  %  Power  is  2 

BER1=[] ; 

BER2= [ ] ; 

BER3= [ ] ; 

BER4= [ ] ; 

BER_az= [ ] ; 


for  i=l : 12 , 

Eb2N ( i ) = ( i-1 ) ; 

Eb2N_num=10/x  (Eb2N(i)  /10)  ; 
Var_n=Lc / ( 2  *  Eb2N_num ) ; 
signois=sqrt  (Var__n)  ; 
awgnois=signois*noiseq; 

%  Add  noise  to  signals  at  the 


% (Eb/N  in  dB) 

%  Eb/N  in  numeral 
%  1  / SNR  is  the  noise  varianc 
%  standard  deviation 
%  AWGN 

channel  output 
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y_out=x_in+awgnois ; 

Y_out=reshape (y_out , Lc, Ldata) . ' ;  clear  y_out  awgnois; 

%  Despread  first 
z_out=Y_out*pcode; 

%  Decision  based  on  the  sign  of  the  samples 
dec=sign(real (z_out) ) + j *sign ( imag ( z_out ) ) ; 

%  Now  compare  against  the  original  data  to  compute  BER 
BER1= [ BER1 ; sum( [real (data_sym( :  ,  1) ) ~=real (dec ( : , 1) ) ; . . . 

imag (data_sym( 1,1)) ~= imag (dec ( : , 1) ) ] ) / (2*Ldata) ) ; 

BER2  = [ BER2 ; sum ( [real (data_sym ( : ,2) ) ~=real (dec( : ,2) )  ;  .  .  . 

imag (data_sym {: ,2) ) ~=imag (dec {: ,2) )]) / (2  *Ldata) ] ; 

BER3= [BER3 ;sum( [real (data_sym( : , 3) ) ~=real (dec ( : , 3)  )  ;  .  .  . 

imag (data_sym( :  ,  3  )  ) ~=imag (dec (  :  ,  3 )  )  ]  )  / (2*Ldata) ] ; 

BER4= [ BER4 ; sum ( [real (data_sym( :  ,  4)  ) ~=real (dec ( : , 4) ) ; . . . 

imag (data_sym ( : , 4 ) ) ~  =  imag (dec ( : , 4 ) ) ] ) / ( 2  * Ldata) ] ; 

BER_az= [BER_az ; 0 . 5*erfc (sqrt (Eb2N_num) ) ] ;  %analytical 

end 

BER= [BERl  BER2  BER3  BER4] ; 
figure (1) 

f igber=semilogy (Eb2N, BER_az , 'k-' , Eb2N,  BERl ,  # k-o ' /Eb2N,BER2, #k-s#  #  .  .  . 

Eb2N,BER3/ 'k-v' /Eb2N#BER4/ 'k-*# ) ; 
legend ('Single-user  (analysis)  \  'User  1  BER', 'User  2  BER', 

'User  3  BER', 'User  4  BER') 
axis ( [0  12  0 . 99e-5  I.eO]); 
set ( f igber , ' LineWidth ' , 2 ) ; 

xlabel ( ' E_b/N  (dB) ' ) ;ylabel ( ' QPSK  bit  error  rate') 
title (' 4-user  CDMA  BER  with  Gold  code  of  length  31'); 


COMPUTER  EXERCISE  12.4:  MULTIUSER  CDMA  DETECTION 


WIIU  Uic  ptliuiwm- -  n  A/ 

performance  results  of  user  2  and  user  4  in  Fig.  1  -.26. 


%  variations. 


itigate  the  near-far  effect 


% 

%clear >  elf 
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Figure  12.26 

Performance 
comparison  of 
decorrelator 
MUD  in  com¬ 
parison  with  the 
conventional 
single-user 
receiver. 


Ldata-100000 ;  %  data  length  in  simulation;  Must  be  divisible  by 

Lc=31;  %  spreading  factor  vs  data  rate 

%User  number  =  4; 

%  Generate  QPSK  modulation  symbols 

data_j syni=2  *  round  ( rand (Ldata,  4 )  )  -1  +  j  *  ( 2*  round  (rand  (Ldata ,  4 )  )  -1 )  ; 

%  Select  4  spreading  codes  (Gold  Codes  of  Length  11) 

gold31code; 

pcode=GPN; 

%  Spreading  codes  are  now  in  matrix  pcode  of  31x4 
PowerMat=diag ( sqrt ( [ 10  1  5  1] ) ) ; 
pcodew=pcode  *  PowerMa t ; 

Rcor=pcodew/ *pcodew; 

Rinv=pinv ( Rcor ) ; 

%  Now  spread 

X— m=kron (data_sym ( :  ,  1)  , pcodew (  :  f  1)  )  +kron (data_sym (  :  ,  2)  , pcodew ( :  ,2) )  +* 
kron  ( data_sym  ( :  ,  3 )  ,  pcodew  ( : ,  3 )  )  +kron  ( data_sym  ( : ,  4 )  ,  pcodew  ( : ,  4 )  )  ; 

%  Signal  power  of  the  channel  input  is  2*Lc 

%  Generate  noise  (AWGN) 

noiseq=randn ( Ldata*Lc ,  1 )  +  j  *randn ( Ldata*Lc , 1) ;  %  Power  is  2 

BERb2= [ ] ; 

BERa2= [ ] ; 

BERb4= [ ] ; 

BERa4= [ ] ; 

BER_az= [ ] ; 


for  i=l : 13 , 

Eb2N (i) = (i— 1) ; 


% ( Eb/N  in  dB) 
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Eb2N_num=l(r  (Eb2N(i)  /10)  ; 
Var_n=Lc/  (2*Eb2N_num)  ; 
signois=sqrt (Var_n) ; 
awgnois=signois*noiseq; 


%  Eb/N  in  numeral 
% 1 / SNR  is  the  noise  variance 
%  standard  deviation 
%  AWGN 


%  Add  noise  to  signals  at  the  channel  output 
y_out=x_in+awgnois ; 

Y_out=reshape (y_out , Lc , Ldata) . ' ;  clear  y_out  awgnois; 

%  Despread  first  and  apply  decorrelator  Rinv 

z_out= (Y_out Opcode ) ;  %  despreader  (conventional)  output 

clear  Y_out; 

z  dcr=z_out*Rinv;  %  decorrelator  output 


%  Decision  based  on  the  sign  of  the  single  receivers 
decl=sign (real ( z_out ) ) +j *sign (imag (z_out) ) ; 
dec2=sign (real ( z_dcr) ) + j *sign ( imag ( z_dcr ) ) ; 

%  Now  compare  against  the  original  data  to  compute  BER  of  user  2 
%  and  user  4  (weaker  ones) . 

BERa2= [BERa2 ; sum ( [real (data_sym( : ,2) ) "=real (decl  (  : , 2 ) )  ;  .  .  . 

imag ( data_sym ( : , 2) ) ~=imag(decl ( : , 2)  )  ]  )  / (2*Ldata) ] ? 

BERa4= [BERa4 ; sum ( [real (data_sym( :  ,  4)  ) ~=real (decl ( : , 4 ) ) ; . . . 

imag ( data_sym ( : ,  4) ) ~=imag(decl ( : , 4)  )  ]  )  / (2*Ldata) ] ; 

BERb2= [ BERb2 ; sum( [real (data_sym( : ,2))  ~=real (dec2 (1,2));... 

imag (data_sym ( : ,  2 )  ) ~=imag (dec2 (:,2))])/(2*  Ldata ) ] , 

BERb4= [BERb4 ; sum ( [real (data_sym ( : #  4 ) )  =real (dec2 ( : , 4 ) )  ;  .  .  . 

imag (data_sym ( : ,4) ) ~=imag(dec2 ( : , 4) ) ] ) / (2* Ldata) ] ; 

BER_az= [BER_az; 0 . 5*erfc (sqrt (Eb2N_num) ) ] ;  %analytical 


end 

f igber=semilogy (Eb2N , BER_az , ' k- ' , Eb2N , BERa2 , ' k-o ' , Eb2N, BERa4 , ' 
Eb2N, BERb2 , 'k — o' , Eb2N , BERb4 , #k  s  # ) ; 
legend ('Single-user  (analysis) 'User  2  (single  user  detector) ' 
'User  4  (single  user  detector )', 'User  2  (decorrelator)',... 
'User  4  (decorrelator )' ) 
axis ( [0  12  0 . 99e-5  I.eO] ) ? 
set (f igber, 'LineWidth' , 2) ; 

xlabel ( ' E_b/N  (dB) ' ) ;ylabel ( 'QPSK  bit  error  rate  ) 
title ( 'Weak-user  BER  comparisons'); 


k-s ' 


We  ,l,o  implemenl  the  decisioo  feedb«:k  MUD  of  See.ion  1 2.7  in  MATLAB  pn^m  Exl2.4b 
The  dedsioo  feedback  MUD  perfonn.nce  of  the  two  owr,  „  shown  <«  Ftg.  1 2 .27. 


m. 


I  foe  bokkiun-c  Ck* 

%  The  <1  users  have  differeoc  peers  to  rllostrete  the 

*  near-tar  effect  in  single  user  conventional  receivers 


%  Decision  feedback  detectors  are 
%  ability  to  overcome  the  near-  ar 


tested  to  show  its 
problem . 


%clear;clf 
Ldata=100000; 
Lc=31 ; 


%  data  length  in  simulation;  Must  be  divisible  by  8 
%  spreading  factor  vs  data  rate 


%User  number  -  4; 
%  Generate  QPSK 


modulation  symbols 
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Figure  1 2.27 

Performance 
comparison  of 
decision 
feedback  MUD 
in  comparison 
with  the 
conventional 
single-user 
receiver. 


Weak-user  BER  comparisons 

-| - r~ 

Conventional  single-user  detector 


§ 


_c 

*  10“ 
C/5 

a, 

O' 


io^  r 


10 


Single-user  (analysis) 

User  2  (single-user  detector) 
User  4  (single-user  detector) 
User  2  (decision  feedback) 
User  4  (decision  feedback) 


EtfM  dB 


& 


data_sym-2  *  round  ( rand  ( Ldata ,  4)  )  —  1-*- j  *  (2*round  (rand  (Ldata,  4)  )  -1) ; 

%  Select  4  spreading  codes  (Gold  Codes  of  Length  11) 

gold31code; 

pcode=GPN; 

%  Spreading  codes  are  now  in  matrix  pcode  of  31x4 
PowerMat=diag (sqrt ( [10  1  5  1] ) ) ; 
pcodew=pcode*PowerMat; 

Rcor=pcodew' *pcodew; 

%  Now  spread 

x__ m=kron(data_sym(  :  ,  1)  /pcodew(  :  ,  1)  )  +kron  (data_sym ( :  ,2)  , pcodew  ( :  ,2) )  +••• 
kron  (data_sym  ( : ,  3 )  ,  pcodew  ( : ,  3 )  )  +kron  ( data_sym  ( : ,  4 )  ,  pcodew  ( : ,  4 ) )  ; 

%  Signal  power  of  the  channel  input  is  2*Lc 

%  Generate  noise  (AWGN) 

no i s eq= r andn (Ldata *Lc, 1) + j *randn (Ldata *Lc , 1 ) ;  %  Power  is  2 

REFE 


for  i=l : 13 , 

Eb2N (i ) = ( i — 1 ) ; 
Eb2N_num=10''  (Eb2N(i)  /10)  ; 
Var_n=Lc / ( 2  *  Eb2N_num ) ; 
signois=sqrt (Var_n) ; 


% ( Eb/N  in  dB) 

%  Eb/N  in  numeral 
%1/SNR  is  the  noise  varia"0 
%  standard  deviation 


BER_c2=[] ; 
BER2= [ ] ; 
BER_c4= [ ] ; 
BER4= [ ] ; 
BER_az= [ ] ; 
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awgnois=signois*noiseq;  %  AWGN 

%  Add  noise  to  signals  at  the  channel  output 
y_out=x_in+awgnois ; 

Y_out=reshape (y_out , Lc, Ldata) . '  ;  clear  y_out  awgnois; 

%  Despread  first 

z_out=Y_out*pcode;  %  despreader  (conventional)  output 

clear  Y_out; 

%  Decision  based  on  the  sign  of  the  single  receivers 
dec=sign(real (z_out) ) +j *sign (imag (z_out) ) ; 


%  Decision  based  on  the  sign  of  the  samples 
decl=sign (real (z_out ( 1,1))) +j * sign ( imag ( z_out (:,1))); 
z_fkl=z_out-decl*Rcor (1, :) ? 

dec3=sign (real (z_fkl( : , 3) ) ) + j *sign ( imag ( z_f kl (:,3))); 
z_fk2=z_fkl-dec3*Rcor (3, :); 

dec2=sign (real ( z_f  k2 ( : , 2 ) ) ) + j  *sign ( imag ( z_f k2 (:,2))); 
z_f k3=z_fk2-dec2*Rcor (2 , : ) ; 

dec4=sign ( real ( z_f k3 ( : , 4 ) ) ) + j *sign ( imag ( z_f k3 ( s , 4 ) )  )  ; 

%  Now  compare  against  the  original  data  to  compute  BER 
BER_c2= [BER_c2 ;sum( [real (data_sym( : ,2)) ~=real (dec [: ,2) ) ;  .  .  . 

imag  (data__sym  ( :  ,2)  )  ~=imag(dec  {:  ,2))))  /  (2  ♦Ldata)  ]  ; 

BER2= [BER2 ; sum ( [real (data_sym ( : , 2 ) )  =real (dec2 ) ; .  .  . 

imag (data_sym ( : , 2 ) ) ~=imag (dec2 ) ] ) / (2 ♦ Ldata ) ] ; 

BER_c4= [BER_c4 ; sum ( [real (data_sym( :  ,  4) ) ~=real (dec ( : , 4) ) ; . . . 

imag (data_sym( : , 4) ) ~=imag (dec ( : , 4) ) ] ) / (2*Ldata) ] ; 

BER4= [BER4; sum( [real (data_sym( : , 4) )  =real (dec4) ; .  .  . 

imag (data_sym( : , 4 ) ) ~=imag (dec4 ) ] ) / (2*Ldata) ] ; 

BER_az= [BER_az ; 0 . 5*erfc (sqrt (Eb2N_num) ) ] ?  %analytical 


end 

clear  z_fkl  z_fk2  z_fk3  decl  dec3  dec2  dec4  x_in  y_out  noiseq; 

f igber=semilogy (Eb2N, BER_az, ' k- ' , Eb2N ,  BER_c2 , ' k-o ' , Eb2N, BER_c4 , 'k-s'  ,  . 
Fb2N  BER2  ' k--o ; , Eb2N, BER4  ,  ' k--s ' ) ; 

legend ('Single-user  (analysis) 'User  2  (single  user  detector)',... 
'User  4  (single  user  detector) ', 'User  2  (decision  feedback)  .... 
'User  4  (decision  feedback) ' ) 
axis ( [0  12  0 . 99e-5  I.eO] ) ? 
set (fiaber, 'LineWidth' ,2) ; 

xlabel ( ' E_b/N  (dB) ' ) ;ylabel ( 'QPSK  bit  error  rate  ) 
title ('Weak-user  BER  comparisons'); 
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PROBLEMS 

12.1- 1  Consider  a  fast  hopping  binary  ASK  system.  The  AWGN  spectrum  equals  Sn(/)  =  ^’andthc 

binary  signal  amplitudes  are  0  and  2  V,  respectively.  The  ASK  uses  a  data  rate  of  100  kbit/s  and 
is  detected  noncoherently.  The  ASK  requires  1 00  kHz  bandwidth  for  transmission.  However,  the 
frequency  hopping  is  over  12  equal  ASK  bands  with  bandwidth  totaling  1.2  MHz.  The  partial 
band  jammer  can  generate  a  strong  Gaussian  noise— like  interference  with  total  power  of  27  dBm. 

(a)  If  a  partial  band  jammer  randomly  jams  one  of  the  1 2  FH  channels,  derive  the  BER  of  the 
FH-ASK  if  the  ASK  signal  hops  6  bands  per  bit  period. 

(b)  If  a  partial  band  jammer  randomly  jams  two  of  the  1 2  FH  channels,  derive  the  BER  of  the 
FH'ASK  if  the  ASK  signal  hops  6  bands  per  bit  period. 

(c)  If  a  partial  band  jammer  jams  all  12  FH  channels,  derive  the  BER  of  the  FH-ASK  if the 
ASK  signal  hops  6  bands  per  bit  period. 

12.1- 2  Repeat  Prob.  12.1-1  if  the  ASK  signal  hops  12  bands  per  bit  period. 

12.1- 3  Repeat  Prob.  12.1-1  if  the  ASK  signal  hops  one  band  per  bit  period. 

12.2- 1  In  a  multiuser  FHSS  system  that  applies  BFSK  for  each  user  transmission,  consider  each  inter¬ 

fering  user  as  a  partial  band  jammer.  There  are  M  users  and  L  total  signal  bands  for  synchronous 
frequency  hopping.  The  desired  user  under  consideration  hops  Lh  bands  within  each  bit  pen°  ' 
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(a)  Find  the  probability  that  exactly  1  of  the  signal  bands  used  by  the  desired  user  during  a 
signal  bit  is  jammed  by  the  interfering  signals. 

(b)  Determine  the  probability  that  none  of  the  signal  bands  used  by  the  desired  user  during  a 
signal  bit  will  be  jammed  by  the  interfering  signals. 

(c)  Assume  that  when  a  partial  signal  band  is  jammed,  we  can  compute  the  BFR  effect  by 
discarding  the  signal  energy  within  the  jammed  band.  Find  the  BFR  of  a  given  user  within 
the  system. 

12.4-1  Let  the  AWGN  noise  n(f)  have  spectrum  M/2.  If  the  AWGN  noise  n(/)  is  ideally  band-limited 
to  1  /27V  Hz,  show  that  if  the  spreading  signal  c(/)  has  autocorrelation  function 


Rc(t)  =  Y,S(r-iLTc) 

i 


then  the  PSD  of  x(f)  =  n(f)c(f)  is  approximately 

SAf)=  Sn(v)Sc(f  -  v)dv  =  — 

J —00  L 

12.5-1  Consider  DSSS  systems  with  interference  signal  /'(/).  At  the  receiver,  the  despread  signal  c(t )  = 
±1  with  bandwidth  Bc. 

(a)  Show  that  /(f)  and  the  despread  interference 

4,(0  =  *(0c(0 


have  identical  power. 

(b)  If  i(0  has  bandwidth  Bs  and  the  spreading  factor  is  L  such  that  BC  =  L  B„  show  that  the 
power  spectrum  of  4,(0  «  L  times  lower  but  L  times  wider. 

12  6-1  In  a  multiuser  CDMA  system  of  DSSS.  all  transmitters  are  at  equal  distance  from  the  receivers. 
In  other  words.  gi  =  constant.  The  additive  white  Gaussian  noise  spectrum  equals  Sn(/)  = 
5  x  10-6.  BPSK  is  the  modulation  format  of  all  users  at  the  rate  of  16  kbit/s. 

(a)  If  the  spreading  codes  are  all  mutually  orthogonal,  find  the  desired  user  signal  power  P, 
required  to  achieve  BER  of  10"5. 

(b)  If  the  spreading  codes  are  not  orthogonal,  more  specifically, 

Rii  =  1  Rij  =  —1/16  i^j 

Determine  the  required  user  signal  power  to  achieve  the  same  BER  of  10"5  by  applying  Gaussian 
approximation  of  the  nonorthogonal  MAI. 

12  6.2  Repeal  Prob.  12.6-1,  if  one  of  the  15  incerfering  .~,it,er  is  2  iimes  ok*,  io  lb.  de.ireb 
receiver  such  that  its  gain  gl  is  4  times  stronger. 

12.6-3  In  a  CDMA  DSSS  system,  each  of  its  three  users  transmits  BPSK  with  power  of  20  n,W.  A.  the 
joint  receiver,  the  three  transmitted  signals  have  gains 

g,  =  10-2  S2='0_l  and  2X|0"' 

The  spreading  codes  are  such  that 

/?,.,=  64  Rij  =  “I  t*J 
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The  sampled  noise  n\  (k)  is  Gaussian  with  zero  mean  and  variance  10  \ 

(a)  Find  the  BER  of  the  received  user  1  data. 

(b)  Find  the  BER  of  the  received  user  2  data. 

(c)  Find  the  BER  of  the  received  user  3  data. 

12.7- 1  For  the  multiuser  CDMA  system  of  Prob.  1 2.6-3,  design  the  corresponding  decorrelator  and  the 

MMSE  detectors. 

12.7- 2  For  the  multiuser  CDMA  system  of  Prob.  12.6-3,  determine  the  MLD  receiver  criterion. 

12.7- 3  For  the  multiuser  CDMA  system  of  Prob.  12.6-3,  explicitly  describe  the  implementation  of  the 

decision  feedback  detector. 


Q  DIGITAL  COMMUNICATIONS 
O  UNDER  LINEARLY  DISTORTIVE 
CHANNELS 


In  our  earlier  discussion  and  analysis  of  digital  communication  systems,  we  have  made 
the  rather  idealistic  assumption  that  the  communication  channel  introduces  no  distortion. 
Moreover,  the  only  channel  impairment  under  consideration  has  been  additive  white  Gaus¬ 
sian  noise  (AWGN).  In  reality,  however,  communication  channels  are  far  from  ideal.  Among 
a  number  of  physical  channel  distortions,  multipath  is  arguably  the  most  serious  problem 
encountered  in  wireless  communications.  In  analog  communication  systems,  multipath  repre¬ 
sents  an  effect  that  can  often  be  tolerated  by  human  ears  (as  echos)  and  eyes  (as  shadows).  In 
digital  communications,  however,  multipath  leads  to  linear  channel  distortions  that  manifest  as 
^symbol  interferences  (ISI).  This  is  because  multipath  leads  to  multiple  copies  of  the  same 
signal  arrivin'1  at  the  receiver  with  different  delays.  Thus,  one  symbol  pulse  is  delayed,  which 
affects  one  or  more  adjacent  symbols,  causing  ISI.  As  we  have  discussed,  ISI  can  severely 
affect  the  accuracy  of  the  receivers.  To  combat  the  effects  of  ISI  due  to  multipath  channels, 
we  discuss,  in  this  chapter,  two  highly  effective  tools:  equalization  and  OFDM  (orthogonal 
frequency  division  modulation). 


.1 


LINEAR  DISTORTIONS  OF  WIRELESS 
MULTIPATH  CHANNELS 


Dieital  communication  requires  that  digital  signals  be  transmuted  over  a  specific  medium 
U  8  U  ,  on(i  the  receiver  The  physical  media  (channels)  in  real  world  are 

between  the  transmt  ai  ljmitations  however,  analog  channels  are  usually  imperfect  and 
analog.  Because  P  Examples  of  nonideal  analog  media  include  telephone 

can  introduce  and  radio-frequency  (RF,  wireless  channels  at 

lines,  coaxial  cabl  .  demonstrates  a  simple  case  in  which  transmission  from  a 

various  frequence. gure muhipath channel:  one  ray  from  the  line-of- 

s^tt  and  one^rom'the  ground  reflection.  At  the  receiver,  there  are  two  copies  of  the  transmitted 
signal,  one  of  which  is  a  delayed  version  ol  the  other. 
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Figure  13.1 

Simple  illus¬ 
tration  of  a 
two-ray  multipath 
channel. 


Mobile  station 


To  understand  the  effect  of  multipath  in  this  example,  we  denote  the  line-of-sight  signal 
arrival  and  the  reflective  arrival,  respectively,  as 

s(t)  =  m{t)  cos  coct  and  a\s(t  —  T\)  =  a\m(t  —  ri)cos  (oc{t  -  rj) 

Here  we  assumed  that  the  modulation  is  DSB  with  PAM  message  signal  (Chapter  7) 


m (0  =  ^«*p(f  -  kT) 


k 


where  T  is  the  PAM  symbol  duration.  Note  also  that  we  use  a  i  and  ri .  respectively,  to  represent 
the  multipath  loss  and  the  delay  relative  to  the  line-of-sight  signal.  Hence,  the  receiver  RF  input 
signal  is 

r(/)  =  m(t)  cos  coct  +  a\m(t  -  ri)  cos  coc(t  -  ri)  +  nc(r)cos  (oct  +  n,(f)  sin  coct  (13.1) 

In  Eq.  (13.1),  nf(/)  and  n<(/ )  denote  the  in-phase  and  quadrature  components  of  the  bandpass 
noise,  respectively  (Sec.  9.8).  By  applying  coherent  detection,  the  receiver  baseband  output 
signal  becomes 


y(0  =  LPF(2r(r)cos  coct] 

=  m(t)  +  aq(cos  cocz\)m(t  -  rj)  +  nt  (f) 

=  ^ °kP(t  ~  kT)  +  (<*i  •  cos  cocz\ )  ^ akp(t  -  kT  -  x\)  +  nt(f) 


(13.2a) 


k 


k 


—  ^2ak  —  kT)  +  («]  cos  coczt)p(t  —  kT  —  ti)]  +  nc(f)  (13.2b) 


k 

By  defining  a  baseband  waveform 


9(0  —p(0  +  (<*i  cos  cocz\)p(t  —  n) 


we  can  simplify  Eq.  (13.2b) 


(13.2c) 


* 


Effectively,  this  multipath  channel  has  converted  the  original  pulse  shape  p(t)  into  <7(0-  )fp^) 
was  designed  (as  in  Chapter  7)  to  satisfy  Nyquist’s  first  criterion  of  zero  ISI, 


then  the  new  pulse  shape  q(t)  will  certainly  have  ISI  as 

q(nT)  =p(nT )  +  (oq  -cos  coczi)p(nT  -  z\)  ^  0  n  =  ±1,  ±2,  ••• 
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To  generalize,  if  there  are  K  +  1  different  paths,  then  the  effective  channel  response  is 

K 

q(t)  =p(t)  +  |  a,  cos  <wcTf]  p(t  -  r,) 

i'=l 

in  which  the  line-of-sight  path  delay  is  assumed  to  be  ro  =  0  with  unit  path  gain  cro  =  I. 
The  ISI  effect  caused  by  the  K  summations  in  q(t)  depends  on  (a)  the  relative  strength  of  the 
multipath  gains  {or, };  and  (b)  the  multipath  delays  { r, ) . 


General  QAM  Models 

For  conserving  bandwidth  in  both  wire-line  and  wireless  communications,  QAM  is  an  efficient 
transmission.  We  again  let  the  QAM  symbol  rate  be  \/T  and  its  symbol  duration  be  T.  Under 
QAM,  the  data  symbols  { sk }  are  complex-valued,  and  the  quadrature  bandpass  RF  signal 
transmission  is 


s(t)  = 


e{sk}p(t  -kT) 


cos  coct  + 


y  Im{.v*}/?(f  -  kT) 


L  k 


sin  u)ct  (13.3) 


Thus,  under  multipath  channels  with  K  +  1  paths  and  impulse  response 


8(t)  +  ctj8(t  —  r /) 


i=l 


the  received  bandpass  signal  for  QAM  is 

K 


t\ 

r(,)  =  s(f)  +  y  aiS(t  -  r,)  +  nc(t)  cos  coct  +  n,(/)  sin  o>ct  ( 1 3.4) 

/=! 

Applying  coherent  detection,  the  QAM  demodulator  has  two  baseband  outputs 
LPF  |2r(f)  cos  coct)  and  LPF|2r(/)  sin  a>ct).  These  two  (in-phase  and  quadrature)  outputs  are 
real-valued  and  can  be  written  as  a  single  complex-valued  output: 

y(t)  =  LPF{2r(r)  cos  coct }  +j  •  LPF{2r(r)  sin  a>ct)  0  3.5a) 


=  ^Re{s*} 


K 

^(a,  cos  (ocTi)p(t  -  kT  -  Tj) 


./=o 


+  ^Im{s*} 


X 

y ^(ctj  sin  cocn)p(t  -  kT  —  r,) 


_  /=o 


-j  ■  ^Re{s*} 

k 

+y.^Im{s*} 


X 

y^((y/  sin  (vcTj)p(t  —  kT  —  r,) 
,/=o 

■  K 

<Xi  cos  cocTj)p(t  -kT  -  t i) 


-i=0 


+  n<(/)  +jns(l) 


-E 


Sk 


~  K 

Y  cti  exp (—ja>c*i)P(?  ~  ^  ~  T‘  ^ 


k  L  1=0 


+  nr(/)  +j  ns(t) 


(13.5b) 
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Once  again,  we  can  define  a  baseband  (complex)  impulse  response 

K 

q(t)  =  or/  exp(jcocTi)p(t  -  kT  -  r,)  (13.6a) 

i=0 

and  the  baseband  complex  noise 

MO  =  MO  +J  MO  03.6b) 

The  receiver  demodulator  output  signal  at  the  baseband  can  be  then  written  simply  as 

y(t)  =  Y  5kq(t  -  kT)  -I-  n,(0  03.7) 

k 

in  which  all  variables  are  complex-valued.  Clearly,  the  original  pulse  p(t)  that  was  designed  to 
be  free  of  ISI  has  been  transformed  by  the  multipath  channel  route  into  q(t).  In  the  frequency 
domain,  we  can  see  that 

K 

Q(f )  =  or,  exp  [—j(2jrf  -  coc)t ,]  •  P(f)  (I3  8) 

«'=0 

This  means  that  the  original  frequency  response  P{f)  encounters  a  frequency-dependent 
transfer  function  because  of  multipath  response 

K 

Y  otj  exp(ja)cTj)  exp  [-jlnf  r,] 

/= 0 

Therefore,  the  channel  distortion  is  a  function  of  the  frequency/.  Communication  channels 
that  introduce  frequency-dependent  distortions  are  known  as  frequency-selective  channels. 
Frequency-selective  channels  can  exhibit  substantial  ISI,  which  can  lead  to  significant  increase 
of  detection  errors. 

Wire-Line  ISI 

Although  we  have  just  demonstrated  how  multipath  in  wireless  communications  can  lead  to 
ISI  and  linear  channel  distortions,  wire-line  systems  are  not  entirely  immune  to  such  problems. 
Indeed,  wire-line  systems  do  not  have  a  multipath  environment  because  all  signals  are  trans 
mitted  by  dedicated  cables.  However,  when  the  cables  have  multiple  unused  open  terminals* 
impedance  mismatch  at  these  open  terminals  can  also  generate  reflective  signals  that  will  arrive 
as  delayed  copies  at  the  receiver  terminals.  Therefore,  ISI  due  to  linear  channel  distortion  can 
also  be  a  problem  in  wire-line  systems.  Cable  internet  service  is  one  example. 

Equalization  and  OFDM 

Because  ISI  channels  lead  to  serious  signal  degradation  and  poor  detection  pertormanc 
their  effects  must  be  compensated  either  at  the  transmitter  or  at  the  receiver.  In  most  cases 
transmitters  in  an  uncertain  environment  are  not  aware  of  the  actual  conditions  of  propagatio 
Thus,  it  is  up  to  the  receivers  to  identify  the  unknown  multipath  channel  q(t)  and  to  a 
effective  means  to  combat  the  ISI.  The  two  most  common  and  effective  tools  against 
channels  are  channel  equalization  and  OFDM. 


13.2  Receiver  Channel  Equalization 
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Figure  13.2 

Baseband 
representation  of 
QAM  trans¬ 
mission  over  a 
linear  time- 
invariant  channel 
with  ISI. 


Input 

LTI  channel 

Output 

^2skS(t-  kT  +  t0) 

q{t) 

V - 

>  y(t) 

k 

n  At) 

Noise 

13.2  RECEIVER  CHANNEL  EQUALIZATION 

It  is  convenient  for  us  to  describe  the  problem  of  channel  equalization  in  the  stationary  channel 
case.  Once  the  fundamentals  of  linear  time-invariant  (LTI)  channel  equalization  is  understood, 
adaptive  technology  can  handle  time-varying  channels. 

When  the  channel  is  LTI,  we  use  the  simple  system  diagram  of  Fig.  13.2  to  describe  the 
problem  of  channel  equalization.  In  general,  channel  equalization  is  studied  for  the  (spectrally 
efficient)  digital  QAM  systems.  The  baseband  model  for  a  typical  QAM  (quadrature  amplitude 
modulated)  data  communication  system  consists  of  an  unknown  LTI  channel  q(l),  which 
represents  the  physical  interconnection  between  the  transmitter  and  the  receiver  in  baseband. 

The  baseband  transmitter  generates  a  sequence  of  complex-valued  random  input  data  (.v* ), 
each  element  of  which  belongs  to  the  constellation  A  of  QAM  symbols.  The  data  sequence 
{s*}  is  sent  through  the  baseband  channel  that  is  LTI  with  impulse  response  </(/).  Because 
QAM  symbols  [sk]  are  complex-valued,  the  baseband  channel  impulse  response  q(t)  is  also 
complex-valued  in  general. 

Under  the  causal  and  complex-valued  LTI  communication  channel  with  impulse  response 
q(t),  the  input-output  relationship  of  the  QAM  system  can  be  written  as 

00 

y(t)  =  ^  Sk<j(t  ~~  kT  +  to)  +  n*(f)  sic  6  A  (13.9) 

A: =-oo 

Typically  the  baseband  channel  noise  n,(f)  is  assumed  to  be  stationary,  Gaussian,  and  inde¬ 
pendent  of  the  channel  input  sk.  Given  the  received  baseband  signal  y(t)  at  the  receiver,  the 
job  of  the  channel  equalizer  is  to  estimate  the  original  data  { sk )  from  the  received  signal  y(t). 

In  what  follows,  we  present  the  common  framework  within  which  channel  equalization 
is  typically  accomplished.  Without  loss  of  generality,  we  let  /„  =  0. 


13.2.1  Antialiasing  Filter  vs.  Matched  Filter 

We  showed  in  Secs  1 1 . 1  and  1 1 .6  that  the  optimum  receiver  filter  should  be  matched  to  the  total 
««>  ThiVliUer  serves  to  maximize  the  SNR  of  the  sampled  signal  auhe  hire,  output. 
Keen  if  the  response  M)  has  ISI,  Forney1  has  eslnblished  rhe  ophmalny-  ol  the  matched  hirer 
E  f  shown  in  Fig  13  3  With  a  matched  filter^-/)  and  symbol  (baud)  rate  sampling 
=  an  output  sequence  relationship  between  the  transmitter  data 

[sk]  and  the  receiver  samples  as 

z[n]  =  ^skh(nT-kT)  (13.10) 


i  „  .  ,  rf-tutics  for  input  symbol  estimation  is  retained  by  baud  rale  sampling  at  t  -  n'l  ol 

*  Forney  proved  that  sufficient  st  _  ha^is  of  the  well-known  single-input-single-output  (SISO)  system 

matched  filter  output  signal.  1  his  re^  However  when  </(/)  is  unknown,  the  optimality  no  longer  applies. 
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Figure  13.3 

Optimal  matched 
filter  receiver. 


y(t ) 

Matched  filter 

q(-t) 

z(t) 

/  =  nT 

where 


h(t)  =  q(t)  *  q(-t) 


(13.11) 


If  we  denote  the  samples  of  h(t) 


h[n]  =  h(nT) 


then  Eq.  (13.10)  can  be  simplified 

z[n]  =  ^  skh[n  —  k]  =  h[n]  *  s[n]  (13.12) 

k 

In  short,  the  channel  (input-output)  signals  are  related  by  a  single-input-single-output  (SISO) 
linear  discrete  channel  with  transfer  function 

H(z)  =  (13-13) 

n 

The  SISO  discrete  representation  of  the  linear  QAM  signal  leads  to  the  standard  T-spaced 
equalizer  (TSE).  The  term  T -spaced  equalization  refer  to  processing  of  the  received  signal 
sampled  at  the  rate  of  1  /T.  Therefore,  the  time  separation  between  successive  samples  equals 
the  baud  (symbol)  period  T. 

The  optimal  matched  filter  receiver  faces  a  major  practical  obstacle  that  the  total  pulse 
shape  response  q(t)  depends  on  the  multipath  channel  environment.  In  reality,  it  is  practically 
difficult  to  adjust  the  receiver  filter  according  to  the  time-varying  q(t)  because  channel  environ¬ 
ment  may  undergo  significant  and  possibly  rapid  changes.  Moreover,  the  receivers  generally 
do  not  have  a  priori  information  on  the  channel  that  affects  q(t).  As  a  result,  it  does  not  make 
sense  to  implement  the  optimum  receiver  filter  q(— t)  in  a  dynamic  channel  environment.  It 
makes  better  sense  to  design  and  implement  a  time-invariant  receiver  filter.  Therefore,  the 
important  task  is  to  select  a  receiver  filter  without  losing  any  signal  information  in  y(0- 

To  find  a  solution,  recall  the  QAM  channel  input  signal 

x(t)  =  Y^skp(t-kT) 
k 

We  have  learned  from  Section  7.2  [see  Eq.  (7.9)]  that  the  power  spectral  density  of  an  amplitude- 
modulated  pulse  train  is 


SxV)  =  I  P(f)\2  ~ 


oo 

Rs[n\e~jn2jtfr 

_n=—oo 


Rs  1  =  sk+nsk 


(1 3.14a) 
(13.14b) 


by  simply  substituting  the  pulse  amplitude  a*  with  the  QAM  symbol  j*.  The  signal  sPeij.trV 
of  Eq.  (13.14a)  shows  that  the  signal  component  in  y(t)  is  limited  by  the  bandwidth  o  P 
or  P(f). 
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Figure  13.4 

Commonly  used 
receiver  filter 
matched  instead 
to  the  trans¬ 
mission  pulse. 


Receiver  filter  z(f)  z[f*J 


P(~t) 

_  t  =  nT 


Therefore,  the  receiver  filter  must  not  filter  out  any  valuable  signal  component  and  should 
have  bandwidth  equal  to  the  bandwidth  of  Pif).  On  the  other  hand,  if  we  let  the  receiver 
filter  have  a  bandwidth  larger  than  P(f ),  then  more  noise  will  pass  through  the  filter,  with  no 
benefit  to  the  signal.  For  these  reasons,  a  good  receiver  filter  should  have  bandwidth  exactly 
identical  to  the  bandwidth  of  P(f).  Of  course  many  such  filters  exist.  One  is  the  filter  matched 
to  the  transmission  pulse  p(t)  given  by 


p(-t)  <=*  P*(f) 


Another  consideration  is  that,  if  the  channel  introduces  no  additional  distortions,  then  c/(t)  = 
p{t).  In  this  case,  the  optimum  receiver  would  be  the  filter  pi-t)  matched  to  pit).  Consequently, 
it  makes  sense  to  select  pi-t)  as  a  standard  receiver  filter  (Fig.  13.4)  for  two  reasons: 

(a)  The  filter  pi-t)  retains  all  the  signal  spectral  component  in  the  received  signal  y(t ). 

(b)  The  filter  pi-t)  is  optimum  if  the  environment  happens  to  exhibit  no  channel  distortions. 

Therefore,  we  often  apply  the  receiver  filter  pi-t)  matched  to  the  transmission  pulse  shape 
pit).  This  means  that  the  total  channel  impulse  response  consists  of 


h{t)  =  qit)  *pi-t) 


Notice  that  because  of  the  filtering  z(0  =  p(-t)  *  y(t).  The  signal  z(0  now  becomes 


(13.15) 


k 


in  which  the  filtered  noise  term  w(/)  arises  from 

w(f)  =  p(-t)  *  n,,(/) 


(13.16) 


with  power  spectral  density 


sw(0  =  I P(f)\2  5iw(0 

Finally,  the  relationship  between  the  sampled  output  z[k ]  and  the  communication  symbols  s*  is 

z[n]  =  ^  h\n  —  k]st  +  w|nl 


k 


(13.17) 


k 


where  the  discrere  noise  samples  am  denoled  by  w|„]  =  wW). 
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Generally,  there  are  two  approaches  to  the  problem  of  channel  input  recovery  (i.e.,  equal¬ 
ization)  under  ISI  channels.  The  first  approach  is  to  determine  the  optimum  receiver  based 
on  channel  and  noise  models.  This  approach  leads  to  maximum  likelihood  sequence  estima¬ 
tion  (MLSE),  which  is  computationally  demanding.  A  low-cost  alternative  is  to  design  filters 
known  as  channel  equalizers  to  compensate  for  the  channel  distortion.  In  what  follows,  we 
first  describe  the  essence  of  the  MLSE  method  for  symbol  recovery.  By  illustrating  its  typi¬ 
cally  high  computational  complexity,  we  provide  the  necessary  motivation  for  the  subsequent 
discussions  on  various  complexity  channel  equalizers. 


13.2.2  Maximum  Likelihood  Sequence  Estimation  (MLSE) 

The  receiver  output  samples  {z[n]}  depend  on  the  unknown  input  QAM  symbols  {$„}  according 
to  the  relationship  of  Eq.  (13.17).  The  optimum  (MAP)  detection  of  {$„}  from  {z[fl]}  requires 
the  maximization  of  joint  conditional  probability  [Eq.  (1 1.81)]: 


max  p(. . . ,  sr i_i,  sn ,  5/I+1, 
bn) 


z[n-  l].z[/i].z[n+  1],  •••)  (13.18) 


Unlike  the  optimum  symbol-by-symbol  detection  for  AWGN  channels  derived  and  analyzed 
in  Sec.  11.6,  the  interdependent  relationship  in  Eq.  (13.17)  means  that  the  optimum  receiver 
must  detect  the  entire  sequence  {s,,}  from  a  sequence  of  received  signal  samples  [z[n]}- 
To  simplify  this  optimum  receiver,  we  first  note  that  in  most  communication  systems  and 
applications,  each  QAM  symbol  sn  is  randomly  selected  from  its  constellation  A  with  equal 
probability.  Thus,  the  MAP  detector  can  be  translated  into  a  maximum  likelihood  sequence 
estimation  (MLSE): 


max  /?(...,  z[n  -  1],  z[n],  z[n  +  1],  . . . 

{*/»}  v 


$n—  1  *  $n *  ^n+1  * 


(13.19) 


If  the  original  channel  noise  nf(f)  is  white  Gaussian,  then  the  discrete  noise  w[/t]  is  also 
Gaussian  because  Eq.  (13.16)  shows  that  w(r)  is  filtered  output  of  n^r).  In  fact,  we  can  define 
the  power  spectral  density  of  the  white  noise  n e(t)  as 

K(f)  =  y 

Then  the  power  spectral  density  of  the  filtered  noise  w (/)  is 


S»(f)  =  \P(f)\2  Sne(f)  =  J-  \P(f)\2 


(13.20) 


From  this  information,  we  can  observe  that  the  autocorrelation  function  between  the  noise 
samples  is 


/?w[f]  =  w[f  +  n]w*[n] 

=  w  (£T  +  nT)v/*(nT) 


/OO 

S„(f)e-MtTdf 

-OO 

\f  r°° 

=  T  \P(f)\2  e~j2,rfeTdf 
^  J- OO 


(13.21) 
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In  general,  the  autocorrelation  between  two  noise  samples  in  Eq.  ( 13.21 )  depends  on  the 
receiver  filter  which  is,  in  this  case,  p(—t).  In  Sec.  7.3,  the  ISI-free  pulse  design  based  on 
Nyquist’s  first  criterion  is  of  particular  interest.  Nyquist’s  first  criterion  requires  that  the  total 
response  from  the  transmitter  to  the  receiver  be  free  of  intersymbol  interferences.  Without 
channel  distortion,  the  QAM  system  in  our  current  study  has  a  total  impulse  response  of 

p(t)*P(-t)  «=*  \p(j)\2 

For  this  combined  pulse  shape  to  be  free  of  ISI,  we  can  apply  the  first  Nyquist  criterion  in  the 
frequency  domain 


1 

T 


E 


k=-oo 


(/+f 


) 


2 


1 


This  is  equivalent  to  the  time  domain  requirement 


p(t)*p(-t) 


i  e  =  o 

0  f  =  ±1,  ±2,  ... 


(13.22a) 


(13.22b) 


In  other  words,  the  Nyquist  pulse-shaping  filter  is  equally  split  between  the  transmitter  and 
the  receiver.  According  to  Eq.  (13.22a),  the  pulse-shaping  frequency  response  P(f)  is  the 
square  root  of  a  pulse  shape  that  satisfies  Nyquist’s  first  criterion  in  the  frequency  domain. 
If  the  raised-cosine  pulse  shape  of  Section  7.3  is  adopted,  then  P(f)  would  be  known  as  the 
root-raised-cosine  pulse.  For  a  given  roll-off  factor  r.  the  root-raised-cosine  pulse  in  the  time 
domain  is 


PnciO  — 


2r  “Sl 

[d+r>y 

H 

H) 

1 

1 

sin  (1  —  r)?y 

1-1 

(^f) 

1 

(13.23) 


Based  on  the  ISI-free  conditions  of  Eq.  ( 1 3.22b),  we  can  derive  from  Eq.  ( 1 3.2 1 )  that 


=  t  f  \P{f)\2e  j2nfndf 

2.  7-oo 

M 

=  —p«)*  pi-0  t=lT 

%  t  =  o 

0  t  =  ±  1,±2,  ... 


(13.24) 


This  means  .ha,  ,he  noise  samples  |w|n|l  are  uneorrelared.  Since  the  noise  samples _(w|»||  are 
Gaussian,  they  are  also  independent  Asa  result,  the  conditional  joint  probability  ol  Eq.  ( 1 3. 19) 

becomes  much  simpler 


p(...,z[n-  1],  z[«l.  zl"+  • 

=  nK21"-'1 


• »  Sn—  1  ♦  sn *  sn+ 1 
.  . . ,  Sn—  1 »  *^n+ 1 »  *  *  *^ 


....) 


(13.25) 
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Indeed,  Eq.  ( 1 3.24)  tells  us  that  z[n-i]  is  Gaussian  with  equal  variance  AT/2  and  mean  value  of 

^  '  h[k]Sn—i—k 
k 


Therefore,  the  MLSE  optimum  receiver  under  Gaussian  channel  noise  and  root-raised-cosine 
pulse  shape  pnc(t)  [Eq.  (13.23)], 


max  In 

M 


sn—  1 »  ^/i+l » 


max 

M 


z[n  -  /]  -  y^  hWsn-j-ic 


(13.26a) 


Thus,  MLSE  is  equivalent  to 

2 


min  V 

bnl  “ 


i 


z[n  - 1]  -  ^  h[k]s„-i-k 
k 


(13.26b) 


For  a  vast  majority  of  communication  channels,  the  impulse  response  h[k]  can  be  closely 
approximated  as  a  finite  impulse  response  (FIR)  filter  of  some  finite  order.  If  the  maximum 
channel  order  is  L  such  that 


L 

H(z)  =  J^li[k]z~k 

k=  0 

then  the  MLSE  receiver  needs  to  solve 


min 

M 


E 


z\n  i]  y  'h[k j—k 

k= 0 


2 


(13.27) 


We  note  that  the  MLSE  algorithm  requires  that  the  receiver  possess  the  knowledge  of  the  dis¬ 
crete  channel  coefficients  [h[k]}.  When  exact  channel  knowledge  is  not  available,  the  receiver 
must  first  complete  the  important  task  of  channel  estimation. 

MLSE  Complexity  and  Practical  Implementations 

Despite  the  apparent  high  complexity  of  the  MLSE  algorithm  [Eq.  (13.27)],  there  exists  a 
much  more  efficient  solution  given  by  Viterbi3  based  on  the  dynamic  programming  principle  o 
Bellman.3  This  algorithm,  often  known  as  the  Viterbi  algorithm,  does  not  have  an  exponentially 
growing  complexity  as  the  data  length  grows.  Instead,  if  the  QAM  constellation  size  is  AL 
then  the  complexity  of  the  Viterbi  algorithm  grows  according  to  M  L.  The  Viterbi  algorithm  is  a 
very  powerful  tool,  particularly  when  the  channel  order  L  is  not  very  long  and  the  constellation 
size  M  is  not  huge.  The  details  of  the  Viterbi  algorithm  will  be  explained  in  Chapter  15 
we  present  the  decoding  of  convolutional  codes. 

MLSE  is  very  common  in  practical  applications.  Most  notably,  many  GSM  cellular 
receivers  perform  the  MLSE  detection  described  here  against  multipath  distortions.  Because 
GSM  uses  binary  constellations  in  voice  transmission,  the  complexity  of  the  MLSE  receivers 
is  reasonably  low  for  common  cellular  channels  that  can  be  approximated  as  FIR  responses  o 
order  3  to  8. 
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Figure  13.5 

A  SISO  discrete 
linear  channel 
model  for  TSE. 


On  the  other  hand,  the  modulation  formats  adopted  in  high-speed  dial-up  modems  are 
highly  complex.  For  example,  the  V.32bis  (14.4  kbit/s)  modem  uses  a  trellis-coded  QAM  con¬ 
stellation  of  size  128  (with  64  distinct  symbols)  at  the  symbol  rate  of  2400  baud  (symbols/s). 
In  such  applications,  even  a  relatively  short  L  =  5  FIR  channel  would  require  MLSE  to  have 
over  1  billion  states.  In  fact,  at  higher  bit  rates,  dial-up  modems  can  use  size  256  QAM  or  even 
size  960  QAM.  As  a  result,  the  large  number  of  states  in  MLSE  makes  it  completely  unsuit¬ 
able  as  a  receiver  in  such  systems.  Consequently,  suboptimal  equalization  approaches  with 
low  complexity  are  much  more  attractive.  The  design  of  simple  and  cost  effective  equalizers 
(deployed  in  applications  including  voiceband  dial-up  modems)  is  discussed  next. 

13.3  LINEAR  7-SPACED  EQUALIZATION  (TSE) 

When  the  receiver  filter  is  matched  to  the  transmission  pulse  p(t)  only,  it  is  no  longer  optimum  * 
Even  if  the  ideal  matched  filter  q(-t)  is  known  and  applied,  it  is  quite  possible  in  practice  for 
the  sampling  instant  to  have  an  offset  t0  such  that  the  sampling  takes  place  at  t  =  nT  +  /<>. 
Such  a  sampling  offset  is  known  as  a  timing  error.  When  there  is  a  timing  error,  the  receiver 
is  also  not  optimum.  It  is  in  fact  commonplace  for  practical  communication  systems  to  have 
unknown  distortive  channels  and  timing  jitters.  Nevertheless,  7-spaced  equalization  is  simpler 
to  implement.  Here  we  discuss  the  fundamental  aspects  of  TSE  design. 

Because  7-spaced  sampling  leads  to  a  simple  discrete  time  linear  system  Eq.  (13.17)  as 
shown  in  Fig.  1 3.5,  the  basic  linear  equalizer  is  simply  a  linear  filter  F(z)  followed  by  a  direct 
QAM  decision  device.  The  operational  objective  of  the  equalizer  (filter)  F(z)  is  to  remove  as 
much  ISI  as  possible  from  its  output  d\n).  We  begin  our  discussion  on  the  7-spaced  equalizer 
(TSE)  by  denoting  the  (causal)  equalizer  transfer  function 

F(z)  = 

i 

If  the  channel  noise  w[/i]  is  included,  the  TSE  output  is 

d[n\  =  F(z)z[n]  =  F(z)//(z)5„  +  HQwM  (13.28) 

signal  term  noise  term 

We  denote  the  joint  channel  equalizer  transfer  function  as 

00 

C(z)  =  F(z)H(z)  =  J2CiZ~‘ 

1=0 

The  goal  of  the  equalizer  F <i>  is  lo  clean  up  .he  ISI  in  <f[»)  lo  achieve  an  enor-free  decision 

S„  =  dec  (</[«])  =  s„-u  (13.29) 


The  sufficient  statistics  shown  by 


G  D.  Forney1  are  not  necessarily  retained. 
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where  u  is  a  fixed  delay  in  the  equalizer  output.  Because  both  the  channel  and  the  equalizer 
must  be  causal,  the  inclusion  of  a  possible  delay  u  provides  opportunities  for  simpler  and  better 
equalizer  designs. 

To  better  understand  the  design  of  the  TSE  filter  F(z ),  we  can  divide  the  TSE  output  into 
different  terms 

OO  00 

d[n]  =  +  X>]w[*  -  i] 

1=0  1=0 

oo  oo 

=  cu  Sn-U  +  ^2  Ci*n-i  +  - »']  (13.30) 

1=0,  i^M  1=0 

ISI  term  noise  term 

The  equalizer  filter  output  d[n]  consists  of  the  desired  signal  component  with  the  right  delay, 
plus  the  ISI  and  noise  terms.  If  both  the  ISI  and  noise  terms  are  zero,  then  the  QAM  decision 
device  will  always  make  correct  detections  without  any  error.  Therefore,  the  design  of  this 
linear  equalizer  filter  F(z)  should  aim  to  minimize  effect  of  the  ISI  and  the  noise  terms.  In 
practice,  there  are  two  very  popular  types  of  linear  equalizer:  zero-forcing  (ZF)  design  and 
minimum  mean  square  error  (MMSE)  design. 

13.3.1  Zero-Forcing  TSE 

The  principle  of  zero-forcing  equalizer  design  is  to  eliminate  the  ISI  term  without  considering 
the  noise  effect.  In  principle,  a  perfect  ZF  equalizer  F(z)  should  force 


OO 

cisn-i  =  0 

i=0,i^u 

In  other  words,  all  ISI  terms  are  eliminated 


1  i  =  u 

C‘  0  /  ^  u 

Equivalently  in  frequency  domain,  the  ZF  equalizer  requires 

C(z)  =  F(z)H(z)  =  z~u 

Notice  that  the  linear  equalizer  F(z)  is  basically  an  inverse  filter  of  the  discrete 
H (z)  with  appropriate  delay  u 


(13.31a) 


(13.31b) 
ISI  channel 


F(z)  = 


H(z) 


(13.31c) 


It  the  ZF  filter  of  Eq.  (13.3 1  c)  is  causal  and  can  be  implemented,  then  the  ISI  is  complete  - 
eliminated  from  z[«].  This  appears  to  be  an  excellent  solution,  since  the  only  decision  that  t  e 
decision  device  now  must  make  is  based  on 


z[«]  =  sn-u  +  F(z)w[n] 


i 
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without  any  ISI.  One  major  drawback  of  the  ZF  equalizer  lies  in  the  remaining  noise  term 
F(z)w[/;].  If  the  noise  power  in  z[n]  is  weak,  then  the  QAM  decision  would  be  highly  accurate. 
Problems  arise  when  the  transfer  function  F(z)  has  strong  gains  at  certain  frequencies.  As  a 
result,  the  noise  term  F(z)w[az]  may  be  amplified  at  those  frequencies.  In  fact,  when  the 
frequency  response  of  H(z)  has  spectral  nulls,  that  is, 

H^0)  =  0  for  some  co0  €  [0,  tt] 

then  the  ZF  equalizer  F(z)  at  co0  would  have  infinite  gain,  and  substantially  amplify  the  noise 
component  at  coQ. 

A  different  perspective  is  to  consider  the  filtered  noise  variance.  If  w[/?J  are  independent 
identically  distributed  (i.i.d.)  Gaussian  with  zero  mean  and  variance  AT  /2,  then  the  filtered 
noise  term  equals 

00 

vv[rt]  =  F(zMm]  =  -  0 

i=0 


The  noise  term  w[n]  remains  Gaussian  with  mean 


00  00  _ 

^/[/]w[n  -  t]  =  £/[j']w[n  -  /]  =  0 

;= o  i=o 


and  variance 


£/[i]w[«  -  i] 

t=0 


=M/2-£m2 

1=0 


Because  the  ZF  equalizer  output  is 

z[n]  =  sn-u  +  vv[fl] 


the  probability  of  decision  error  in  dec  (z[n]>  can  therefore  be  analyzed  by  applying  th^same 
tools  used  in  Chapter  1 1  (Sec.  1 1 .6).  In  particular,  under  BPSK  modulation.  sn  =  ±^Eh  with 
equal  probability.  Then  the  probability  of  detection  error  is 


Pb  =  Q 


(13.32) 


where  the  ZF  equalizer  parameters  can  be  obtained  via  the  inverse-Z  transform 


■hi 


1  — u 


2njJ  H(z) 


dz 


(13.33) 


nulls,  then/t/]  from  Eq.  (13.33)  may  become  very  large,  causing  a 


If  F(eja>)  has  spectral 
serious  increase  of  Pb- 
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Example  13.1  Consider  a  first-order  channel 

H(z)  =  1  +  z~2 

Determine  the  noise  amplification  effect  on  the  ZF  equalizer  for  a  BPSK  transmission. 

Because  )  =  0  when /  =  ±  1  /4,  it  is  clear  that  H{z)  has  spectral  nulls.  By  applying 

the  ZF  equalizer,  we  have 

1  /  z‘~'~u 

=  ^ijf  T+P1* 

(  0  i  <  u 

~  {  (— 1)'-"  i>u 

Therefore, 

oo  oo 

£ra2  =  £(n  =  oo 

i=0  i—u 

This  means  that  the  BER  of  the  BPSK  transmission  equals 

Ph  =  Q  (0)  =  0.5 

The  noise  amplification  is  so  severe  that  the  detection  is  completely  random. 


Example  13.1  clearly  shows  the  significant  impact  of  noise  amplification  due  to  ZF 
equalization.  The  noise  amplification  effect  strongly  motivates  other  design  methodologies 
for  equalizers.  One  practical  solution  is  the  minimum  mean  square  error  (MMSE)  design. 


13.3.2  TSE  Design  Based  on  MMSE 

Because  ot  the  noise  amplification  effect  in  ZF  equalization,  we  must  not  try  to  eliminate  the 
1SI  without  considering  the  negative  impact  from  the  noise  term.  In  fact,  we  can  observe  the 
equalizer  output  in  Eq.  (13.30)  and  quantify  the  overall  distortion  in  d[n]  by  considering  the 
difference  (or  error) 


d[n]  -  Sn-U  =  ^2  cisn—i  -  s„-u  +  -  i] 

1=0,1^14 


1=0 


(13.34) 


To  reduce  the  number  of  decision  errors  when 


dec  (</[«])  ^  j„_H 

it  would  be  sensible  to  design  an  equalizer  that  would  minimize  the  mean  square  error  between 
d[n]  and  sn-u.  In  other  words,  the  MMSE  equalizer  design  should  minimize 


\d[n]-sn-u\2 


(13.35) 
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Let  us  now  proceed  to  find  an  equalizer  filter  that  can  minimize  the  mean  square  error  of 
Eq.  (13.35).  Once  again,  we  will  apply  the  principle  of  orthogonality  in  optimum  estimation 
(Sec.  8.5),  that  the  error  (difference)  signal  must  be  orthogonal  to  the  signals  used  in  the  filter 
input.  Because  d[n]  =  f[i]z\n  —  /],  we  must  have 

( d[n ]  -  sn—u)  _L  z[n  -  i ]  t  =  0,  1 ,  . . . 

In  other  words. 


(d[n]-sn.u)  z*[n-t]  =  0  i  =0,  1,  ...  (13.36) 

Therefore,  the  equalizer  parameters  [f[i]}  must  satisfy 


(pm*  - fl  “  sn-u  j  Z*[n  -  i)  =  0  i  =  0,  I,  . . . 

Note  that  the  signal  sn  and  the  noise  w|n)  are  independent.  Moreover,  {.v„J  are  also  i.i.d.  with 
zero  mean  and  variance  Es.  Therefore,  s,i_Mw*|/i|  =  0,  and  we  have 


S„-uZ*[n  ~  i]  =  Sn-u  (Y,hW*Sn-j-t  +  wl/1  " 
j—0 


=  h\j]*  s„-u  s*n-j-t  +  0 

7=0 

[  Es  ■  h[u  - 1]*  o  <1<U 


0 


>  u 


(13.37) 


Let  us  also  denote 


R.[m]  =  z\n  +  m]z*[n\ 


Then  the  MMSE  equalizer  is  the  solution  to  linear  equations 

°°  I  Esh[u  —  £]*  i  =  0,  1 « 

^/[/] /?;[«-/]  =  |o  £  =  «+!,«  + 2 . oo 

1=0 

Based  on  the  channel  output  signal  model,  we  can  show  that 


Rz["i}  =  (T,hiSn+m-i+w[n+m] 

\i= 0 

=  Esf^hm+jhJ  +  jS[m] 
j=° 


(13.38) 


(13.39) 


(13.40) 
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Minimum  MSE  and  Optimum  Delay 

Because  of  the  orthogonality  condition  Eq.  (13.36),  we  have 


{d[ri\  -  s„-u)d[n]*  =  0 

Hence,  the  resulting  minimum  mean  square  error  is  shown  to  be 


MSE(w)  =  ( sn.u  -  d[h\)  s*_u 
=  Es{\~cu) 


-*(*-£ 

\  ;=0 


-  V  hif[u-i] 


(13.41) 


It  is  clear  that  MMSE  equalizers  of  different  delays  can  lead  to  different  mean  square  error 
results.  To  find  the  delay  that  achieves  the  least  mean  square  error,  the  receiver  can  determine 
the  optimum  delay  according  to 


u0  =  arg  max  Y'  hj[u  -  i] 

u  ' 


(13.42) 


i=0 


Finite  Length  MMSE  Equalizers 

Because  we  require  the  equalizer  F(z)  to  be  causal,  the  MMSE  equalizer  based  on  the  solution 
of  Eq.  (13.39)  does  not  have  a  simple  closed  form.  The  reason  is  that  {/[/]}  is  causal  while 
Rz[m]  is  not.  Fortunately,  practical  implementation  of  the  MMSE  equalizer  often  assumes  the 
form  of  a  finite  impulse  response  (FIR)  filter.  When  F(z)  is  FIR,  the  MMSE  equalizer  can  be 
numerically  determined  from  Eq.  (13.39).  Let 


M 


nz)  = 


1=0 


The  orthogonality  condition  of  Eq.  (13.39)  then  is  reduced  to  a  finite  set  of  linear  equations 


£/WflZ[*-/]=  {  QSk[U  eV 
i= 0  l 


l  =  0,  1 . u  (13.43a) 

t  =  2,  . . . ,  M 


Alternatively,  we  can  write  the  MMSE  condition  into  matrix  form  for  u  <  M : 


X-[  0]  Rz[- 1] 
RdU  Rz[  0] 

RJM-l] 


Rzl  0] 


"  /[0]  ' 

/[l] 

.  f[M]  _ 

-E„ 


h[u]* 
h[U  -  ir 


/t[0]* 

0 


M  +  1  rows 


(13.43b) 
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Of  course,  if  the  delay  u  exceeds  M ,  then  the  right  hand  side  of  Eq.  ( 13.43b)  becomes 


R-M 

RzlU 


Rzl- 1] 
R-M 


Rz[-M] 

Rzl  1  ~M\ 


"  /10)  " 

h[u\* 

/ID 

= 

h\u  -  1]* 

_  f\M  | 

_  h[u  -  M  |*  _ 

(13.43c) 


_RZ[M]  fl:[M-l]  •••  /?;[0] 

The  solution  is  unique  so  long  as  the  autocorrelation  matrix  in  Eq.  (13.43c)  has  full  rank. 


MMSE  vs.  ZF 

Note  that  if  we  simply  set  the  noise  spectral  level  to  fif  =  0,  the  MMSE  equalizer  design  of 
Eqs.  (13.39)  and  (13.43c)  is  easily  reduced  to  the  ZF  design.  In  other  words,  the  only  design 
change  from  MMSE  to  ZF  is  to  replace  Rz[ 0|  from  the  noisy  to  the  noise-free  case  of 

00 

Rz  10]  =  Es  £  \hj\ 2 
i= o 


All  other  procedures  can  be  directly  followed  to  numerically  obtain  the  ZF  equalizer  parameters. 

It  is  important  to  understand,  however,  that  the  design  of  finite  length  ZF  equalizers 
according  to  Eq.  (13.43c)  may  or  may  not  achieve  the  objective  of  forcing  all  IS1  to  zero.  In 
fact,  if  the  channel  H(z)  has  finite  order  L  then  ZF  design  would  require 

M  L 

F(z)H(z)  =  £/[*-'  Ytl,[i]z~'  =  Z~U 
1=0  1=0 


This  equality  would  be  impossible  for  any  stable  causal  equalizer  to  achieve.  The  reason  is 
quite  simple  if  we  consider  the  basics  of  polynomials.  The  left-hand  side  is  a  polynomial  of 
order  M+L  Hence  it  has  a  total  of  M+L  roots,  whose  locations  depends  on  the  channel  and 
the  equalizer  transfer  functions.  On  the  other  hand,  the  right-hand  side  has  a  root  at  oo  only.  It 
is  therefore  impossible  to  fully  achieve  this  zero-forcing  equality.  Thus,  one  would  probably 
ask  the  following  question:  What  would  a  finite  length  equalizer  achieve  if  designed  according 

W  ^ The' ant wer  can  in  fact  be  found  in  the  MMSE  objective  function  when  the  noise  is  zero. 
Specifically,  the  equalizer  is  designed  to  minimize 


\d[n]  -  s„-u\2  =  I F(z)H ( z)s„  - 

„.  .he  finite  ie„,h  ZFdesig„  ^  on  Eq.  (13.43c)  wifi 
minimize  the  ISI  distortion  that  equals 

|c„-l|2  +  I>|2 


M 


^f[i]h[u  -i  1-1 
/=o 


+  E 

j^U 


M 


jyinw  - 


i= 0 


2 
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In  other  words,  this  equalizer  will  minimize  the  contribution  of  ISI  to  the  mean  square  error 
in  d[ri\. 

Finite  Data  Design 

The  MMSE  (and  ZF)  design  of  Eqs.  (13.39)  and  (13.43c)  assumes  statistical  knowledge  of 
Rz[m]  and  sn-uz*  [/*  —  £].  In  practice,  such  information  is  not  always  readily  available  and 
may  require  real-time  estimation.  Instead,  it  is  more  common  for  the  transmitter  to  send  a  short 
sequence  of  training  (or  pilot)  symbols  that  the  receiver  can  use  to  determine  the  optimum 
equalizer.  We  now  describe  how  the  previous  design  can  be  directly  extended  to  cover  this 
scenario. 

Suppose  a  training  sequence  {sflJ  n  =  n\,  n\  +  l . ^2}  is  transmitted.  To  design  an  FIR 

equalizer 


F(z)=m  +f[l]z~'  +  ...+f\M]z-M 


we  can  minimize  the  average  square  error 


J  = 


«2  -  n\  +  1 


£  !</[«]  -Sn-ul2 


n=u+n  j 


where 


M 

d[n]  =  £)/[/]z[/t  -  i] 
1=0 


To  minimize  J ,  we  can  take  its  gradient  with  respect  to/|y|.  By  setting  the  gradient  to  zero, 
we  can  derive  the  conditions  required  by  the  optimum  equalizer  parameters 


M 


£/w 


1 

«2  ~«l  +  1 


W+/22 

£  z[n-  i]z*[n-j] 

n=u+n\ 


1 

«2  —  «1  +  1 


M+rt2 

x  J2  sn-uZ*[n  —  j]  y‘  =  0,  1 . M 

n=u+n\ 


(13.44) 


These  M  +  1  equations  can  be  written  more  compactly  as 


'/?;[0,0]  *-[1,0]  • 

^[0,1]  KJ1.1]  • 

‘  /[0]  " 
/  m 

R.SZ  [  ] 

*«[— w  + 

AtO.A/]  RZ[\,M]  ■ 

■■  RZ[M  ,M]_ 

_f[M]  _ 

*K[— u  +  M]  _ 

(13.45) 


where  we  denote  the  time  average  approximations  of  the  correlation  functions  (for  *»./ 
0,  1,  M): 


RzUJ] 

R.k[~u  +y] 


l 

«2  -  n\  +  1 


M+rt2 

£  z[n-i]z*[n-j] 

n=u+n  i 


i 


u+n  2 

£  Sn-uZ* [n  -j] 
n=u+n\ 


«2  ~  n\  +  1 
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It  is  quite  clear  from  comparing  Eqs.  (13.45)  and  ( 13.43c)  that  under  a  short  training  sequence 
(preamble),  the  optimum  equalizer  can  be  obtained  by  replacing  the  exact  values  of  the  corre¬ 
lation  function  with  their  time  average  approximations.  If  matrix  inverse  is  to  be  avoided  for 
complexity  reasons,  adaptive  channel  equalization  is  a  viable  technology.  Adaptive  channel 


equalization  was  first  developed  by  Lucky  at  Bell  Labs4,  s  for  telephone  channels.  It  belongs 
to  the  field  of  adaptive  filtering.  Interested  readers  can  refer  to  the  book  by  Ding  and  Lif>  and 
the  references  therein. 


1 3.4  LINEAR  FRACTIONALLY  SPACED 
EQUALIZERS  (FSE) 


We  have  shown  that  when  the  channel  response  is  unknown  to  the  receiver,  TSE  is  likely  to 
lose  important  signal  information.  In  fact,  this  point  is  quite  clear  from  the  sampling  theory. 
As  shown  by  Gitlin  and  Weinstein,7  when  the  transmitted  signal  (or  pulse  shape)  does  have 
spectral  content  beyond  a  frequency  of  1  /(2T)  Hz,  baud  rate  sampling  at  the  frequency  of  I /T 
is  below  the  Nyquist  rate  and  can  lead  to  spectral  aliasing.  Consequently,  receiver  performance 
may  be  poor  because  of  information  loss. 

In  most  cases,  when  the  transmission  pulse  satisfies  Nyquist’s  first  criterion  of  zero  ISI, 
the  received  signal  component  is  certain  to  possess  frequency  content  above  l/(2F)  Hz.  For 
example,  when  a  raised-cosine  (or  a  root-raised-cosine)  pulse  p„c(t)  is  adopted  with  roll-off 
factor  r  [Eq.  (13.23)],  the  signal  component  bandwidth  is 


13.4.1  The  Single-Input-Multiple-Output  (SIMO)  Model 


receiver  front 
end  for  FSE. 


Receiver  filter 
p(~t) 
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Any  sampling  rate  of  the  form  1  /  A  =  m/T  (m  >  1 )  will  be  above  the  Nyquist  sampling  rate 
and  can  avoid  aliasing.  For  analysis,  denote  the  sequence  of  channel  output  samples  as 

00 

z(kA)  =  X snh(kA  —  nT)  4-  w(£A) 

n= 0 
00 

=  ^^snh(kA  —  nm  A)  4-  w(A'A)  (13.46) 

n= 0 

To  simplify  our  notation,  the  oversampled  channel  output  z(k  A )  can  be  reorganized  (decimated) 
into  m  parallel  subsequences 

=  z(kT  +  /A) 

=  z(kmA  4*  /  A) 

00 

=  X  snh(kmA  4*  /A  —  n  m  A)  4-  w(km  A  4-  /  A). 

n= 0 
oo 

=  ^Tsnh(kT  -  nT  4-  /  A)  4-  w (kT  4-  «A)  i  =  1,  . . m  (13.47) 

n= 0 

Each  subsequence  n[k]  is  related  to  the  original  data  via 

Zi[k]  =  z(kT  +  /A)  =  Sk  *  h(kT  4-  /A)  4-  w^T  4-  /A) 

In  effect,  each  subsequence  is  an  output  of  a  linear  subchannel.  By  denoting  each  subchannel 
response  as 


oo 

hi{k]  =  h(kT  +  /A)  Hi(z)  =  £>[*];-* 

*=o 

and  the  corresponding  subchannel  noise  as 


w,[/c]  =  w(*T  +  iA) 

then  the  reorganized  m  subchannel  outputs  are 


00 

£/[^]  =  ^  ^  snhi[k  n]  4~  w,-[A:] 

n=0 


oo 

=  XI  +  W; [k]  i  =  1 ,  . . . ,  m 

n= 0 


(13.48) 


Thus,  these  m  subsequences  can  be  viewed  as  stationary  outputs  of  m  discrete  channel 
with  a  common  input  sequence  s[k]  as  shown  in  Fig.  13.7.  Naturally,  this  represents  a  single- 
input— multiple-output  (SIMO)  system  analogous  to  a  physical  receiver  with  ni  antennas.  The 
FSE  is  in  fact  a  bank  of  m  filters  (F,  (z)}  that  jointly  attempts  to  minimize  the  channel  distortion 
shown  in  Fig.  13.7. 


13.4  Linear  Fractionally  Spaced  Equalizers  (FSE) 
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Figure  13.7 

Equivalent 
structure  of 
fractionally 
spaced 

equalizers  (FSE). 


FSE 


1  3.4.2  FSE  Designs 

Based  on  the  SIMO  representation  of  the  FSE  in  Fig.  13.7,  one  FSE  filter  is  provided  for  each 
subsequence  zAk].  In  fact,  the  actual  equalizer  is  a  vector  of  filters 

M 

Fi(z)  =  «'  =  1 . «  (,3'49> 

k=  0 

The  m  filter  outputs  are  summed  to  form  the  stationary  equalizer  output 

y[k]  =  jr  ^fMzilk  -  n]  (13.50) 

i=l  n= 0 

Given  the  linear  relationship  between  equalizer  output  and  equalizer  parameters,  any  TSE 
design  criterion  can  be  generalized  to  the  FSE  design. 


Todesten  aZF  FSE,  the  goal  is  to  eliminate  all  ISI  at  the  input  of  the  decision  device.  Because 
toe Tnow  parallel  subchannels,  .he  ZF  hirers  should  sarisfy 


C(z)  =  f^F,(z)W/(z)  =  z"“ 


(13.51) 


1=1 


This  zero-forcing  requirenwrrt neveads  iurcowlecdon to ^wdbknown^rqualily 

hnown"  to  IX  in  J*—  —W  -W-  *■»  «  - 
orders  up  to  L. 
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If  A\  (z)  and  A2(z)  do  not  share  any  common  root,  then  they  are  called  coprime.  The  Bezout 
identity  states  that  if  A\  (z)  and  Aiiz)  are  coprime,  then  there  must  exist  two  polynomials 

M  M 

B\ ( z )  =  ^  b\ jz~‘  and  Bi(z)  =  ^  buz*1 
i= o  i=0 

such  that 

Bi(z)Al(z)+B2(z)A2(z)=  1 

The  order  requirement  is  that  M  >  L  —  1 .  The  solution  of  B\  (z)  and  B2(z)  need  not  be  unique. 
It  is  evident  from  the  classic  text  by  Kailath8  that  the  ZF  design  requirement  of  Eq.  ( 13.51)  is 

an  /w-channel  generalization  of  the  Bezout  identity.  To  be  precise,  let  {//,(z),  /  =  1,2 . m) 

be  a  set  of  finite  order  polynomials  of  z~ 1  with  maximum  order  L.  If  the  ///-subchannel  transfer 
functions  {///(z)}  are  coprime,  then  there  exists  a  set  of  filters  {F,(z)}  with  orders  M  >  L  -  1 
such  that 


J2  Fi(z)Hi(z)  =  Z~u  (13.52) 

i=\ 

where  the  delay  can  be  selected  from  the  range  u  =  0,  1,  . . . ,  M  +  L-  1.  Note  that  the 
equalizer  filters  {/^(z)}  vary  with  the  desired  delay  u.  Moreover,  for  each  delay  w,  the  ZF 
equalizer  filters  {F/(z)J  are  not  necessarily  unique. 

We  now  describe  the  numerical  approach  to  finding  the  equalizer  filter  parameters.  Instead 
of  continuing  with  the  polynomial  representation  in  the  z-domain,  we  can  equivalently  find 
the  matrix  representation  of  Eq.  (13.52)  as 


"Ziior 

~hd0] 

.  MO] 

1 

/.[  i] 

*|[1]  ' 

••  /iitOJ 

.  Ml]  '• 

•  MO] 

MM] 

h\[L]  ‘ 

••  /«i[l] 

.  hm[L]  •• 

•  Ml] 

MO] 

- 

h\[L] 

hm[L] 

MU 

7Y  :  (L  +  M)  x  m(M  -F  1 ) 

JmW\_ 

m(M  + 1 )  x  1 

(M  +L)xl 


z/th 


(13.53) 


The  numerical  design  as  a  solution  to  this  ZF  design  exists  if  and  only  if  H  has  full  row 
rank,  that  is,  if  the  rows  of  H  are  linearly  independent.  This  condition  is  satisfied  for  FSE  0  e  ’ 
m  >  1)  it  A/  >  L  and  {//;(z)}  are  coprime.6 
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MMSE  FSE  Design 

We  will  apply  a  similar  technique  to  provide  the  MMSE  FSE  design.  The  difference  between 
FSE  and  TSE  lies  in  the  output  signal 


m  M 


</[«]  = 


1=1  Jt=0 


To  minimize  the  MSE  | d[n  -  s„_u|2,  the  principle  of  orthogonality  leads  to 

(d[n]~  s„-„)  zfln-l\  =  0  £  =  0,  I . M,  j  =  1 . m  (13.54) 

Therefore,  the  equalizer  parameters  {/H*])  must  satisfy 


m  M  _ 

J2  ^2fi[k]Zi[n-k]zJ[n-i]  =  s„-uz*[n  -  i ]  l  =  0.  I . M.  j  =1,2 . 

i=l  *=o 

There  are  m(M  +  1)  equations  for  the  m(M  +  1 )  unknown  parameters  [f,\k m, 
^  _  o  M.  The  MMSE  FSE  can  be  found  as  a  solution  to  this  set  of  linear  equations.  In 
terms  of  practical  issues,  we  should  also  make  the  following  observations. 

•  When  we  have  only  finite  length  data  to  estimate  the  necessary  statistics. 


m 


Sn-U  Z*[«  -  and  z'l"  -  *1  -  n 

can  be  replaced  by  their  time  averages  from  the  limited  data  collection.  This  is  similar  to  the 
TSE  design. 

•  Also  similar  to  the  MMSE  TSE  design,  different  values  of  delay  u  will  lead  to  different  mean 
square  errors.  To  find  the  optimum  delay,  we  can  evaluate  the  MSE  for  all  possible  delays 
,i  =  0l  M  +  L  -  1  and  choose  the  delay  that  results  in  the  lowest  MSE  value. 

Since  their  first  appearance,7  adaptive  equalizers  have  often  been  implemented  as  FSE 
When  training  data  can  be  had.  FSE  has  the  advantage  of  suppressing  timing  phase  sensitivity 
Unlike  the  case  in  TSE,  linear  FSE  does  not  necessarily  amplify  the  channel  noise.  Indeed,  the 
noise  amplification  effect  depends  strongly  on  the  coprime  channel  condition.  In  some  cases, 
the  subchannels  in  a  set  do  not  strictly  share  any  common  zero.  However,  them  is  a,  least  one 
point  Za  that  is  almost  the  root  of  all  the  subchannels,  that  is, 

Hi(Za)K  0  i  =  1 . m 

.  u  U  nr,  rinse  to  being  singular.  When  the  subchannels  are  coprime 

'-r::PhhLot  — 


1 3.5  CHANNEL  ESTIMATION 

r  A  ran  the  direct  equalizer  design  approach  in  which  the  equalizer 
mter  ^.em  « Ely  estimated  front  the  channel  input  signal  »„  and  the  channel  output 
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signals  z,[n].  We  should  recognize  that  if  MLSE  receiver  is  implemented,  the  MLSE  algorithm 
requires  the  knowledge  of  channel  parameters  {/#]).  When  exact  channel  knowledge  is  not 
available,  the  receiver  must  first  complete  the  important  first  step  of  channel  estimation. 

In  channel  estimation,  it  is  most  common  to  consider  FIR  channels  of  finite  order  L. 
Similar  to  the  linear  estimation  of  equalizer  parameters  introduced  in  the  last  section,  channel 
estimation  should  first  consider  the  channel  input-output  relationship 


L 


z[«]  =  +  w[n] 

k=o 


(13.55) 


If  consecutive  pilot  symbols  {sn,  n  =  n\9  n\  +  1,  . . >12}  are  transmitted,  then  because  of  the 
finite  channel  order  L,  the  following  channel  output  samples 


{z[nl  n  =  n\  +  L,  n\  +  L  +  1 . n2] 


depend  on  these  pilot  data  and  noise  only.  We  can  apply  the  principle  of  MMSE  to  estimate 
the  channel  coefficients  {h[k]}  to  minimize  the  average  estimation  error: 


2 


L 


E  zM  -  E  <13-56) 


7(M0],  h[l],...,h[L])  = 


«2  ~n\  -L  +  1 


fll+L  k=0 


This  MMSE  estimation  can  be  simplified  by  setting  to  zero  the  derivative  of  the 
^[0],  Ml]*  •  •  •  *  h[M])  with  respect  to  each  h[j].  Removing  redundant  constants,  we  have 


J2  Zf nK-j  -  WJ  •  E  Sn-kS*„.j  =0  j  =  0,  1 . L 


Therefore,  by  defining 


and 


n\+L 


n\+L 


we  can  simplify  the  MMSE  channel  estimation  into  a  compact  matrix  expression: 


Rz[0,0]  RJO,  1]  ...  R;[0,L]1  pi[0n  frK[0]' 

^[1.0]  Rz[  1,1]  ...  R.[\,L]  A[l]  rJZ[l] 

_Rz[L,0]  Rz[L,l]  •••  RZ[L,L]J  |_/i[Z.]J  _rsz[M\_ 


Eq.  (13.57)  can  be  solved  by  matrix  inversion  to  estimate  the  channel  parameters  h[i]- 

In  the  more  general  case  of  FSE,  the  same  method  can  be  used  to  estimate  the  ith  subchannel 
parameters  by  simply  replacing  z[n  -  k]  with  Zi[n  -  k]. 


13.6  DECISION  FEEDBACK  EQUALIZER 


The  TSE  and  FSE  we  have  discussed  thus  far  are  known  as  linear  equalizers  because  the 
equalization  consists  of  a  linear  filter  followed  by  a  memoryless  decision  device.  These  linear 
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Figure  13.8 

A  decision 
feedback 
equalizer  with 
fractionally 
spaced  samples. 


equalizers  are  also  known  as  feedforward  ( FFW )  equalizers.  The  advantages  of  FFW  equalizers 
lie  in  their  simple  implementation  as  FIR  filters  and  in  the  straightforward  design  approaches 
they  accommodate.  FFW  equalizers  require  much  lower  computational  complexity  than  the 
nonlinear  MLSE  receivers. 

On  the  other  hand,  FFW  equalizers  do  suffer  from  several  major  weaknesses.  First,  the 
TSE  or  FSE  in  their  FFW  forms  can  cause  severe  noise  amplifications  depending  on  the 
underlying  channel  conditions.  Second,  depending  on  the  roots  of  the  channel  polynomials, 
the  FFW  equalizer(s)  may  need  to  be  very  long  to  be  effective,  particular  when  the  channels 
are  nearly  singular.  To  achieve  simple  and  effective  channel  equalization  without  risking  noise 
amplification,  a  decision  feedback  equalizer  (DFE)  proves  to  be  a  very  useful  tool. 

Recall  that  FFW  equalizers  generally  serve  as  a  channel  inverse  filter  (in  ZF  design)  or  a 
regularized  channel  inverse  filter  (in  MMSE  design).  The  DFE,  however,  comprises  another 
feedback  filter  in  addition  to  a  feedforward  filter.  The  feedforward  filter  is  identical  to  linear 
TSE  or  FSE,  whereas  the  feedback  filter  attempts  to  cancel  ISI  from  previous  data  samples 
usin°  data  estimates  generated  by  a  memoryless  decision  device.  The  feedforward  filter  may 
be  operating  on  fractionally  spaced  samples.  Hence,  there  may  be  m  parallel  filters  as  shown 
Fi cr  13  8 

The  basic  idea  behind  the  inclusion  of  a  feedback  filter  Biz)  is  motivated  by  awareness  that 
the  feedforward  filter  output  d[k\ may  contain  some  residual  ISI  that  can  be  more  effectively 
regenerated  by  the  feedback  filter  output  and  canceled  from  v[k ].  More  specifically,  consider 
the  case  in  which  the  feedforward  filter  output  d[k)  consists  of 


d[k]  =  sk-u  +  £  CiSk-i  +  w| n) 
(=«+ 1 


noise 


residual  ISI 


(13.58) 


There  is  a  residual  ISI  term  and  a  noise  term. 


If  the  decision  output  is  very  accurate  such  that 


h-u  =  sk-u 

,h,„  the  feedback  filter  input  will  equal  ,o  Ihe  aelual  da, a  symbol.  If  we  deuce  Ihe  feedback 
filter  as 


N—u 

Biz)  =  £  BiZ~l 
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then  we  have 


N-u 

v[k]  =  d[k]  -  £  bih-u-i 
1=1 

N  N-u 

=  sk-u  +  ^  CiSk-i  ~  ^  biSk-u-i  +  w[/i] 

l=M+ 1  1=1 

N  N—u 

=  Sk-u  +  X!  CiSk~‘  -  ^2  bjSk-u-i  +  w[n] 

1=11+ 1  1=1 

N-U 

=  Sk-u  +  T  (cu+i  -  bj)sk-u-i  +  w[n]  (13.59) 

i=l 

To  eliminate  the  residual  IS1,  the  feedback  filter  should  have  coefficients 

bi  =  cu+j  i  =  1 ,  2,  . . . ,  N  —  u  —  1 

With  these  matching  DFE  parameters,  the  residual  1S1  is  completely  canceled.  Hence,  the  input 
to  the  decision  device 


13.7 


v[k]  =  sk-u  +  w[/i] 

contains  zero  ISI.  The  only  nuisance  that  remains  in  v[it]  is  the  noise.  Because  the  noise  term 
in  d[k]  is  not  affected  or  amplified  by  the  feedback  filter,  the  decision  output  for  the  next  time 
instant  would  be  much  more  accurate  after  all  residual  ISI  has  been  canceled. 

Our  DFE  analysis  so  far  has  focused  on  the  ideal  operation  of  DFE  when  the  decision 
results  are  correct.  Traditionally,  the  design  and  analysis  of  DFE  has  often  been  based  on  such 
an  idealized  operating  scenario.  The  design  of  DFE  filters  must  include  both  the  feedforward 
filters  and  the  feedback  filter.  Although  historically  there  have  been  a  few  earlier  attempts  to 
fully  decouple  the  design  of  the  feedforward  filter  and  the  feedback  filter,  the  more  recent  work 
by  A1  Dhahir  and  Cioffi9  provides  a  comprehensive  and  rigorous  discussion. 

In  the  analysis  of  a  DFE,  the  assumption  of  correct  decision  output  leads  to  the  remo\al 
ot  ISI  in  v[k],  and  hence,  a  better  likelihood  that  the  decision  output  is  accurate.  One  cannot 
help  but  notice  this  circular  “chicken  or  egg”  argument.  The  truth  of  the  matter  is  that  the 
DFE  is  inherently  a  nonlinear  system.  More  importantly  the  hard  decision  device  is  not  even 
differentiable.  As  a  result,  most  traditional  analytical  tools  developed  for  linear  and  nonlinear 
systems  no  longer  apply.  For  this  reason,  the  somewhat  ironic  chicken-egg  analysis  becomes 
the  last  resort.  Fortunately,  for  high-SNR  systems,  this  circular  argument  does  yield  analytical 
results  that  can  be  closely  matched  by  experiments. 

Error  Propagation  in  DFE 

Because  of  its  leedback  structure,  the  DFE  does  suffer  from  the  particular  phenomenon  known 
as  error  propagation.  For  example,  when  the  decision  device  makes  an  error,  the  erroneous 
symbol  will  be  sent  to  the  feedback  filter  and  used  for  ISI  cancellation  in  Eq.  (13.59).  How¬ 
ever,  because  the  symbol  is  incorrect,  instead  of  canceling  the  ISI  caused  by  this  symbol,  the 
canceling  subtraction  may  instead  strengthen  the  ISI  in  v[*].  As  a  result,  the  decision  device  is 
more  likely  to  make  another  subsequent  error,  and  so  on.  This  is  known  as  error  propagation- 
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Error  propagation  means  that  the  actual  DFE  performance  will  be  worse  than  the  prediciton 
of  analytical  results  derived  from  the  assumption  of  perfect  decision.  Moreover,  the  effect  of 
error  propagation  means  that  DFE  is  more  likely  to  make  a  short  burst  of  decision  errors  before 
recovery  from  the  error  propagation  mode.  The  recovery  time  from  error  propagation  depends 
on  the  channel  response  and  was  investigated  by  Kennedy  and  Anderson.10 


13.7  OFDM  (MULTICARRIER)  COMMUNICATIONS 

As  we  have  learned  from  the  design  of  TSE  and  FSE,  channel  equalization  is  exclusively  the 
task  of  the  receivers.  The  only  assistance  provided  by  the  transmitter  to  receiver  equalization 
is  the  potential  transmission  of  training  or  pilot  symbols.  In  a  typically  uncertain  environment, 
it  makes  sense  for  the  receivers  to  undertake  the  task  of  equalization  because  the  transmitter 
normally  has  little  or  no  knowledge  of  the  channel  response  it  uses.*  Still,  despite  their  simpler 
implementation  compared  with  the  optimum  MLSE,  equalizers  such  as  the  feedforward  and 
decision  feedback  types  often  lead  to  less  than  satisfactory  performance.  More  importantly,  the 
performance  of  the  FFW  and  decision  feedback  equalizers  is  too  sensitive  to  all  the  parameters 
in  their  transversal  structure.  If  even  one  parameter  fails  to  hold  the  desired  value,  an  entire 
equalizer  could  crumble. 

In  a  number  of  applications,  however,  the  transmitters  have  partial  information  regarding 
the  channel  characteristics.  One  of  the  most  important  piece  of  partial  channel  information  is 
the  channel  delay  spread;  that  is,  for  a  finite  length  channel 

l 

H(z)  = 

k=  0 


the  channel  order  L  is  known  at  the  transmitter  while  {/#]}  are  still  unknown.  Given  this  partial 
channel  information,  the  particular  transmission  technique  known  as  orthogonal  frequency 
division  modulation  (OFDM)  can  be  implemented  at  the  transmitter.  With  the  application  o 
OFDM,  the  task  of  receiver  equalization  is  significantly  simplified. 


13.7.1  Principles  of  OFDM 


tt*r  fhnt  k  in  charge  of  transmitting  a  sequence  of  data  signals  { s * )  over  the 
HR  chann^HM  ol  or*r  up  to  L.  Before  we  begin  to  describe  the  fundamentals  of  OFDM, 
we  nore  .ha,  the'frequency  response  of  the  FIR  channel  can  he  represented  as 


H(e)W)  =  £  h[k  \e-j27rfkT  < 1 3-60) 

*=  0 


.  i  a  .icon  mrt  also  the  sampling  period.  Because  Hie1'*^)  is  the 
where  T  is  the  symbol  ura  _  .( .g  a  periodic  function  of/  with  period  1  /T . 

frequency  response  oft  ei-  .  sampled  function  of  the  channel  frequency 


- - - ,  nc,  unco  the  channels  are  quite  stable,  and  the  receivers  can  use  a  reverse  link 

*  In  a  stationary  environment  (e.g.,  h  )  jnformation.  This  channel  state  information  (CSI)  feedback, 

channel  to  inifom^e,"^"  [|"re  accuraty.  can  consume  rather  valuable  bandwidth  resources. 
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Figure  13.9 

(a)  Discrete  time 
domain  channel 
response  and  (b) 
its  corresponding 
periodic  DFT. 


the  frequency/  is  sampled  at 


*-°-5 T-° 

J  NT  NT 


,  (at  n  1  {N~]) 

JN-l  =  (N  —  1 )  •  -  =  - 

NT  NT 

We  can  use  a  simpler  notation  to  denote  the  DFT  sequence  by  letting  ion  =  Inn/NT 
H[n]  =  H(e‘u>nT) 

L 

=  ^T/h[k]exp(-jco„Tk) 
k= o 
L 


=  J2  h[k]  exp  (-jln-j^kT ) 
k=  0 

=  J2h^exp{~J27T'^  n  =  0,  1,  ....  (N  -  1)  (13.61) 

From  Eq.  (13.61),  it  is  useful  to  notice  that  H[n]  is  periodic  with  period  N  (Fig.  13.9).  Hence, 

=  |>Oexp 


(13.62a) 


k= 0 

=  H[N  -n] 


(13.62b) 


Based  on  the  linear  convolutional  relationship  between  the  channel  input  { s ^ )  and  output 

L 

z[k]  =  J2h[i]sk-i  +  vj[k] 

(=0 


/»[0] 


/i[l]  h[L] 

Ln.-  I, 


H[2\ 


ffioi 


. 


H\N] 


T 

//[AM] 


(a) 


(b) 
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a  vector  of  N  output  symbols  can  be  written  in  matrix  form  as 


z[N] 

"mo]  mu  •••  m 

z[N  -  1] 

MO]  Ml]  •••  M L] 

Z[L] 

= 

MO]  MD  •••  h[ L] 

z[l] 

MO]  Ml]  •••  M L]_ 

SN 

"  w IN]  ' 

SN- 1 

w  IN  -  1] 

S| 

+ 

w  [L] 

•so 

w|l|  . 

J-(L-l). 

(13.63) 


The  key  step  in  OFDM  is  to  introduce  what  is  known  as  the  cyclic  prefix  in  the  transmitted 
data.*  This  step  replaces  the  M  leading  elements 

s0,  5-1,  ....  s-(L-l) 

of  the  (N  +  L) -dimensional  data  vector  by  the  trailing  symbols 

[Sfj,  SN-U  •  •  •  >  SN-L+ 1 1  *  {*>,  S-l,  .  •  • ,  S-(i-l)} 

By  inserting  the  cyclic  prefix,  we  can  then  rewrite  Eq.  (13.63)  as 

7i[0]  M  1]  •••  h[L] 

h[  0]  h[  1]  •••  h[L] 


"  z[AT] 

zlN  -  1] 

zlL] 

— 

zl  1]  _ 

MO]  hi  1]  ••••  h[L] 

hi  0]  MU 


hlL)_ 


SN 

■  w[N] 

SN- 1 

w[N  -  1] 

*1 

+ 

w[L] 

SN 

w[l]  _ 

SN-L+l ^ 

(13.64a) 


*  Besides  the  use 


cyclic  prefix,  zero  padding  is  an  alternative  but  equivalent  approach. 
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'*[0] 

Ml] 

h[L] 

0 

0 

MO] 

Ml] 

h[L] 

0 

0 

*[01 

MU 

h[L] 

0 

*10] 

mu 

h[L) 

0 

.  .  . 

Wcp  :  (N  x  N) 


w[(V] 

0 

w[N  -  1] 

m 

+ 

w  [L] 

h[L  -  1  ] 

_  *1  _ 

w[l] 

(13.64b) 


The  critical  role  of  the  cyclic  prefix  is  to  convert  the  convolution  channel  matrix  in  Eq.  (13.64a) 
into  a  well-structured  N  x  N  cyclic  matrix  Hcp  in  Eq.  (13.64b). 

Next,  we  need  to  introduce  the  /V-point  DFT  matrix  and  the  corresponding  inverse  DFT 
matrix.  First,  it  is  more  convenient  to  denote 


Wn  =  exp 

This  complex  number  WN  has  some  useful  properties: 

'  <  =  1 
•  V  = 

If  we  take  the  DFT  of  the  W-dimensional  vector 

v0 

V| 

v  = 

_vyv— i  _ 

then  we  have  the  DFT 

N-\  ,  ,v  N-\ 

V[n]  =  Vi  exp  f  —jin )  =  £  v,  W^  n  =  0,  1,  . . . ,  (N  -  D 
*=o  V  7  k=0 

and 

N~ 1  /  .v  tf-l 

U[-n]  =  v,  exp  n  =  0,  1,  . . . ,  (N  -  D 

*=  o  V  '  *•= o 

The  inverse  DFT  can  also  be  simplified  as 

Vk  =  ^  E  exP  (j2ir^)  =  i  E  V*  k  =  0,  1 . (N  ' 1} 

«=0  V  /v  7  /V  i  =n 
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Thus,  the  iV-point  DFT  of  v  can  be  written  in  the  matrix  form 


V  = 


V[0] 

V[l] 

V[N  -  1]_ 

tyO-o 

w10 


W(M 

WN 

W1'1 

WN 


WN  WN 


If  we  denote  the  N  x  N  DFT  matrix  as 


W/v  = 


1  1 
i  < 


i  w, 


(Af-I) 


then  W also  has  an  inverse 


w-1  =  — 
W"  N 


N 


1  I 


I  w~l  ...  W~ 


1  w, 


W<HN-D 


W 


!t. 


(/v- 1) 


W 


N 

(N-IH/V-I) 

N 


w, 


(N-\) 

N 


w 


<N-I>2 


I 

-(N-l) 


...  w, 


-(N- 1)2 
N 


This  can  be  verified  (Prob.  13.7-1)  by  showing  that 

W  n  ■  =  In  *n 

Given  this  notation,  we  have  the  relationship  ol 


(13.65) 


(13.66a) 


(13.66b) 


V  =  Ws  -v 

V  =  WJ-V 

An  amazing  pmpeny  of  ,he  cyclic  maim  «,p  can  be  eaablishri  by  applying  the  DFT  and 
I  DFT  matrices. 

r/i[0]  MU  •••  WL]  0  ° 


0  M  0] 

hi  1] 

*  *  * 

h[L\ 
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0 
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MU 

...  h[L] 

h[L] 

0 

MO] 

.  •  •  ML  - 

h\\] 

h[L] 

0 

... 

o  M01 
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n  /V  J  — 

=  WJdh  (13.67a) 

where  we  have  defined  the  diagonal  matrix  with  the  channel  DFT  entries  as 


Dh  = 

~//[0] 

H[~  1] 

_ 

“//[AT] 

//[AT  -  1] 

//[—AT  -F  1] 

H[  1]_ 

The  last  equality  follows  from  the  periodic  nature  of  H[n]  given  in  Eq.  (13.62b).  We  leave  it  as 
homework  to  show  that  any  cyclic  matrix  of  size  N  xN  can  be  diagonalized  by  premultiplication 
with  WN  and  postmultiplication  with  W^1  (Prob.  13.7-2). 

Based  on  Eq.  (13.67a)  we  have  established  the  following  very  important  relationship  for 
OFDM: 


ncp  =  w"1  •  dh  .  wN 
-1 


D"  7nw" 


(13.67b) 


Recall  that  after  the  cyclic  prefix  has  been  added,  the  channel  input-output  relationship  is 
reduced  to  Eq.  (13.64b).  As  a  result. 


z[N] 
z[N  -  1] 

z[  1] 


S\ 

"  w  [AT] 

SN-l 

+ 

w  [N  -  1] 

Si 

w[l]  . 

This  means  that  if  we  put  the  information  source  data  into 


SN 

SN 

~  A 

SN-l 

/  1  ,  \ 

SN-l 

S  — 

; 

- 

-  J1  . 
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then  we  can  obtain  the  OFDM  transmission  symbols  via 


Sfil 

’  SN 

SN- 1 

(  *  \_l 

SN-\ 

-  Jl 

s\ 

Despite  the  minor  scalar  1  / sfN <  we  can  call  the  matrix  transformation  of  sfN  W N 1  the  IDFT 
(inverse  DFT)  operation.  In  other  words,  we  apply  IDFT  on  the  information  source  data  v  at 
the  OFDM  transmitter  to  obtain  s  before  adding  the  cyclic  prefix. 

Similarly,  we  can  also  transform  the  channel  output  vector  via 


z  = 


"  l[N] 

z[N\ 

UN-  1] 

/  1  \ 

z\N  -  1 1 

z[  1] 

”7 

n 

II 

z(U 

Corresponding  to  the  IDFT,  this  operation  can  also  be  named  the  DFT:  Finally,  we  note  that 
the  noise  vector  at  the  channel  output  also  undergoes  the  DF I . 


w  = 


‘  w[A^l 

'  w  [N] 

w  [N  -  1) 

=  (-CfVv) 

w\N  —  1 1 

w[l] 

\y/N  ) 

w[l] 

We  now  can  see  the  very  simple  relationship  between  the  source  data  and  the  channel  output 
vector,  which  has  undergone  the  DFT. 

z  =  Dhs  +  vv  0  3.68a) 

Because  DH  is  diagonal,  this  matrix  product  is  essentially  element-wise  multiplication: 

z[n ]  =  H[n]  s„  +  w[n]  n  =  I . N  ( 1 3.68b) 

This  shows  that  we  now  equivalently  have  N  parallel  (subchannels  each  of  which  is  just  a 
scalar  channel  with  gain  H[n].  Each  vector  of  N  data  symbols  in  OFDM  transm.ss.on  is  known 
aslnOFDM  frame  or  an  OFDM  symbol.  Each  subchannel  H\n] |  is  also  known  as  a  subcarner 
i_  nivincr  the  IDFT  on  the  source  data  vector  and  the  DFT  on  the  channel  output 
vector  OFDMconverts  an  ISI  channel  of  order  L  into  N  parallel  subchannels  without  LSI.  We 
no  longer  have  to  deal  with  the  complex  convolution  that  involves  the  time  domam  channe 
nnS  Instead  every  subchannel  is  a  non-frequency-select. ve  gam  only.  There  is  no  ISI 
reSth  each  subchanne?  The  N  parallel  subchannels  are  independent  of  one  another  because 
wthm  each  subchan"el,  P  h  modulation  is known  as  orthogonal  frequency 

divisionmodi^ation^o'FDMvThe  block  diiigrumi'luDA'-poinl  OFDM  system  implemenlalion 

with  a  linear  FIR  channel  of  order  L  is  given  in  Fig.  .. 

13.7.2  OFDM  Channel  Noise 

c  m  A8bl  each  of  the  N  channels  acts  like  a  separate  carrier  of  frequency 
/ “"/W  wilhchannel  fain  «I4  Effccdvdy.  *e  original  da, a  symbols  B.I  am  spli,  into N 
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Figure  13.10 

Illustration  of  an 
N -point  OFDM 
transmission 
system. 


Figure  13.1 1 

N  independent 
AWGN  channels 
generated  by 
OFDM  without 
ISI. 


SN  -  I  ’  *  ’  S2  S I  $0  S-  I  *  *  ’  S- 


N-point  data  frame 


S  v  -  i  *  *  *  S  |  SN  *  *  *  -  z.  ♦  i 


sequences  and  transmitted  over  N  subcarriers.  For  this  apparent  reason,  OFDM  is  also  com¬ 
monly  known  as  a  multicarrier  communication  system.  Simply  put,  OFDM  utilizes  IDFT  and 
cyclic  prefix  to  effectively  achieve  multicarrier  communications  without  the  need  to  actually 
generate  and  modulate  multiple  (sub)carriers.  The  effective  block  diagram  of  OFDM  appears 
in  Fig.  13.11. 

Now  we  can  study  the  relationship  between  the  transformed  noise  samples  w[w]  in 
Fig.  13.11.  First,  notice  that 


yv-i 

w[JV  -j]  =  W^(N-j)vj [N  -  k] 

k=  0 
N- 1 

=  J2WNkj'"[N-l<]  j  =  0,  1 . (AT  —  I) 

*=0 

They  are  linear  combinations  of  jointly  distributed  Gaussian  noise  samples  { w [A^  ~  Ul- 
Therefore.  {w[W  —  j’]}  remains  Gaussian.  In  addition,  because  w[n]  has  zero  mean,  we  have 

_  At— l  _ _ 

w[N  -j]  =  £  W~kJ w[N-k]  =  0,  j  =  0,  1 . (N  -  1) 

k=  0 

_ _ _  J 

w[A/  -  t] w [N  -j}*  =  ~Y1  WNkriMN  -  ki]  W„2'jw[N  -  k2V 

*1=0  t2=0 
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.  N- 1 N- 1 

=  jv  E  E  K2j~kvl^N  -  k^N  - 


1  ...krj-kvi .  ^S[ki  _  kl] 


*,=0Jfc2=0 

N- 1  N-l 

=  sEE< 

*1=0  *2=0 

N-l 

=  ^/2N  £  <' 

*,=0 

|o  ;;j 

=  jS[i-j] 


(13.69) 


Because  { w[«]}  are  zero  mean  with  zero  correlation,  they  are  uncorrelated  according  to 
Eq  (13  69).  Moreover,  {w[n])  are  also  Gaussian  noises.  Since  uncorrelated  Gaussian  random 
variables  are  also  independent,  {w[n]}  are  independent  Gaussian  noises  with  zero  mean  and 
identical  variance  oftf/2.  The  independence  of  the  N  channel  noises  demonstrates  that  OFDM 
converts  an  FIR  channel  with  ISI  and  order  up  to  L  into  N  parallel,  independent,  and  AWGN 
channels  as  shown  in  Fig.  13.11. 


13.7.3  Zero-Padded  OFDM 


We  have  shown  that  by  introducing  a  cyclic  prefix  of  length  L,  a  circular  convolution  channel 
matrix  can  be  established.  Because  any  circular  matrix  of  size  N  x  N  can  be  diagonalized  by 
1DFT  and  DFT  (Prob.  13.7-2),  the  ISI  channel  of  order  less  than  or  equal  to  L  is  translormed 

into  N  parallel  independent  subchannels.  . 

There  is  also  an  alternative  approach  to  the  use  of  cyclic  prefix.  This  method  .s  known 
as  zero  padding  The  transmitter  first  perform  an  I  DFT  on  the  N  input  data.  Then,  mstead  of 
repeating  the  last  L  symbols  as  in  Eq.  (13.64b)  to  transmit 


SN 
SN- 1 


*1 

SN 


SN-L+\_ 


we  can  simply  replace  .he  cyclic  prefix  wifi.  L zeros  and  Iransmi, 

SN- 1 

S\ 

0 

0 


(N  +  L)  x  1 
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The  rest  of  the  OFDM  transmission  steps  remain  unchanged.  At  the  receiver  end,  we  can  stack 
up  the  received  symbols  in 


"  z[N] 

0 

z[N-  1] 

l 

y  = 

z{L] 

+ 

0 

z[N  +  L] 

z[l) 

z{N  +  1] 

We  then  can  show  (Prob.  13.7-4)  that 

would  achieve  the  same  multichannel  relationship  of  Eq.  ( 1 3.68b). 


(13.70) 


13.7.4  Cyclic  Prefix  Redundancy  in  OFDM 

The  two  critical  steps  of  OFDM  at  the  transmitter  are  the  insertion  of  the  cyclic  prefix  and  the 
use  of  /V-point  IDFT.  The  necessary  length  of  cyclic  prefix  L  depends  on  the  order  of  the  FIR 
channel.  Since  the  channel  order  may  vary  in  practical  systems,  the  OFDM  transmitter  must 
be  aware  of  the  maximum  channel  order  information  a  priori. 

Although  it  is  acceptable  for  OFDM  transmitters  to  use  an  overestimated  channel  order,  the 
major  disadvantage  of  inserting  a  longer-than-necessary  cyclic  prefix  is  the  waste  of  channel 
bandwidth.  To  understand  this  drawback,  notice  that  in  OFDM,  the  cyclic  prefix  makes  possible 

the  successful  transmission  of  N  data  symbols  {s\ . sn }  with  time  duration  (N  +  L)T .  The 

L  cyclic  prefix  symbols  are  introduced  by  OFDM  as  redundancy  to  remove  the  ISI  in  the 
original  frequency-selective  channel  H(z ).  Because  (N  +  L)  symbol  periods  are  now  being 
used  to  transmit  the  N  information  data,  the  effective  data  rate  of  OFDM  equals 

N  1 
N  +  LT 

It  L  is  overestimated,  the  effective  data  rate  is  reduced,  and  the  transmission  of  the  unnecessarily 
long  cyclic  prefix  wastes  some  channel  bandwidth.  For  this  reason,  OFDM  transmitters  require 
accurate  knowledge  about  the  channel  delay  spread  to  achieve  good  bandwidth  efficiency- 1 
the  cyclic  prefix  is  shorter  than  L,  then  the  receiver  is  required  to  include  a  time  domain 
filter  known  as  the  channel-shortening  filter  to  reduce  the  effective  channel-filter  response  to 
within  LT. 


13.7.5  OFDM  Equalization 

We  have  shown  that  OFDM  converts  an  ISI  channel  into  N  parallel  AWGN  subchannels  as 
shown  in  Fig.  13.11.  Each  of  the  N  subchannels  has  an  additive  white  Gaussian  noise 
zero  mean  and  variance  j\f / 2.  The  subchannel  gain  equals  H[k],  which  is  the  FIR  frequen  y 
response  at  k/NT  Hz.  Strictly  speaking,  these  N  parallel  channels  do  not  have  any  1ST  Hen' 
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Figure  13.12 

Using  a  bank  of 
receiver  gain 
adjustors  for  N 
independent 
AWGN  channels 
in  OFDM  to 
achieve  gain 
equalization. 
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channel  equalization  is  not  necessary.  However,  because  each  subchannel  has  a  different  gain, 
the  optimum  detection  of  { .v „ }  from 

z[n]  =  //[«]  s„  n=l,...,N 
would  require  knowledge  of  the  channel  gain  H\n] 

5„  =  dec  (//[«]"'  z[«])  n=\ . N 

This  resulting  OFDM  receiver  is  shown  in  Fig.  13.12.  For  each  subchannel,  a  one-tap 
gain  adjustment  can  be  applied  to  compensate  the  subchannel  scaling.  In  fact,  this  means  that 
we  need  to  implement  a  bank  of  N  gain  adjustment  taps.  The  objective  is  to  compensate  the 
N  subchannels  such  that  the  total  gain  of  each  data  symbol  equals  unity  before  the  QAM 
decision  device.  In  fact,  the  gain  equalizers  scale  both  the  subchannel  signal  and  the  no.sc 
eauallv  They  do  not  change  the  subchannel  SNR  and  do  not  change  the  detection  accuracy 
Indeed  equalizers  are  used  only  to  facilitate  the  use  of  the  same  modular  decision  device  on  all 
subchannels  Oddly  enough,  this  bank  of  gain  elements  at  the  receiver  is  exactly  the  same  as 
'he  “r  in  a  high-fidelity  audio  amplifier.  This  structure  is  known  henceforth  as  a  one-tap 

equalizer  for  OFDM  receivers. 

13.8  DISCRETE  MULTITONE  (DMT)  MODULATIONS 

A  r„hiiv  different  form  of  OFDM  is  called  discrete  multitone  (DMT)  modulation.  In  DMT, 
A  slightly  ditt  operations  are  essentially  identical  to  OFDM.  The  only  difference 

the  basic  stgna  J&JJ  0FDM  is  that  DMT  transmitters  are  given  knowledge  of  the 

between  transmits  signals  of  differing  constellations  on 

subchannel  infoniutioitAs^^resuIl.^DM^)  ^  ^  t Rg  , ,  , ,  lhe  singlc  RF  channel » 

“““  » Channels  or  sobcamem  by  OFDM  or  DMT.  Each  subcamer  conveys  a  rlisrinc, 
data  sequence: 

{...  S([k  +  1]  hW  sM  -H 

The  QAM 8ai«  are  gcn- 
Because  the  original  '  |h  evCT  lh,1U(,|,  DMT  or  OFDM  converts  the  channel 

S  IsidSSo  N  parallel  Independent  channels  without  IS,,  symbols  transmitted  over 
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Figure  13.13 

DMT  trans¬ 
mission  of  N 
different  symbol 
streams  over  a 
single  FIR 
channel. 


different  subcarriers  will  encounter  different  SNRs  at  the  receiver  end.  In  DMT.  the  receivers 
are  responsible  for  conveying  to  the  transmitter  all  the  subchannel  information.  As  a  result,  the 
transmitter  can  implement  compensatory  measures  to  optimize  various  performance  metrics. 
We  mention  two  common  approaches  adopted  at  DMT  transmitters: 

•  Subcarrier  power  loading  to  maximize  average  receiver  SNR. 

Subcarrier  bit  loading  to  equalize  the  bit  error  rate  (BER)  across  subcarriers. 


Transmitter  Power  Loading  for  Maximizing  Receiver  SNR 

To  describe  the  idea  of  power  loading  at  the  transmitter  for  maximizing  total  receiver  SNR. 
let  s,[k]  be  the  data  stream  carried  by  the  ith  subchannel  and  call  {s,[A]}  an  independent  data 
sequence  in  time  k.  Let  us  further  say  that  all  data  sequences  {$/[&]}  are  also  independent  ot 
one  another.  Let  the  average  power  of  si[k]  be 


Pi  =  km  I2 

The  total  channel  input  power  is 

M 

Ep' 

1=1 

whereas  the  corresponding  channel  output  power  at  the  receiver  equals 

M 

E  •  Pi 

i=i 

Hence,  the  total  channel  output  SNR  is 


E”i  mn\2-Pi 

AW/2 


M 


E  ™i2  •  pi 


/=i 
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To  determine  the  optimum  power  distribution,  we  would  like  to  maximize  the  output  SNR. 
Because  the  channel  input  power  is  limited,  the  optimization  requires 


N 

max  T\H[i]\2-Pi 
|/\>0|  f-f 
(=1 

N 


(13.71) 


subject  to  y  P,  =  P 

i=l 

Once  again,  we  can  invoke  the  Cauchy-Schwartz  inequality 

2 

i  i  i 

with  equality  if  and  only  if  6;  =  ka* 

Based  on  the  Cauchy-Schwartz  inequality. 


max  |//[/]|2  •  Pi  = 
if 

Pi  =  k\H\i]\2 


yv  /v 

£|tf[fll4-£lfil2  03.72a) 

1=1  1=1 


(13.72b) 

Because  of  the  input  power  constraint  Pi  =  P.  the  optimum  input  power  distribution 
should  be 

N  N 

£>,.  =  *.£  \H[i]\2  =  P 


(13.73a) 


i=  1 


i=\ 


In  other  words, 


k  = 


ESLi  iwmi2 


(13.73b) 


Substituting  Eq.  (13.73b)  into  Eq.  (13.72b),  we  can  obtain  the  optimum  channel  input  power 
loading  across  the  N  subchannels  as 


Pi  = 


\H\i}\2 


£?=,  \m\2 


(13.74) 


•mis  optimum  distribution  of  powur  in  OFDM,  also  known  as  power  load, ng  makes  wry 
«„od  sen*  When  a  channel  has  htgh  gain,  i,  is  able  to  boos,  the  power  of ,  s  mput  much  more 
effectively  than  a  channel  with  low  gain.  Hence,  the  high-gam  subchannels  wall  be  recemng 
effectively  tna  |ow.aain  subchannels  will  receive  much  less.  No  power  should 

higher  power  loa  &  ^  ^  ofa  subchanne)  tha,  has  zero  gain,  since  the  output  of  such  a 

be  wasted  on  contribution  to  the  total  received  signal  power, 

subchannel  will  make  "P  q{  maximjzing  average  SNR.  information  theory  can  also 

In  addition  to  P  P  f  er  |oac|ing  (known  as  water  pouring)  in  maximizing  the 
c^acity^f^quency-selective  channels.  This  discussion  will  be  presented  later  (Sec.  14.7). 
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Subcarrier  Bit  Loading  in  DMT 

If  the  transmitter  has  obtained  the  channel  information  |f/[/]|,  it  then  becomes  possible  for 
the  transmitter  to  predict  the  detection  error  probability  on  the  symbols  transmitted  over  each 
subcarrier.  The  SNR  of  each  subcarrier  is 


SNR,  = 


2|W[/]|2 

Af 


!*,[*]  1 2 


Therefore,  the  BER  on  this  particular  subcarrier  depends  on  the  SNR  and  the  QAM  constellation 
of  the  subcarrier.  Different  modulations  at  different  subcarriers  can  lead  to  different  powers 

\si[k]  I2. 

Consider  the  general  case  in  which  the  /th  subchannel  carries  Kj  bits  in  each  modulated 
symbol.  Furthermore,  we  denote  the  BER  of  the  /th  subchannel  by  Pb[i]-  Then  the  average 
receiver  bit  error  rate  across  the  N  subcarriers  is 

P 

b  EL  K, 

If  all  subchannels  apply  the  same  QAM  constellation,  then  Kj  is  constant  for  all  i  and 


i=l 

Clearly,  subchannels  with  a  very  weak  SNR  will  generate  many  detection  errors,  while  sub¬ 
channels  with  a  strong  SNR  will  generate  very  few  detection  errors.  If  there  is  no  power 
loading,  then  the  /th  subchannel  SNR  is  proportional  to  the  subchannel  gain  |//[t’]|"-  In  °^ier 
words.  BERs  of  poor  subchannels  can  be  larger  than  the  BERs  of  good  subchannels  by  several 
orders  of  magnitude.  Hence,  the  average  BER  Ph  will  be  dominated  by  those  large  PbU\ from 
poor  subchannels.  Based  on  this  observation,  we  can  see  that  to  reduce  the  overall  average 
BER.  it  is  desirable  to  “equalize”  the  subchannel  BER.  By  making  each  subchannel  equally 
reliable,  the  average  BER  of  the  DMT  system  will  improve.  One  effective  way  to  “equalize 
subchannel  BER  is  to  apply  the  practice  of  bit  loading.  "• 12 

To  describe  the  concept  of  bit  loading,  Table  13.1  illustrates  the  SNR  necessary  to  achieve 
a  detection  error  probability  of  10-6  for  five  familiar  constellations.  It  is  clear  that  small 
constellations  (e.g.,  BSPK,  QPSK)  require  much  lower  SNRs  than  large  constellations  (e.g- 
16-QAM,  32-QAM).  This  means  that  subcarriers  with  low  gains  should  be  assigned  less  com¬ 
plex  constellations  and  should  carry  fewer  bits  per  symbol.  In  the  extreme  case  of  subchannels 
with  gains  close  to  zero,  no  bit  should  be  assigned  and  the  subcarriers  should  be  kept  vacant. 

TABLE  13.1 

SNR  Required  to  Achieve  Detection 
Error  Probability  of  10-6 


Constellation  Eb/N  at  Pe  =  10-6,  dB 


BPSK 

10.6 

QPSK 

10.6 

8-PSK 

14 

16-QAM 

14.5 

32-QAM 

17.4 
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On  the  other  hand,  subcarriers  with  large  gains  should  be  assigned  more  complex  constellations 
and  should  carry  many  more  bits  in  each  symbol.  This  distribution  of  bits  at  the  transmitter 
according  to  subcarrier  conditions  is  called  bit  loading.  In  some  cases,  a  subcarrier  gain  may  be 
a  little  too  low  to  carry  n  bits  per  symbol  but  too  wasteful  to  carry  n  -  I  bits  per  symbol.  In  such 
cases,  the  transmitter  can  apply  additional  power  loading  to  this  subcarrier.  Therefore,  DMT 
bit  loading  and  power  loading  are  often  complementary  at  the  transmitter.11’ 12  Figure  13.14 
is  a  simple  block  diagram  of  the  highly  effective  DMT  bit-and-power  loading. 

Cyclic  Prefix  and  Channel  Shortening 

The  principles  of  OFDM  and  DMT  require  that  the  cyclic  prefix  be  no  shorter  than  the  order 
of  the  FIR  communication  channel  response.  Although  this  requirement  may  be  reasonable  in 
a  well-defined  environment,  for  many  applications,  channel  order  or  delay  spread  may  have 
a  large  variable  range.  If  a  long  cyclic  prefix  is  always  provisioned  to  target  the  worst-case 
(large)  delay  spread,  then  the  overall  bandwidth  efficiency  of  the  OFDM/DMT  communication 
systems  will  be  very  low. 

To  overcome  this  problem,  it  is  more  desirable  to  apply  an  additional  time  domain  equalizer 
(TEQ)  at  the  receiver  end  to  shorten  the  effective  channel  order.  We  note  that  the  objective  of 
this  time  domain  equalizer  (TEQ)  is  not  to  fully  eliminate  the  ISI  as  in  Sec.  13.3.  Instead,  the 
purpose  of  TEQ  filter  Gteq(z)  is  t0  shorten  the  effective  order  of  the  combined  response  of 
channel  equalizer  such  that 


L\ 

Gjeq(z)H (z)  ^  k  L\  <  L 

k=  o 


This  channel-shortening  task  is  less  demanding  than  full  ISI  removal.  By  forcing  L\  to  be 
(approximately)  smaller  than  the  original  order  L  a  shorter  cyclic  prefix  can  be  used  to  improve 
the  OFDM/DMT  transmission  efficiency.  The  inclusion  of  a  TEQ  lor  channel  shortening  is 

illustrated  in  Fig.  13.15. 


Figure  13.14 

Bit  and  power 
loading  in  a 
DMT  (OFDM) 
transmission 
system  with  N 
subcarriers. 


Figure  13.15 
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1 3.9  REAL-LIFE  APPLICATIONS  OF  OFDM  AND  DMT 


OFDM  is  arguably  one  of  the  most  successful  signaling  techniques  for  digital  communications. 
Combined  with  transmitter  power  loading  and  bit  loading,  the  benefits  of  OFDM  include  high 
spectral  efficiency  and  resiliency  against  RF  interferences  and  multipath  distortion.  As  a  result 
of  the  many  advantages,  there  are  a  number  of  practical  OFDM/DMT  communication  systems 
ranging  from  the  wire-line  digital  subscriber  line  (DSL)  system  to  the  wireless  ultrawideband 
(UWB)  radio  as  well  as  satellite  broadcasting. 

Asymmetric  Digital  Subscriber  Line  (ADSL) 

In  the  past  few  years,  ADSL  has  replaced  a  vast  majority  of  voice  modems  to  become  the 
dominant  technology  providing  internet  service  to  millions  of  homes.  Conventional  voice 
band  modems  use  up  to  3.4  kHz  of  analog  bandwidth  sampled  at  8  kHz  by  the  public  switched 
telephone  network  (PSTN).  These  dial-up  modems  convert  bits  into  waveforms  that  must 
fit  into  this  tiny  voice  band.  Because  of  the  very  small  bandwidth,  voice  band  modems  are 
forced  to  apply  very  large  QAM  constellation  (e.g.,  960-QAM  in  V.34  for  28.8kbit/s).  Large 
QAM  constellation  require  very  high  transmission  power  and  high  complexity  equalization. 
For  these  reasons,  voice  band  modems  quickly  hit  a  rate  plateau  at  56kbit/s  in  the  ITU-T  V.90 
recommendation.13 

ADSL,  on  the  other  hand,  is  not  limited  by  the  telephone  voice  band.  In  fact,  ADSL  com¬ 
pletely  bypasses  the  voice  telephone  systems  by  specializing  in  data  service.  It  relies  on  the 
traditional  twisted  pair  of  copper  phone  lines  to  provide  the  last-mile  connection  to  individual 
homes.  The  main  idea  is  that  the  copper  wire  channels  in  fact  have  bandwidth  much  larger 
than  the  4  kHz  voice  band.  However,  as  distance  increases,  the  copper  wire  channel  degrades 
rapidly  at  higher  frequency.  Hence,  DSL  can  exploit  the  large  telephone  wire  bandwidth 
(up  to  1  MHz)  only  when  the  connection  distance  is  short  ( 1-5  km).14 

The  voice  band  is  sometimes  known  as  the  plain-old-telephone-service  (POTS)  band. 
POTS  and  DSL  data  service  are  separated  in  frequency.  The  voice  traffic  continues  to  use  the 
voice  band  below  3.4  kHz.  DSL  data  uses  the  frequency  band  above  the  voice  band.  As  shown 
in  Fig.  13.16,  the  separation  of  the  two  signals  is  achieved  by  a  simple  (in-line)  low-pass  filter 
inserted  between  the  phone  outlet  and  each  telephone  unit  when  DSL  service  is  available. 

Figure  13.17  illustrates  the  bandwidth  and  subcarrier  allocation  of  the  ADSL  system.  From 
the  top  of  the  POTS  band  to  the  nominal  ADSL  upper  limit  of  1 1 04  kHz,  we  have  255  equally 
spaced  subchannels  (subcarriers)  of  bandwidth  4.3175  kHz.  These  subcarriers  are  labeled  1  to 
255.  The  lower  number  subcarriers,  between  4.3175  and  25.875  kHz,  may  also  be  optional I  y 
used  by  some  service  providers.  In  typical  cases,  however,  ADSL  service  providers  utilize  the 


Figure  13.16 
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Figure  13.17 
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TABLE  13.2 

Basic  ADSL  Upstream  and  Downstream  Subcarrier  Allocations  and  Data  Rates 


Upstream 


Downstream 


QPSK  to  64-QAM  (2-6  bits  per  symbol) 
4  kHz 
No.  96 


Modulation  (bit  loading) 

DMT  frame  transmission  rate 
Pilot  subcarrier 
Typical  subcarriers 
Typical  bits  per  frame 
Maximum  possible  subcarriers 
Maximum  bits  per  frame 
Maximum  data  rate 


No.  64 
6  to  32 
Up  to  162  bits 
1  to  63 

Up  to  378  bits 
4  kHz  x378  =  1.512  Mbit/s 


33  to  255 
Up  to  1326  bits 
1  to  255  (excluding  64  and  96) 
Up  to  1518  bits 

4  kHz  x  1 5 1 8  bits  =  6.072  Mbit/s 


nominal  band  of  25.875  to  1104  kHz  (subcarrier  6  to  subcarrier  255).  These  250  available 
subcarriers  are  divided  between  downstream  data  transmission  (from  DSL  server  to  homes) 

and  upstream  data  (from  homes  to  DSL  server). 

In  today’s  internet  applications,  most  individual  consumers  have  a  higher  downstream 
need  than  upstream.  Unlike  business  users,  these  “asymmetric”  data  service  requirements 
define  the  objective  of  ADSL.  Therefore  in  ADSL,  the  number  of  downstream  subcamers  is 
greater  than  the  number  of  upstream  subcarriers.  In  ADSL,  subcarriers  6  to  32  (corresponding 
?_  95  875-138  kHz)  are  generally  allocated  for  upstream  data.  Subcarrier  64  and  subcarrier 
96 'are  reserved  for  upstream  pilot  and  downstream  pilot,  respectively.  Excluding  the  two 
p  ot  subcarriers,  subcarriers  33  to  255  (corresponding  to  138-1 104  kHz)  are  aJIocated  for 
downstream  data.  The  typical  carrier  allocation  and  data  rates  are  ^ hr i  Table 13  2. 
Notice  that  this  table  applies  only  to  the  basic  DSL  recommendations  by  ITU-T  (G992.1). 
Depending  on  the  channel  condition,  various  service  providers  may  choose  to  increase  the 
™  u  hiohpr  bandwidth  and  even  more  subcamers  above  subcarrier 

^3ta  IrtADSL^he  DMT  frame  transmission  rate  is  4  kH/.  Upstream  DMT  utilizes  M-poinl  real- 

valued  IFFT  that  is  equivalent  to  32-point  complex  IFFT.  The  upstream  cyclic  prefix  has  length 
valued  lrr  t  ina  H  IFFT  js  applied,  equivalent  to  256-point  complex  IF-FT. 

4.  On  downstream,  5  2  nd I  vato* l  i  PP  |6  |ex  numbers).  Because 

The  u—  Ser  than, he  preset, bed  cyclic  prefix.  TEQ  channel 

ADSL  wi,h  lht  help  of  “,eral  ‘hol,sa'ld  ,ra,"i"s  !,mbols 

(eg  in  downstream)  ,0  adapt  the  TEQ  parameters. 
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Digital  Broadcasting 

Although  North  America  has  decided  to  adopt  the  ATSC  standard  for  digital  television  broad¬ 
casting  at  the  maximum  rate  of  19.39  Mbit/s  using  8-VSB  modulation.  DVB-T  (digital  video 
broadcasting — terrestrial)  has  become  a  pan-European  standard,  also  gaining  acceptance  in 
parts  of  Asia.  Latin  America,  and  Australia.  DVB-T  was  first  introduced  in  1 997, 1  r>  utilizing 
OFDM  over  channels  6,  7.  or  8  MHz  wide. 

DVB-T  specifies  three  different  OFDM  transmission  modes  with  increasing  complexity 
for  different  target  bit  rates  (video  quality).  It  can  use  2048  subcarriers  (2k  mode),  4096 
subcarriers  (4k  mode),  and  8196  subcarriers  (8k  mode).  The  cyclic  prefix  length  may  be  1/32, 
1/16,  1/8,  or  1/4  of  the  FFT  length  in  the  three  different  modes.  Each  subcarrier  can  have 
three  modulation  formats:  QPSK.  16-QAM.  or  64-QAM.  When  subchannel  quality  is  poor, 
a  simpler  constellation  such  as  QPSK  is  used.  When  subchannel  SNR  is  high,  the  64-QAM 
constellation  is  used.  Different  quality  channels  will  bring  about  different  video  quality  from 
standard-definition  TV  (SDTV)  to  high-definition  TV  (HDTV). 

The  D  VB-H  standard  for  mobile  video  reception  by  handheld  mobile  phones  was  published 
in  2004.  The  OFDM  and  QAM  subcarrier  modulation  formats  remain  identical  to  those  for 
DVB-T.  For  lower  video  quality  multimedia  services,  digital  multimedia  broadcasting  (DMB) 
also  applies  OFDM  but  limits  itself  to  (differential)  QPSK  subcarrier  modulation.  Occupying 
less  than  1.7  MHz  bandwidth,  DMB  can  use  as  many  as  1536  subcarriers. 


Broad  OFDM  Applications 

DSL  and  DVB-T  are  only  two  limited  applications  of  OFDM  in  digital  communication 
systems.  Overall,  OFDM  has  found  broad  applications  in  numerous  terrestrial  wireless  com¬ 
munication  systems.  An  impressive  list  includes  digital  audio  broadcasting  (DAB).  Wi-Fi 
(IEEE  802.1  la,  IEEE  802.1  Ig),  WiMAX  (IEEE  802.16),  ultrawideband  (UWB)  radio  (IEEE 
802.15.3a),  3rd  Generation  Partnership  Project  (3GPP)  long-term-evolution  (LTE),  and  high¬ 
speed  OFDM  packet  access  (HSOPA).  Table  13.3  provides  a  snapshot  of  the  important  roles 
played  by  OFDM  in  various  communication  systems. 

It  is  noticeable,  however,  that  OFDM  has  not  been  very  popular  in  satellite  communications 
using  directional  antennas  and  in  coaxial  cable  systems  (e.g..  cable  modems,  cable  DTV).  The 
reason  is  in  fact  quite  obvious.  Directional  satellite  channels  and  coaxial  cable  channels  have 
very  little  frequency-selective  distortion.  In  particular,  they  normally  do  not  suffer  from  serious 
multipath  effects.  Without  having  to  combat  significant  channel  delay  spread  and  ISI,  OFDM 
would  in  fact  be  redundant.  This  is  why  systems  such  as  digital  satellite  dish  TV  services  and 
cable  digital  services  all  prefer  the  basic  single-carrier  modulation  format.  Direct  broadcasting 
and  terrestrial  applications,  on  the  other  hand,  often  encounter  multipath  distortions  and  are 
perfect  candidates  for  OFDM. 

Digital  Audio  Broadcasting 

As  listed  in  Table  13.3,  the  European  project  Eureka  147  successfully  launched  OFDM  dig¬ 
ital  audio  broadcasting  (DAB).  Eureka  147  covers  both  terrestrial  digital  audio  broadcasting 
and  direct  satellite  audio  broadcasting  without  directional  receiving  antennas.  Receivers  a 
equipped  only  with  traditional  omnidirectional  antennas.  Eureka  147  requires  opening  a  new 
spectral  band  of  1 .452  to  1 .492  MHz  in  the  L-band  for  both  terrestrial  and  satellite  broadcasting- 

Despite  the  success  of  Eureka  in  Europe,  however,  concerns  about  spectral  conflict 
the  L-band  led  the  United  States  to  decide  against  using  Eureka  147.  Instead.  DAB  in  Nort 
America  has  split  into  satellite  radio  broadcasting  by  XM  and  Sirius,  relying  on  proprietary 
technologies  on  the  one  hand  and  terrestrial  broadcasting  using  the  IBOC  (in-band,  on-channe 
standard  recommended  by  the  FCC  on  the  other.  XM  and  Sirius  competed  as  two  separa 
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TABLE  13.3 

A  Short  but  Impressive  History  of  OFDM  Applications 

Year  Events 


1995  Digital  audio  broadcasting  standard  Eureka  147:  first  OFDM  standard 

1 996  ADSL  standard  ANSI  T 1 .4 1 3  (later  became  ITU  G992. 1 ) 

1 997  DVB-T  standard  defined  by  ETSI 

1998  Magic  WAND  project  demonstrates  OFDM  modems  for  wireless  LAN 

1999  IEEE  802. 11a  wireless  LAN  standard  (Wi-Fi) 

2002  IEEE  802. 1  lg  standard  for  wireless  LAN 

2004  IEEE  802. 16d  standard  for  wireless  MAN  (WiMAX) 

2004  MediaFLO  announced  by  Qualcomm 
2004  ETSI  DVB-H  standard 

2004  Candidate  for  IEEE  802. 1 5.3a  (UWB)  standard  MB-OFDM 

2004  Candidate  for  IEEE  802. 1 1  n  standard  for  next-generation  wireless  LAN 

2005  IEEE  802. 1 6e  (improved)  standard  for  WiMAX 

2005  Terrestrial  DMB  (T-DMB)  standard  (TS  102  427)  adopted  by  ETSI  (July) 
2005  First  T-DMB  broadcast  began  in  South  Korea  (December) 

2005  Candidate  for  3.75G  mobile  cellular  standards  (LTE  and  HSOPA) 

2005  Candidate  for  CJK  (China.  Japan,  Korea)  4G  standard  collaboration 

2005  Candidate  for  IEEE  PI 675  standard  for  power  line  communications 

2006  Candidate  for  IEEE  802. 16m  mobile  WiMAX 


companies  before  completing  their  merger  in  2008.  The  new  company,  Sirius  XM.  serves 
satellite  car  radios,  while  IBOC  targets  traditional  home  radio  customers.  Sirius  XM  uses  the 
2.3  GHz  S-band  for  direct  satellite  broadcasting.  Under  the  commercial  name  of  HD  Radio 
developed  by  iBiquity  Digital  Corporation,  IBOC  allows  analog  FM  and  AM  stations  to  use 
the  same  band  to  broadcast  their  content  digitally  by  exploiting  the  gap  between  traditional 
AM  and  FM  radio  stations.  By  October  2008,  over  1.5  million  HD  radio  chipsets  have  been 
shipped  and  there  were  more  than  1800  HD  Radio  Stations  in  the  United  States  alone. 

In  satellite  radio  operation,  XM  radio  uses  the  bandwidth  of  2332.5  to  2345.0  MHz. 
This  12  5  MHz  band  is  split  into  six  carriers.  Four  carriers  are  used  for  satellite  transmis¬ 
sion  XM  radio  uses  two  geostationary  satellites  to  transmit  identical  program  content.  The 
signals  are  transmitted  with  QPSK  modulation  from  each  satellite.  For  reliable  reception,  the 
line-of-sight  signals  transmitted  from  satellite  1  are  received,  reformatted  to  mult.camer  mod¬ 
ulation  (OFDM)  and  rebroadcast  by  terrestrial  repeaters.  Each  two-carrier  group  broadcasts 
1 00  streams  of  8  kbit/s.  These  streams  represent  compressed  audio  data.  They  are  combined  by 
means  of  a  patented  process  to  form  a  variable  number  of  channels  using  a  variety  of  bit  rates. 

Sirius  satellite  radio,  on  the  other  hand,  uses  three  orbiting  satellites  over  the  frequency 
i  i  9990  to  2332  MHz  These  satellite  are  in  lower  orbit  and  are  not  geostationary.  In  fact, 
they^ follovv  a  highly  inclined  elliptical  Earth  orbit  (HEO),  also  known  as  the  Tundra  orbit.  Each 
?  <-r»mnlpfes  one  orbit  in  24  hours  and  is  therefore  said  to  be  geosynchronous.  At  any 
gt= !  tae  .1  of  the  three  satellites  will  covet  Noah  America  That,  .he  12  MH/  bandwidth 
given  t  ,  „mrino  three  carriers:  two  for  the  two  satellites  in  coverage  and  one  for 

is  equally  dm  e  8  QPSK  modulation  is  adopted  for  Sirius  transmission. 

terrestrial  repeaters.  The  h.gh^  r  ^  where  coverage  may  ^  blocked. 

Terrestrial  reP^at^r*fp  ^dio  systems,  OFDM  is  also  key  modulation  technology  in  IBOC 

f  u7h  am  IBOC  aid  the  FM  IBOC.  Unlike  satellite  DAB.  which  bundles  multiple  station 
for  both  AM  IBOC  aM  iboc  and  FM  iB0C  allow  each  station  to  use  its 

to  broadens,:  jus,  like  a  tradittonal  radio  station.  FM  IBOC  has  broader 
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bandwidth  per  station  and  provides  a  higher  data  rate.  With  OFDM,  the  FM  IBOC  subchannel 
bandwidth  equals  363.4  Hz,  and  the  maximum  number  of  subcarriers  is  1093.  Each  subcarrier 
uses  QPSK  modulation.  On  the  other  hand,  the  AM  IBOC  subchannel  bandwidth  is  181.7 
Hz  (half  as  wide),  and  as  many  as  104  subcarriers  may  be  used.  Each  subcarrier  can  apply 
16-point  QAM  (secondary  subcarriers)  or  64  point  QAM  (primary  subcarriers).  Further  details 
on  IBOC  can  be  found  in  the  book  by  Max  son. 1  (> 


13.10  BLIND  EQUALIZATION  AND  IDENTIFICATION 

Standard  channel  equalization  and  identification  at  receivers  typically  require  a  known  (train¬ 
ing)  signal  transmitted  by  the  transmitter  to  assist  in  system  identification.  Alternatively, 
the  training  sequence  can  be  used  directly  to  determine  the  necessary  channel  equalizer. 
Figure  13.18  illustrates  how  a  training  signal  can  be  used  in  the  initial  setup  phase  of  the  receiver. 

During  the  training  phase,  a  known  sequence  is  transmitted  by  the  transmitter  such  that 
the  equalizer  output  can  be  compared  with  the  desired  input  to  form  an  error.  The  equalizer 
parameters  can  be  adjusted  to  minimize  the  mean  square  symbol  error.  At  the  end  of  the  training 
phase,  the  equalizer  parameters  should  be  near  enough  to  their  optimum  values  that  much  ot 
the  intersymbol  interference  (ISI)  is  removed.  Now  that  the  channel  input  can  be  correctly 
recovered  from  the  equalizer  output  through  a  memoryless  decision  device  (sheer),  real  data 
transmission  can  begin.  The  decision  output  s[k  —  u]  can  be  used  as  the  correct  channel  input  to 
form  the  symbol  error  for  continued  equalizer  adjustment  or  to  track  slow  channel  variations. 
The  adaptive  equalizer  then  obtains  its  reference  signal  from  the  decision  output  when  the 
equalization  system  is  switched  to  the  decision-directed  mode  (Fig.  1 3. 1 8).  It  is  evident  that  this 
training  mechanism  can  be  applied  regardless  of  the  equalizer  in  use,  be  it  TSE,  FSE,  or  DFE. 

In  many  communications,  signals  are  transmitted  over  time-varying  channels.  As  a  result, 
a  periodic  training  signal  is  necessary  to  identify  or  equalize  the  time-vary  ing  channel  response. 
The  drawback  of  this  approach  is  evident  in  many  communication  systems  where  the  use  ot 
training  sequence  can  represent  significant  overhead  costs  or  may  even  be  impractical.  For 
instance,  no  training  signal  is  available  to  receivers  attempting  to  intercept  enemy  communi¬ 
cations.  In  a  multicast  or  a  broadcast  system,  it  is  highly  undesirable  for  the  transmitter  to  start 
a  training  session  for  each  new  receiver  by  temporarily  suspending  its  normal  transmission 
to  all  existing  users.  As  a  result,  there  is  a  strong  and  practical  need  for  a  special  kind  o 
channel  equalizer,  known  as  blind  equalizers,  that  do  not  require  the  transmission  of  a  training 
sequence.  Digital  cable  TV  and  cable  modems  are  excellent  examples  of  such  systems  that  can 
benefit  from  blind  equalization. 

There  are  a  number  of  different  approaches  to  the  problem  of  blind  equalization.  In  general, 
blind  equalization  methods  can  be  classified  into  direct  and  indirect  approaches.  In  the  direct 


Figure  13.18 
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blind  equalization  approach,  equalizer  filters  are  derived  directly  from  input  statistics  and  the 
observed  output  signal  of  the  unknown  channel.  The  indirect  blind  equalization  approach  first 
identifies  the  underlying  channel  impulse  response  before  designing  an  appropriate  equalizer 
filter  or  MLSE  metrics.  Understanding  these  subjects  require  in-depth  reading  of  the  literature, 
including  papers  from  the  1980s  by  Benveniste  et  al.,17,  ,s  who  pioneered  the  terminology 
“blind  equalization.”  Another  very  helpful  source  of  information  can  be  found  in  the  papers  by 
Godard,19  Picchi  and  Prati,20  Shalvi  and  Weinstein,21' 22  Rupprecht,23  Kennedy  and  Ding,24 
Tong  et  al.,25  Moulines  et  al.,26  and  Brillinger  and  Rosenblatt. :s  For  more  systematic  coverage. 
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readers  are  referred  to  several  published  books  on  this  topic.' 


13.1 1  TIME-VARYING  CHANNEL  DISTORTIONS 
DUE  TO  MOBILITY 


Thus  far,  we  have  focused  on  channel  distortions  that  are  invariant  in  time,  or  invariant  at  least 
for  the  period  of  concern.  In  mobile  wireless  communications,  user  mobility  naturally  leads  to 
channel  variation.  Two  main  causes  lead  to  time-varying  channels:  ( I )  a  change  of  surroundings 
and  (2)  the  Doppler  effect.  In  most  cases,  a  change  of  surroundings  for  a  given  user  takes  place 
at  a  much  slower  rate  than  the  Doppler  effect.  For  example,  a  transmitter/receiver  traveling  at 
the  speed  of  100  km/h,  moves  less  than  2.8  meters  in  l(X)  ms.  However,  for  carrier  frequency  of 
900  MHz.  the  maximum  corresponding  Doppler  frequency  shift  would  he  83  Hz.  This  means 
that  within  100  ms,  the  channel  could  have  undergone  8  full  cycles  of  change.  Thus,  unless  the 
mobile  unit  suddenly  turns  a  comer  or  enters  a  tunnel,  the  Doppler  effect  is  usually  far  more 
severe  than  the  effect  of  change  in  surroundings. 


in  moDiie  conimunitauwn^  ^  *** — a  ...... 

known  as  the  Doppler  effect,  described  by  the  nineteenth-century  Austrian  physicist  Christian 
_ _ w  U,  nh^rved  that  the  frequency  of  light  and  sound  waves  is  affected  by  the  relative 


m(t)  cos  a>ct 


the  relative  velocity  of  the  distance  change  between  the  source  and  the  receiver  equals  vd 


V, 

COi  =  —CDc 

c 


(13.75) 


Moreover,  because 


-vd  <  v,  <  vj 
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the  maximum  Doppler  shift  is  bounded  by 


i  i  ^ 

\(Oi\  <  comax  =  - (oc 

c 


Based  on  the  Doppler  analysis,  each  path  has  a  Doppler  frequency  shift  &>/,  delay  r„  and 
path  attenuation  a,.  The  signal  from  the  ith  path  can  be  written  as 


Off 


Y,  Re  {sk)p(t  -  kT  -  r,) 


cos  t(ft>c  +  a >,-)(/  -  r,)l 


+  ai 


^Im  {sk}p(t-kT  -  ti ) 


sin  [(a>c  +  a>,)(/  -  r,)] 


(13.76) 


As  a  result,  the  baseband  receiver  signal  after  demodulation  is  now 


y(t)  =  YSk 


k 


T.  <*i  exp  [~j(wc  +  a>,)r,]  exp  (-jwp)  p(t  -  kT  -  r,) 


i=0 

K 


AW 

Y^MOpit-kT  -Ti) 


.1=0 


(13.77) 


Frequency-Selective  Fading  Channel 

Recall  that  the  original  baseband  transmission  is 

x(t)  = 

k 

In  the  channel  output  of  Eq.  (13.77),  if  the  mobile  velocity  is  zero,  then  a>j  =  0  and  =  ft 
are  constant.  In  the  case  of  zero  mobility,  the  baseband  channel  output  simply  becomes 


yd)  =  £ 


YfiiP(t-kT -Ti) 


k  L/=0 

This  means  that  corresponding  channel  is  linear  time-invariant  with  impulse  response 

K 


h(t)  =  Y  Pi  -  Ti ) 


(13.78) 


i=0 


and  transfer  function 


K 


H{f)  =  Y  P‘  exp (-j2nfTi) 


(13.79) 


i=0 


This  is  a  frequency-selective  channel  with  intersymbol  interference  (ISI). 


13.1 1  Time- Varying  Channel  Distortions  Due  to  Mobility  815 


When  the  mobile  speed  vj  is  not  zero,  then  /},•(/)  are  time-varying.  As  a  result,  the  channel 
is  no  longer  linear  time-invariant.  Instead,  the  channel  is  linear  time-varying.  Suppose  the 
channel  input  is  a  pure  sinusoid,  x(t)  =  exp {jcopt).  The  output  of  this  time-varying  channel 
according  to  Eq.  (13.77)  is 

K  K 

y,  Pi(t)  exp  [jwp(t  -  Tj)]  =  exp <j(Opt)  ■  y  fij(t)  exp( -jiopXi)  ( 1 3.80) 

1=0  1=0 

This  relationship  shows  that  the  channel  response  to  a  sinusoidal  input  equals  a  sinusoid  of 
the  same  frequency  but  with  time-varying  amplitude.  Moreover,  the  time-varying  amplitude 
of  the  channel  output  also  depends  on  the  input  frequency  (a>p).  For  these  multipath  channels, 
the  channel  response  is  time-varying  and  is  frequency  dependent.  In  wireless  communica¬ 
tions,  time-varying  channels  are  known  ns  fading  channels.  When  the  time-varying  behaviors 
are  dependent  on  frequency,  the  channels  are  known  as  frequency-selective  fading  channels. 
Frequency-selective  fading  channels,  which  are  characterized  by  time-varying  ISI,  are  major 
obstacles  to  wireless  digital  communications. 

Flat  Fading  Channels 

One  special  case  to  consider  is  when  the  multipath  delays  {r,}  do  not  have  a  large  spread.  In 
other  words,  let  us  assume 


0  =  ro  <  t|  <  •  •  •  < 

If  the  multipath  delay  spread  is  small,  then  xK  «  T  and 

n  ~  o  i  =  i,2,...,^ 

In  this  special  case,  because  p(t  -  r,)  »  p(t),  the  received  signal  y(t)  is  simply 


>•(/)  =  £> 
k 


y  on  exp  l-j(a)c  +  (Oi)Xi]  exp  (~jio,t)  pit  -  kT  -  r() 
1=0 

K  \ 

oil  exp  [—j((oc  +  (L>i)Tj\  exp  (—jcop)  p(t  —  k7  )  j 


i=0 


'^T.Skpit  -  kT) 


j  cti  exp  [— jijMc  +  <*)i)Ti]  exp  (  jco.t ) 

=  p(t)  -J2akp(*  ~  kT) 

k 

where  we  have  defined  the  time-varying  channel  gain  as 

K 

p(t)  =  y<*i exp  [ -j(toc  +  coi)Xi]  exp  (->//) 


i=0 


(13.81) 


(13.82) 


,  ,  mllifinnth  delav  spread  is  small,  the  only  distortion  in  the  received 

signab'ot'a  Lading  gaia  f><»-  of  .he  received  signal  s.reng.h  is 
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known  as  fading.  Channels  that  exhibit  only  a  time-varying  gain  that  is  dependent  on  the 
environment  are  known  as  flat  fading  channels.  Flat  fading  channels  do  not  introduce  any  IS1 
and  therefore  do  not  require  equalization.  Instead,  since  flat  fading  channels  generate  output 
signals  that  have  time-varying  strength,  periods  of  error-free  detections  tend  to  be  followed 
by  periods  of  error  bursts.  To  overcome  burst  errors  due  to  flat  fading  channels,  interleaving 
forward  error  correction  codewords  is  an  effective  tool. 

Converting  Frequency-Selective  Fading  Channels 
into  Flat  Fading  Channels 

Fast  fading  frequency-selective  channels  pose  serious  challenges  to  mobile  wireless  communi¬ 
cations.  On  one  hand,  the  channels  introduce  ISI.  On  the  other  hand,  the  channel  characteristics 
are  also  time  varying.  Although  the  time  domain  equalization  techniques  described  in  Secs.  13.3 
to  13.6  can  effectively  mitigate  the  effect  of  ISI,  they  require  training  data  to  either  identify 
the  channel  parameters  or  estimate  equalizer  parameters.  Generally,  parameter  estimation  of 
channels  or  equalizers  cannot  work  well  unless  the  parameters  stay  nearly  unchanged  between 
successive  training  periods.  As  a  result,  such  time  domain  channel  equalizers  are  not  well 
equipped  to  confront  fast  changing  channels. 

Fortunately,  we  do  have  an  alternative.  We  have  shown  (in  Sec.  13.7)  that  OFDM  can  con¬ 
vert  a  frequency-selective  channel  into  a  parallel  group  of  flat  channels.  When  the  underlying 
channel  is  fast  fading  and  frequency  selective,  OFDM  can  effectively  converts  it  into  a  bank 
of  fast  flat-fading  channels.  As  a  result,  means  to  combat  fast  flat-fading  channels  such  as  code 
interleaving  can  now  be  successfully  applied  to  fast  frequency-selective  fading  channels. 

We  should  note  that  for  fast  fading  channels,  another  very  effective  means  to  combat  the 
fading  effect  is  to  introduce  channel  diversity.  Channel  diversity  allows  the  same  transmit¬ 
ted  data  to  be  sent  over  a  plurality  of  channels.  Channel  diversity  can  be  achieved  in  the  time 
domain  by  repetition,  in  the  frequency  domain  by  using  multiple  bands,  or  in  space  by  applying 
multiple  transmitting  and  receiving  antennas.  Because  both  time  diversity  and  frequency  diver¬ 
sity  occupy  more  bandwidth,  spatial  diversity  in  the  form  of  multiple-input-multiple-output 
(MIMO)  systems  has  been  particularly  attractive  recently.  Among  recent  wireless  standards, 
Wi-Fi  (IEEE  802. 1 1  n),  WiM AX  (IEEE  802. 1 6e),  and  cellular  LTE  ( long-term  evolution)  have 
all  adopted  OFDM  and  MIMO  technologies  to  achieve  much  higher  data  rate  and  better 
coverage.  We  shall  present  some  fundamental  discussions  on  MIMO  in  Chapter  14. 


13.12  MATLAB  EXERCISES 


We  provide  three  different  computer  exercises  in  this  section;  all  model  a  QAM  communica 
tion  system  that  modulates  data  using  16-QAM  constellation.  The  16-QAM  signals  then  pass 
through  linear  channels  with  ISI  and  encounter  additive  white  Gaussian  noise  (AWGN)  at  the 
channel  output. 


COMPUTER  EXERCIESE  13.1:  16-QAM  LINEAR  EQUALIZATION 

The  first  MATLAB  program,  Exl  3_1 .  m.  generates  1 ,000.000  points  of  1 6-QAM  data  for  transmission. 
Each  QAM  requires  T  as  the  symbol  period.  The  transmitted  pulse  shape  is  a  root-raised  cosine  wit 
roll-off  factor  of  0.5  [Eq.  (13.23)].  Thus  the  bandwidth  at  the  baseband  is  0.15/T  Hz. 

%  Matlab  Program  <Exl3_l.m> 

-s  This  Matlab  exercise  <Exl3__l.m>  performs  simulation  of 
*  linear  equalization  under  QAM-16  baseband  transmission 
%  a  multipath  channel  with  AWGN. 
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%  Correct  carrier  and  synchronization  is  assumed. 

%  Root-raised  cosine  pulse  of  rolloff  factor  =  0.5  is  used 
%  Matched  filter  is  applied  at  the  receiver  front  end. 

%  The  program  estimates  the  symbol  error  rate  (SER)  at  different  Eb/N 


clear; elf ; 

L=1000000 ;  %  Total  data  symbols  in  experiment  is  1  million 

%  To  display  the  pulse  shape,  we  oversample  the  signal 
%  by  factor  of  f_ovsamp=8 

f_ovsamp=8 ;  %  Oversampling  factor  vs  data  rate 

delay_rc=4 ; 

%  Generating  root-raised  cosine  pulseshape  (rolloff  factor  =  0.5) 
prcos=rcosf It  ( [  1  ] ,  1,  f_ovsamp,  'sqrt',  0.5,  delay_rc) ;  %  RRC  pulse 
prcos=prcos  (1 :end-f_ovsamp+l) ;  %  remove  0's 

prcos=prcos/norm (prcos) ;  ^  normalize 

pcmatch=prcos ( end : - 1 : 1 ) ; 


%  Generating  random  signal  data  for  polar  signaling 
s  data=4*round (rand (L, 1) ) +2*round (rand (L, 1) ) -3+. . . 

+j* (4*round(rand(L,l) ) +2*round(rand(L, 1) ) -3) ; 

%  upsample  to  match  the  'oversampling  rate'  (normalize  by  1/T)  . 
%  It  is  f_ovsamp/T  (T=l  is  the  symbol  duration) 

s  up=upsample (s_data, f_ovsamp) ; 


%  Identify  the  decision  delays  due  to  pulse  shaping 

%  and  matched  filters 

de 1 ay r c = 2 * de 1 ay_r c * f _ovsamp ; 

%  Generate  polar  signaling  of  different  pulse-shaping 
xrcos=conv (s_up, prcos ) ; 

[c  num, c  den]  =  cheby2 (12 , 20,  ( 1+0 . 5 ) /8  ; 

%  The  next  commented  line  finds  frequency  response 
% [H, fnlz] =f reqz (c_num, c_den,  512 , 8)  ; 

,  fiupr  is  the  Tx  filter  before  signal  is  sent  to  channel 

%  The  lowpass  filter  is  tne 

xchout=f ilter (c_num, c_den, xrcos) ; 

ln).  ,-he  DOwer  spectral  densities  of  the  two  signals 
%  We  can  now  plot  tne  puwei. 

a.  xrcos  and  xchout 

%  This  shows  the  fleering  ^^sigS^powIr  spectral  densities 
I  “STS  little  lo«p.s»  TX  filter  >»ve  distorted  the  .ign.1 

plotPSD_comparison 


%  Apply  a  2-ray  mult  p  muitipath  delta (t) -0 . 65 

mpath= [1  0  0  -0.65 J; 

%  time-domain  multipath  channel 

h=conv(conv (prcos, pcmatch) ,mpath) , 

_ _  I'-'  ■ 


delta (t-3T/8) 


xchout=conv (mpath, xchout) ; 

xrxout=conv (xchout, pcmatch) ; 


%  apply  2-ray  multipath 
%  send  the  signal  through  matched  filter 
%  separately  from  the  noise 


Su;=h*.l  ,f_ovs.e,p idel,ychb.I,f_ovs,.p, , 

clear  xrxout; 
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%  Generate  complex  random  noise  for  channel  output 
noiseq=randn (L*f_ovsamp, 1) + j  *randn (L*f_ovsamp, 1) ; 

%  send  AWGN  noise  into  matched  filter  first 
noisef lt=f ilter (pcmatch, [1] ,noiseq) ;  clear  noiseq; 

%  Generate  sampled  noise  after  matched  filter  before  scaling  it 
%  and  adding  to  the  QAM  signal 
noisesamp=noisef It (1 : f_ovsamp : L*f_ovsamp, 1) ; 

clear  noiseq  noiseflt; 

Es=10*hscale;  %  symbol  energy 

%  Call  linear  equalizer  receiver  to  work 
linear_eq 

for  i i = 1 : 1 0 ; 

Eb2Naz (ii) =2*ii-2; 

Q  (ii) =3*0 . 5*erf c (sqrt ( (2*10" (Eb2Naz (ii) *0 . 1) /5) /2) ) ; 
%Compute  the  Analytical  BER 
end 

%  Now  plot  results 
plotQAM_results 


The  transmission  is  over  a  two-ray  multipath  channel  with  impulse  response 

h(t)  =g(t)~  0.65 g(t  -  37/8) 

where  g(l)  is  the  response  of  a  low-pass  channel  formed  by  applying  a  type  II  Chebyshev  (ilter  of 
order  12,  a  stopband  gap  of  20  dB.  and  bandwidth  of  0.75/7  Hz.  The  impulse  response  of  this  channel 
is  shown  in  Fig.  13.19. 

The  main  program  Exl3_l .  m  will  call  a  subroutine  program  plot  PSD  comparison .  m  to  first 
generate  the  power  spectral  densities  of  the  transmitted  signal  before  and  after  the  low-pass  Chebyshev 
filter.  The  comparison  in  Fig.  13.20  shows  that  the  root-raised-cosine  design  is  almost  ideally  band- 
limited,  as  the  low-pass  channel  introduces  very  little  change  in  the  passband  of  the  transmitted  signal 
spectrum.  This  means  that  the  multipath  environment  is  solely  responsible  for  the  ISI  effect. 

%  MATLAB  PROGRAM  <plotPSD__comparison .  m> 

•s  This  program  computes  the  PSD  of  the  QAM  signal  before  and  after  it 
%  enters  a  good  chebyshev  lowpass  filter  prior  to  entering  the  channel 
% 

[Pdfy,fq]=pwelch(xchout, [],[] ,1024, 8, 'twosided' ) ;  %  PSD  before 

Tx  filter 

[Pdfx,fp]=pwelch(xrcos, [],[] ,1024,8, 'twosided' ) ;  %  PSD  after 

Tx  filter 
figure (1) ; 

subplot (211) ; semilogy (fp-f_ovsamp/2 , f f tshif t (Pdfx) , 'b-' ) ; 
axis ( [-4  4  l.e-10  1.2e0]); 

xlabel  ('  Frequency  (in  unit  of  1/T_s)  ')  ,-ylabel  ('  Power  Spectrum'); 
title (  (a)  Lowpass  filter  input  spectrum') 
subplot (212) ; semilogy ( fq-f_ovsamp/2 , ff tshif t (Pdfy) , 'b- ' ) ; 
axis ( [-4  4  l.e-10  1.2e0]); 

xlabel (' Frequency  (in  unit  of  1/T_s) ') ;ylabel (' Power  Spectrum'); 
title ('(b)  Lowpass  filter  output  spectrum') 
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Figure  13.19 

Two-ray 
multipath 
channel  response 
for  QAM 
transmission. 


Figure  13.20 

Power  spectral 
densities  of  the 
root-raised- 
cosine  QAM 
signal  before 
and  after  a 
low-pass  channel 
of  bandwidth 
0.75/7':  (a)  input 
and  (b)  output  of 
low-pass  filter 
spectra. 
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After  a  matched  filter  has  been  applied  at  the  receiver  (root-raised  cosine),  the  QAM  signal  will 
be  sampled,  equalized,  and  decoded.  The  subroutine  program  1  inear_eq .  m  designs  a  T-spaced  finite 
length  MM  SE  equalizer  of  order  M  =8  as  described  in  Sec.  13.3  [Eq.  ( 13.43b)].  The  equalizer  is  designed 
by  applying  the  first  200  QAM  symbols  as  training  data.  The  equalizer  filters  the  matched  filter  output 
before  making  a  16-QAM  decision  according  to  the  decision  region  of  Fig.  1 1.24b  in  Chapter  11. 


%  MATLAB  PROGRAM  <linear_eq . m> 

%  This  is  the  receiver  part  of  the  QAM  equalization  example 


% 

Ntrain=200 ; 
Neq=8 ; 
u=0 ; 


%  Number  of  training  symbols  for  Equalization 
%  Order  of  linear  equalizer  (=length-l) 

%  equalization  delay  u  must  be  <=  Neq 


SERneq=  []  ; 

SEReq=  []  ; 
for  i=l : 13 , 

Eb2N (i) =i*2-l ; 

Eb2N_num=l(T (Eb2N(i) /10) ; 
Var_n=Es/ (2*Eb2N_num) ; 
signois=sqrt (Var_n/2) ; 
zl=out_mf +signois*noisesamp; 


% ( Eb/N  in  dB) 

%  Eb/N  in  numeral 
%  1  / SNR  is  the  noise  variance 
%  standard  deviation 
%  Add  noise 


Z=toeplitz (zl (Neq+1 :Ntrain) , zl (Neq+1 : -1 : 1) ) ;  %  signal  matrx  for 

%  computing  R 

dvec= [s_data (Neq+l-u :Ntrain-u) ] ;  %  build  training  data  vector 

f=pinv(Z' *Z) *Z' *dvec;  %  equalizer  tap  vector 

dsig=f ilter (f , 1, zl) ;  %  apply  FIR  equalizer 

%  Decision  based  on  the  Re/Im  parts  of  the  samples 
deq=sign (real (dsig(l:L)  )  )  +sign(real (dsig(l:L) ) -2)  +  . . . 
sign (real (dsig(l:L) )+2) +. . . 

j* (sign(imag(dsig(l:L) ) ) +sign (imag (dsig  ( 1 : L) ) -2) +. . . 
sign ( imag (dsig ( 1 : L) ) +2) )  ; 

■s  Now  compare  against  the  original  data  to  compute  SER 
%  (1)  for  the  case  without  equalizer 
dneq=sign (real (zl (1:L) ) ) +sign (real (zl (1 :L) ) -2) +. . . 
sign (real (zl (1 : L) ) +2) + . . . 

j* (sign (imag (dsig (l:L) ) ) +sign (imag (zl  (1  :L)  ) -2) + .  . . 
sign (imag (zl ( 1 : L) ) +2) ) ; 

SERneq= [SERneq; sum (abs (s_data~=dneq) ) /  (L) ]  ; 

“5  (2)  for  the  case  with  equalizer 
SEReq= [SEReq;sum(s_data~=deq) /L] ; 

end 


Once  the  linear  equalization  results  are  available,  the  main  program  Exl  3_1 .  m  calls  another 
subroutine  program.  plotQAM_results  ,m.  to  provide  illustrative  figures.  In  Fig.  13.21,  the  noise- 
tree  eye  diagram  of  the  in-phase  component  at  the  output  of  the  receiver  matched  filter  before  sampling 
shows  a  strong  ISI  effect.  The  QAM  signal  eye  is  closed  and,  without  equalization,  a  simple  QAM 
decision  leads  to  very  high  probabilities  of  symbol  error  (also  known  as  symbol  error  rate). 

%  MATLAB  PROGRAM  <plotQAM_results . m> 

%  This  program  plots  symbol  error  rate  comparison  before  and  after 
%  equalization 

constellation  points 
eye-diagrams  before  equalization 

figure (2) 
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Figure  13.21 

Noise-free  eye 
diagram  of  the 
in-phase  (real) 
component  at  the 
receiver  (after 
matched  filter) 
before  sampling: 
the  eyes  are 
closed,  and  ISI 
will  lead  to 
decision  errors. 


Time,  Ts 


f igber=semilogy (Eb2Naz, Q, ' k- ' , Eb2N,  SERneq,  '  b-o'  ,  Eb2N, SEReq, 'b-v' ) ; 

axis (  [0  26  . 99e- 5  1]  )  ;  ,  . 

legend ['Analytical' ,  'Without  equalizer',  'With  equalizer' ) , 
xlabel (■ E_b/N  IdB) ' I ,ylabel (' Symbol  error  probability  ; 
set (f igber, ' Linewidth' , 2) ; 

%  Constellation  plot  before  and  after  equalization 
figure (3 ) 

p^Ureli(zi(l.min<L,4000,)),imag(tl(l!mina.4000l)l.'.'l, 

axis ( ' square' ) 

xlabel ('Real  part') 

title ('(a)  Before  equalization') 

ylabel ( ' Imaginary  part ' ) ; 

plot  Ueal^dsigd  main  (In  4000)  * '  . imag(dsig(l:min(L,4000) 

axis (' square' )  , 

title ('(b)  After  equalization  ) 
xlabel ( ' Real  part  ) 
ylabel ( ' Imaginary  part' ) ; 
figure (4) 
t=length (h) ; 
plot ( [1 : t] /f_ovsamp,h) ; 
xlabel ('time  (in  unit  of 

r^e^grrSe  ^itip.tk 

eyevec-eyevec  Welaychb*  1 :  '[delaychb.800.  .f.ovsemp, , 

title^Eye^diegram^in-phaae  component) 
xlabel ('Time  (in  unit  of  T_s)  ), 


T_s) ' ) 

impulse  response'); 
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Figure  13.22 

Scatter  plots  of 
signal  samples 
before  (a)  and 
after  (b)  linear 
equalization  at 
Eh/Af  =  26  dB 
demonstrate 
effective  ISI 
mitigation  by  the 
linear  equalizer. 
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We  can  suppress  a  significant  amount  of  ISI  by  applying  the  linear  equalizer  to  the  sampled  matched 
filter  output.  Figure  13.22  compares  the  “scatter  plot”  of  signal  samples  before  and  after  equalization 
at  Eb/ N  =  26  dB.  The  contrast  illustrates  that  the  equalizer  has  effectively  mitigated  much  of  the  IS 
introduced  by  the  multipath  channel. 

The  program  linear_eq .  m  also  statistically  computes  the  symbol  error  rate  (SER)  at  differen 
SNR  levels.  It  further  computes  the  ideal  SER  according  to  ISI-free  AWGN  channel  (Chapter  ID  30  ' 
for  comparison,  the  SER  without  equalization.  The  results  shown  in  Fig.  13.23  clearly  demonstrate  t  e 
effectiveness  of  linear  equalization  in  this  example. 
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COMPUTER  EXERCISE  13.2:  DECISION  FEEDBACK  EQUALIZATION 

In  this  exercise,  we  use  the  main  MATLAB  program,  Exl3_2  ,m,  to  generate  the  same  kind  ot  data  as 
in  the  last  exercise.  The  main  difference  is  that  we  adopt  a  slightly  different  two-ray  multipath  channel 

h(l)  =  gU)  —  0.83g(f  —  37" /8) 

in  which  the  ISI  is  much  more  severe.  At  the  receiver,  instead  of  using  linear  equalizers,  we  will  implement 
and  test  the  decision  feedback  equalizer  (DFE)  as  described  in  Sec.  13.6.  For  simplicity,  we  will  implement 
only  a  DFE  feedback  filter,  without  using  the  FFW  filter. 


%  Matlab  Program  <Exl3_2.m> 

%  This  Matlab  exercise  <Exl3_2.m>  performs  simulation  of 
%  decision  feedback  equalization  under  QAM- 16  baseband  transmission 
%  a  multipath  channel  with  AWGN. 

%  Correct  carrier  and  synchronization  is  assumed. 

%  Root-raised  cosine  pulse  of  rolloff  factor  =  0.5  is  used 
%  Matched  filter  is  applied  at  the  receiver  front  end. 

%  The  program  estimates  the  symbol  error  rate  (SER)  at  different  Eb/N 

clear ;clf;  , 

L=100000 ;  %  Total  data  symbols  in  experiment  is  1  million 

%  To  display  the  pulse  shape,  we  oversample  the  signal 

%  by  factor  of  f_ovsamp=8 

f  ovsamp=8;  %  Oversampling  factor  vs  data  rate 

%  Generating  root-raised  cosine  pulseshape  (rolloff  factor  =  0.5) 
LSSSStt.  1  1.  X.  f .ovsamp ,  'sgrf,  0.5,  delay.ro, ,  t  Repulse 
prcos=prcos  {1  :end-£_ovsamp+l) , 

prcos=prcos/norm(prcos) ,  ^ 

pcmatch=prcos ( end : - 1 : 1 ) ; 

%  Generating  random  signal  data  for  polar  signaling 
s  data=4 * round (rand (L, 1) ) +2* round (rand (L, 1>  -3+  -  •  • 

-  +  j  * ( 4  *  round ( rand ( L , 1 ) ) +2  *  round  rand ( L ,  1 ) )  -  3 )  ; 

%  upsample  to  match  the  'oversampling  rate'  (normalize  by  /  ) . 

%  it  is  f_ovsamp/T  (T=l  is  the  symbol  duration) 
s  up=upsample (s_data,  f_ovsamp) ; 

%  identify  the  decision  delays  due  to  pulse  shaping 
%  and  matched  filters 
rtplavrc=2*delay  rc*f_ovsamp; 

.  Generate  pol.J  signaling  of  different  pulse-sh.p.ng 

xrcos=conv ( s_up , prcos ) ,  .  . 

tr  num  c  den]  =  cheby2 (12 , 20 , (1+0 . 5) /8 ) ; 

Tme  commented  line  finds  frequency  response 

% [H, fnlz] =f reqz (c_num, c_den,  512,8), 

fiU.r  is  the  Tx  filter  before  signal  is  sent  to  channel 
%  The  lowpass  filter  is 

xchout=f liter (c_num, c_den, xrcos) ; 

%  we  can  now  plot  the  power  spectral  densities  of  the  two  signals 

%  h  XtheSfilte^ngUeffect  of  the  Tx  filter  before 

%  This  shows  the  the  signal  power  spectral  densities 

%  transmission  m  ten 
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%  It  shows  how  little  lowpass  Tx  filter  may  have  distorted  the  signal 
plot PSD_compari son 

%  Apply  a  2 -ray  multipath  channel 

mpath=[l  0  0  -0.83];  %  multipath  delta (t) -0 . 83  delta (t-3T/8) 

%  or  use  mpath=[l  0  0  .45]; 

%  time-domain  multipath  channel 
h=conv (conv (prcos , pcmatch) #mpath) ; 
hscale=norm (h) ; 

xchout=conv (mpath, xchout) ;  %  apply  2-ray  multipath 

xrxout=conv (xchout , pcmatch) ;  %  send  the  signal  through  matched  filter 

%  separately  from  the  noise 

delaychb=delayrc+3 ; 

out_mf =xrxout (delaychb+1 : f_ovsamp : delaychb+L*f  ovsamp) ; 
clear  xrxout ; 


%  Generate  complex  random  noise  for  channel  output 
noi seq=randn ( L*  f _ovsamp , 1 ) + j  * randn (L*f_ovsamp , 1) ; 

%  send  AWGN  noise  into  matched  filter  first 
noisef lt=f ilter (pcmatch, [1] /noiseq) ;  clear  noiseq; 

-s  Generate  sampled  noise  after  matched  filter  before  scaling  it 
%  and  adding  to  the  QAM  signal 
noisesamp=noisef It ( 1 : f_ovsamp : L*f  ovsamp, 1) ; 

clear  noiseq  noisef It; 

Es=10*hscale ;  %  symbol  energy 

*  Call  decision  feedback  equalizer  receiver  to  work 
df  e 

SERdf e=SEReq; 
for  i i = 1 : 9 ; 

Eb2Naz (ii) =2*ii ; 

Q (ii) =3*0 . 5*erf c (sqrt ( (2*10~ (Eb2Naz(ii) *0.1)/5)/2) ) ; 
%Compute  the  Analytical  BER 
end 

%  use  the  program  plotQAM_results  to  show  results 

plotQAM_results 

linear_eq 


At  the  receiver,  once  the  signal  has  passed  through  the  root- raised-cosine  matched  filter,  the  T- 
spaced  samples  will  be  sent  into  the  DFE.  The  subroutine  program  df  e .  m  implements  the  DFE  design 
and  the  actual  equalization.  The  DFE  design  requires  the  receiver  to  first  estimate  the  discrete  channel 
response.  We  use  the  first  200  QAM  symbols  as  training  data  for  channel  estimation.  We  then  compute 
the  SER  of  the  DFE  output  in  df  e .  m.  The  necessary  program  df  e .  m  is  given  here 


%  MATLtAB  PROGRAM  <dfe.m> 

%  This  is  the  receiver  part  of  the  QAM  equalization 
•s  that  uses  Decision  feedback  equalizer  (DFE) 


Ntrain=200; 

Nch=3 ; 

SEReq=  []  ;  SERneq=  []  ; 


%  Number  of  training  symbols  for  Equalization 
*  Order  of  FIR  channel  (=length-l) 
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for  i=l:13, 

Eb2N (i) =i*2-l ; 

Eb2N_num=lCT  (Eb2N(i)/10)  ; 
Var_n=Es/ (2*Eb2N_num) ; 

signois=sqrt (Var_n/2) ; 
zl=out_mf +signois*noisesamp; 


% (Eb/N  in  dB) 

%  Eb/N  in  numeral 
% 1 / SNR  is  the  noise 
variance 

%  standard  deviation 
%  Add  noise 


Z=toeplitz (s_data (Nch+1 :Ntrain) , s_data (Nch+1 : -1 : 1) )  ; 

%  signal  matrx  for 

%  computing  R 

dvec= [zl (Nch+1 :Ntrain)  ]  ; 

%  build  training  data  vector 
h_hat=pinv(Z' *Z) *Z' *dvec; 

%  find  channel  estimate  tap  vector 
zl=zl/h_hat (1) ; 

%  equalize  the  gain  loss 
h_hat=h_hat (2 :end) /h_hat (1) ; 

%  set  the  leading  tap  to  1 


%  feedback  data 
%  subtract  the  feedback 


f eedbk=zeros (1 , Nch) ; 
for  kj  =1 :L, 

zf k=f eedbk*h_hat ; 

dsig (kj ) =zl (kj ) -zfk; 

%  Now  make  decision  after  feedback 
d_temp=sign (real (dsig (kj ) ) ) +sign (real (dsig (kj ) ) -2)  +  .  .  . 

sign (real (dsig (kj) )+2)+. . . 

j  * ( sign ( imag ( dsig ( kj )))+  s ign ( imag <  ds  ig  <  k  j  >  >  -  2 )  + .  . . 

sign (imag (dsig (kj ) ) +2) ) ; 
feedbk= [d_temp  feedbk (1 :Nch-l) ) ; 

%  update  the  feedback  data 

%ndNow  compute  the  entire  DFE  decision  after  decision  feedback 

dfeq=sign(real (dsig) ) +sign (real (dsig) -2) +. . . 

sign (real (dsig) +2) + . . • 

j* (sign (imag (dsig) )+sign (imag (dsig) -2) +. . . 

sign (imag (dsig) +2) ) ; 

%f Compute" the^SER*  after  decision  feedback  equalization 
SEReq= [SEReq; sum (s_data ( 1 : L)  -dfeq) /L] ; 

%  find  the  decision  without  DFE  . 

dneq=sign(real (zl (1:L) ) ) +sign(real (zl (1 :L) ) -2) +. . . 

sign (real (zl (1:L) )+2)  +  . .  • 

j* (sign (imag (zl (1 :L) ) ) +sign (imag (zl (l:L))-2)+. 
sign (imag (zl (IsL) )+2) ) ; 

%  Compute  the  SER  without  equ®^z' atl°n 
SERneq= [SERneq; sum (abs (s_data  -dneq)  /  >1. 

f  nFF  has  been  determined,  it  is  compared  against  the  SER  of  the  linear 
0nCe  lhf£  SE*  rlsi  exerc.se  alomz  with  the  SER  from  ideal  AWGN  channel  and  the  SER  from 
equalization  from  *e  las  -  ide  the  resu,ts  in  Fig.  13.24.  From  the  comparison,  we  can  see 

a  receiver  without  equalization  P  ^  *  mitigating  channe,  ,S|.  The  linear  equalizer 

is  sli^htlybetter  at  tower  SNR  because  the  DFE  is  more  susceptible  to  error  propagation  (Sec.  13.6)  at 

lower  SNR. 
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COMPUTER  EXERCISE  13.3:  OFDM  TRANSMISSION  OF  QAM  SIGNALS 

In  the  example,  we  will  utilize  OFDM  for  QAM  transmission.  We  choose  the  number  of  subcarriers  (and 
the  FFT  size)  as  N  =  32.  We  let  the  finite  impulse  response  (FIR)  channel  to  be 


channels [0 . 3  -0.5  0  1  .2  -0.3] 


The  channel  length  is  6  (L  —  5  in  Section  13.7).  For  this  reason,  we  can  select  the  cyclic  prefix  length  to 
be  the  minimum  length  of  L  =  5. 

%  Matlab  Program  <Exl3_3.m> 

%  This  Matlab  exercise  <Exl3_3.m>  performs  simulation  of 
%  an  OFDM  system  that  employs  QAM- 16  baseband  signaling 
%  a  multipath  channel  with  AWGN . 
u  Correct  carrier  and  synchronization  is  assumed. 

%  32  subcarriers  are  used  with  channel  length  of  6 
%  and  cyclic  prefix  length  of  5. 
clear; elf; 


(SER)  compari¬ 
son  of  DFE, 
linear  equali¬ 
zation,  and 
under  ideal 
channel. 


L=16  000  00 ; 


-s  Total  data  symbols  in  experiment  is  1  milli°n 
%  number  of  data  frames 


%  Generating  random  signal  data  for  polar  signaling 
s— ' data=4* round (rand (L, 1) ) +2* round (rand (L,l))-3+... 


+j  * (4*round (rand (L, 1) ) +2* round (rand (L, 1) ) -3)  ; 


channel=  [0 . 3  -0.5  0  l  .2  -0.3] ; 
hf =f f t (channel, 32) ; 


%  channel  in  t -domain 
%  find  the  channel  in  f -domain 


p_data=reshape(s_data,32,Lfr) ; 


%  S/P  conversion 


p_td=ifft  (p__data)  ; 

p__cyc=  [p_td  (end -4  : end,  :)  ;p_td]  ; 


%  IFFT  to  convert  to  t -domain 
%  add  cyclic  prefix 
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s_cyc=reshape (p_cyc, 37*Lf r , 1) ; 
Psig=10/32 ; 

chsout=f ilter (channel , 1 , s_cyc) ; 
clear  p_td  p_cyc  s_data  s_cyc; 
noiseq= (randn (37*Lf r , 1) +j*randn(37 
SEReq=  []  ; 


%  P/S  conversion 

%  average  channel  input  power 
%  generate  channel  output  signal 
%  release  some  memory 
*Lf r, 1) ) ; 


for  ii=l : 31, 

SNR (ii) =ii-l ; 

Asig=sqrt (Psig*10~ (-SNR(ii) /10) 
x_out=chsout+Asig*noiseq; 
x_para= reshape (x_out , 37, Lf r) ; 
x_disc=x_para(6:37, :) ; 
xhat_para=f ft (x_disc) ; 


%  SNR  in  dB 
♦norm (channel) ; 

%  Add  noise 

%  S/P  conversion 
%  discard  tails 
%  FFT  back  to  f -domain 


z_data=inv (diag (hf ) )*xhat_para;  %  f-domain  equalizing 

%  compute  the  QAM  decision  after  equalization 

deq=sign (real (z_data) ) +sign(real (z_data) -2) +sign (real (z_data) +2) + . . . 

j* (sign(imag(z_data) ) +sign(imag (z_data) -2) +sign(imag (z_data) +2) )  ; 
%  Now  compare  against  the  original  data  to  compute  SER 
SEReq= [SEReq  sum (p_data~=deq, 2) /Lfr] ; 
end 


for  ii=l:9, 

SNRa (ii) =2*ii-2 ;  ,  _  % x 

Q (ii) =3*0 . 5*erfc (sqrt ( (2*10* (SNRa (ii)*0.1)/5)/2) )  ; 

%Compute  the  Analytical  BER 
end 

%  call  another  program  to  display  OFDM  Analysis 
ofdmAz 


The  main  MATLAB  program  Ex  1 3 _3  .m  completes  OFDM  modulation,  equalization,  and  detec- 
tion  Because  the  subcarriers  (subchannels)  have  different  gain  and,  consequently,  different  SNR  each 
nf  the  32  subcarriers  may  have  a  different  SER.  Thus,  simply  comparing  the  overall  SER  does  not  tell  the 
fuH  story.  For  this  reason,  we  can  call  another  program  of  dmAz  .  m  to  analyze  the  results  of  th.s  OFDM 

system. 


Sr  matt.ab  PROGRAM  <ofdmAz.m> 

%  This^program  is  used  to  analyze  the  OFDM  subcarriers  and  their 
%  receiver  outputs. 


%  Plot  the  subcarrier  gains 

f igure ( 2 ) ; 

stem (abs (hf ) ) t 

xlabel ( ' Subcarrier  label ' ) ; 

title ( ' Subchannel  gain' ) ; 


%  plot  the  subchannel  constellation  scattering  after  OFUM 
figure (3);  i-noo)  %  subchannel  1  output 

subplot (221) ;plotjz_data (1,1- 800), 

liSH !  ^  eoo“' ’  -  i  output 
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ylabel ( ' Imaginary' ) ; 

title ('(b)  Subchannel  10  output'),*axis('square')/ 

subplot (223 ) /plot ( z_data ( 15 , 1 : 800 ),'.') /  %  subchannel  15  output 

xlabel ( ' Real ' ) ; ylabel ( ' Imaginary' ) ; 

title  ('(c)  Subchannel  15  output');axis('square'); 

subplot ( 224 ) /plot ( z_data (:, 1 : 8 0 0 ),' b .') ;  %  mixed  subchannel  output 

xlabel ( ' Real ' ) /ylabel ( ' Imaginary' ) ; 

title ('(d)  Mixed  OFDM  output ')/ axis (' square' ) / 

%  Plot  the  average  OFDM  SER  versus  SER  under  "ideal  channel" 

-s  By  Disabling  5  poor  subcarriers,  average  SER  can  be  reduced, 
figure (4)  ,* 

figc=semilogy (SNRa, Q, 'k-' , SNR, mean (SEReq) , 'b-o' 

SNR,mean( [SEReq (1:14, :) ; SEReq (20 : 32 , :)] ) , 'b-s' ) ; 
set (figc, ' LineWidth' , 2) ; 

legend (' Ideal  channel Using  all  subcarriers' ,' Disabling  5  poor 
subcarriers ' ) 

title ( 'Average  OFDM  SER')/ 
axis (  [1  30  l.e-4  1] ) /hold  off; 

xlabel (' SNR  (dB) ') /ylabel (' Symbol  Error  Rate  (SER)')/ 

First,  we  display  the  subchannel  gain  H\n\  in  Fig.  13.25.  We  can  clearly  see  that,  among  the  32 
subchannels,  the  5  near  the  center  have  the  lowest  gains  and  hence  the  lowest  SNR.  We  therefore  expect 
them  to  exhibit  the  worst  performance.  By  fixing  the  average  channel  SNR  at  30  dB.  we  can  take  a  quick 
peek  at  the  equalizer  outputs  of  the  different  subcarrier  equalizers.  In  particular,  we  select  subchannels  1, 
10,  and  15  because  they  represent  the  moderate,  good,  and  poor  channels,  respectively.  Scatter  plots  of  the 
output  samples  (Fig.  13.26a-c)  clearly  demonstrate  the  quality  contrast  among  them.  If  we  do  not  make 
any  distinction  among  subchannels,  we  can  see  from  Fig.  I3.26d  that  the  overall  OFDM  performance  is 
dominated  mainly  by  the  poor  subchannels. 

We  can  also  look  at  the  SER  of  all  32  individual  subcarriers  in  Fig.  13.27.  We  see  very  clearly  that 
the  5  worst  channels  are  responsible  for  the  5  worst  SER  performances.  Naturally  if  we  average  the  SER 
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Figure  13.26 
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Figure  13.28 

Average  SER  of 
the  OFDM 
subcarriers 
before  and  after 
disabling  five 
worst  channels. 


across  all  32  subchannels,  the  larger  SERs  tend  to  dominate  and  make  the  overall  SER  of  the  OFDM 
system  much  higher. 

To  make  the  OFDM  system  more  reliable,  one  possible  approach  is  to  apply  bit  loading.  In  fact,  one 
extreme  case  of  bit  loading  is  to  disable  all  the  poor  subchannels  (i.e.,  to  send  nothing  on  the  subchannels 
with  very  low  gains).  We  can  see  from  the  SER  comparison  of  Fig.  13.28  that  by  disabling  5  of  the  worst 
channels  among  the  32  subcamers,  the  overall  SER  is  significantly  reduced  (improved).  
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PROBLEMS 


13.1-1  In  a  QAM  transmission  of  symbol  rate  1/7=1  MHz.  assume  that  p(t)  is  a  raised-cosine  pulse 
with  roll-off  factor  of  0.5.  The  carrier  frequency  in  use  is  2.4  GHz. 

(a)  Derive  the  resulting  baseband  pulse  part  q(t)  when  the  multipath  channel  impulse  response 
is  given  by 

0.956(f)  -  0.36(f  -  77 2) 

(b)  Show  whether  the  eye  is  open  for  QPSK  transmission  in  part  (a)  when  the  channel  outputs 
are  sampled  at  t  =  kT. 


13.2-1  Consider  the  signal  transmission  model  of  Prob.  13.1-1. 

(a)  Determine  the  matched  filter  for  the  equivalent  baseband  pulse  resulting  from  the  multipath 
channel. 

(b)  Determine  the  equivalent  discrete  time  linear  system  transfer  function  H(z)  between  the 
QAM  input  symbols  and  the  matched  filter  output  sampled  at  /  =  kT. 


13.2- 2  In  a  digital  QAM  system,  the  received  baseband  pulse  shape  is  q(t)  =  A  (y^).  The  channel 

noise  (before  the  matched  filter)  is  AWGN  with  spectrum  ofAT/2. 

(a)  Find  the  power  spectral  density  of  the  noise  w(f )  at  the  matched  filter  output. 

(b)  Determine  the  mean  and  the  variance  of  the  sampled  noise  w[AT]  at  the  matched  filter 
output. 

(c)  Show  whether  the  noise  samples  w[AT]  are  independent. 

13.2- 3  The  matched  filter  q(—t)  in  Prob.  13.2-2  may  be  replaced  by  an  antialiasing,  root-raised-cosine 

pulse. 

(a)  Find  the  power  spectral  density  of  the  noise  w(f)  at  the  matched  filter  output. 

(b)  Determine  the  mean  and  the  variance  of  the  sampled  noise  w[kT]  at  the  matched  filter 
output. 

(c)  Show  whether  the  noise  samples  w[£T]  are  independent. 

(d)  Describe  how  the  maximum  likelihood  sequence  estimation  receivers  are  different  between 
this  receiver  filter  and  the  matched  filter  in  Prob.  13.2-2. 

13.3- 1  In  a  BPSK  baseband  system,  the  discrete  time  channel  is  specified  by 

H(z)  =  1  +0.6-T1 

The  received  signal  samples  are 

zlk]  =  H(z)sk  +  w[*] 

The  BPSK  signal  is  =  ±1  with  equal  probability.  The  discrete  channel  noise  w[&]  is  additiv e 
white  Gaussian  with  zero  mean  and  variance  M/ 2  such  that  the  Eh/M  =  18. 

(a)  Find  the  probability  of  error  if  z[k]  is  directly  sent  into  a  BPSK  decision  device. 

(b)  Find  the  probability  of  error  if  z[k]  first  passes  through  a  zero-forcing  equalizer  before  a 
BPSK  decision  device. 
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13.3- 2  Repeat  Prob.  13.3-2  if  the  discrete  channel 

H(z)=  1  +0.9z-1 

13.3- 3  Compare  the  BER  results  of  Probs.  13.3-1  and  13.3-2.  Observe  the  different  depth  of  the  channel 

spectral  nulls  and  explain  their  BER  difference  based  on  the  dillerent  the  noise  amplification 
effect. 


13.3- 4  For  the  channel  of  Prob.  13.3-1.  find  the  response  of  a  six-tap  MMSE  equalizer.  Determine  the 

resulting  minimum  MSE.  What  is  the  corresponding  MSE  if  the  ZF  equalizer  is  applied  instead? 

13.3- 5  Repeat  Prob.  1 3.3-3  for  the  FIR  channel  of  Prob.  1 3.3-2. 

13.3- 6  Repeat  Prob.  13.3-3  for  the  FIR  channel  in  Example  1 3. 1 . 

13.4- 1  In  a  fractionally  sampled  channel,  the  sampling  frequency  is  chosen  to  be  2/T  (i.e..  there  arc- 

two  samples  for  every  transmitted  symbol  sk ).  The  two  sampled  subchannel  responses  are 


tf|(z)  =  1+0.9; 


-1 


H2(z)  =  -0.3  +  0.5z 


,-l 


Both  subchannels  have  additive  white  Gaussian  noises  that  are  independent  with  zero  mean  and 
identical  variance  <t£  =  0.2.  The  input  symbol  sk  is  a  PAM-4  with  equal  probability  of  being 

(±1,  ±3). 

(a)  Show  that  F,  (z)  =  0.3  and  F2(z)  =  I  form  a  zero-forcing  equalizer. 

(b)  Show  that  F\  (z)  =  1  and  F2(z)  =  -18  also  form  a  zero-forcing  equalizer. 

(c)  Show  which  of  the  two  previous  fractionally  spaced  ZF  equalizers  delivers  better  per¬ 
formance.  This  result  shows  that  ZF  equalizers  of  different  delays  can  lead  to  different 
performance. 

13.4-2  For  the  same  system  of  Prob.  1 3.4- 1 ,  complete  the  following. 

(a)  Find  the  ZF  equalizers  of  delays  0. 1 .  and  2,  respectively  when  the  ZF  equalizer  filters  have 


order  1,  that  is. 


Fi(z)=fi[0)+fi\ Hz"*  '='-2 


(b)  Find  the  resulting  noise  distribution  at  the  equalizer  output  for  each  of  the  three  fractionally 
spaced  ZF  equalizers. 

(c)  Determine  the  probability  of  symbol  error  if  hard  PAM  decision  is  taken  from  the  equalizer 
output. 

. ,  .  ,  For  the  same  system  of  Prob.  13.4-1,  find  the  fractionally  spaced  MMSE  equalizers  of  delays  0 
and  1  respecti  vely,  when  each  ZF  equalizer  filter  have  order  0,  that  is. 

Fi(z)  =m  1-1.  2 

-f  prot,  1 3.4- 1 .  find  the  fractionally  spaced  MMSE  equalizers  of  delays  0, 
13'4'4  Landl're^c-ively.  when  each  ZF  equalizer  filter  have  order  I .  that  is. 

Fi(z)=fiio)+M  nr1  »'  =  1.2 

13.6-1  In  a  DFE  for  binary  polar  signaling.  5*  =  ±1  with  equal  probability.  The  feedforward  filter 
output  d[k]  is  given  by 


d[k]  =  sk—2  +  0.85*_3  +  w|k| 
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where  w[fc]  is  white  Gaussian  with  zero  mean  and  variance  0.04. 

(a)  Determine  the  DFE  filter  coefficient. 

(b)  Find  the  DFE  output  BER  when  the  decisions  in  feedback  are  error  free. 

(c)  If  the  decision  device  is  not  error  free,  then  there  w  ill  be  error  propagation.  Find  the  prob¬ 
ability  of  error  of  the  next  decision  on  symbol  s*_ 2  when  the  previous  decision  s*_ 3  is 
known  to  be  wrong. 

13.7- 1  Prove  that  Wjy  •  =z  INxN. 

13.7- 2  A  cyclic  matrix  is  a  matrix  that  is  completely  specified  by  its  first  row  (or  column).  Row  i  is 

a  circular  shift  of  the  elements  in  row  /  —l.  In  other  words,  if  the  first  row  of  matrix  C  is 
ci\ ,  . . . ,  aN*  then  its  second  row  is  a\ ,  . . . ,  a/v-D  and  so  on.  Prove  that  any  cyclic 

matrix  of  size  N  x  N  can  be  diagonalized  by  Wjy  and  W  ^ 1 ,  that  is, 

Wn  •  C  •  =  diagonal 

13.7- 3  Consider  an  FIR  channel  with  impulse  response 

MO]  =  1.0,  MU  =  -0.5,  M2]  =  0.3 

The  channel  noise  is  additive  white  Gaussian  with  spectrum  J\f / 2.  Design  an  OFDM  system 
with  N  =  16  by  (a)  specifying  the  length  of  the  cyclic  prefix;  (b)  determining  the  N  subchannel 
gains;  (c)  deriving  the  bit  error  rate  of  each  subchannels  for  BPSK  modulations;  (d)  finding  the 
average  bit  error  rate  of  the  entire  OFDM  system. 

13.7- 4  Consider  an  FIR  channel  of  order  up  to  L.  First,  we  apply  the  usual  IDFT  on  the  source  data 

vector  via 


Next,  instead  of  applying  a  cyclic  prefix  as  in  Eq.  ( 1 3.64b),  we  insert  a  string  of  L  zeros  in  front 
of  every  N  data  before  transmission  as  in 


'  SN 
SN- 1 


*1 

0 


(Af  +L)  x  1 


L  0 


This  zero-padded  data  vector  is  transmitted  normally  over  the  FIR  channel  {/z[A:]}.  At  the  receiver 
end,  we  stack  up  the  received  symbols  {-[/?]}  into 


-  z[Af]  1 
zlN-  1] 

■  0  - 

y  = 

z[L] 

+ 

0 

z[N  +  L) 

-  z[l]  . 

,z[N  +  1]_ 
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Prove  that 


[H\N  | 


z  = 


H\N-  1] 


s 


H{  +  \\. 


This  illustrates  the  equivalence  between  zero  padding  and  cyclic  prefix  in  OFDM. 

13.7-5  Show  that  in  AWGN  channels,  cyclic  OFDM  and  zero-padded  OFDM  achieve  identical  SNRs 
for  the  same  channel  input  power. 


A  INTRODUCTION  TO 
AJ-  INFORMATION  THEORY 


Among  all  the  means  of  communication  discussed  thus  far,  none  produces  error-free 
communication.  We  may  be  able  to  improve  the  accuracy  in  digital  signals  by  reducing 
the  error  probability  Pe.  But  it  appears  that  as  long  as  channel  noise  exists,  our  com¬ 
munications  cannot  be  free  of  errors.  For  example,  in  all  the  digital  systems  discussed  thus 
far,  Pe  varies  as  e~kEb  asymptotically.  By  increasing  Eb ,  the  energy  per  bit,  we  can  reduce  Pe 
to  any  desired  level.  Now,  the  signal  power  is  5/  =  £/>/?/,,  where  Rb  is  the  bit  rate.  Hence, 
increasing  Eb  means  either  increasing  the  signal  power  5/  (for  a  given  bit  rate),  decreasing  the 
bit  rate  Rb  (for  a  given  power),  or  both.  Because  of  physical  limitations,  however,  5/  cannot 
be  increased  beyond  a  certain  limit.  Hence,  to  reduce  Pe  further,  we  must  reduce  /?/,,  the  rate 
of  transmission  of  information  digits.  Thus,  the  price  to  be  paid  for  reducing  Pe  is  a  reduction 
in  the  transmission  rate.  To  make  Pe  approach  0,  Rb  also  approaches  0.  Hence,  it  appears  that 
in  the  presence  of  channel  noise  it  is  impossible  to  achieve  error-free  communication.  Thus 
thought  communication  engineers  until  the  publication  of  Shannon’s  seminal  paper1  in  1948. 
Shannon  showed  that  tor  a  given  channel,  as  long  as  the  rate  of  information  digits  per  second  to 
be  transmitted  is  maintained  within  a  certain  limit  determined  by  the  physical  channel  (known 
as  the  channel  capacity),  it  is  possible  to  achieve  error-free  communication.  That  is,  to  attain 
Pe  -►  0,  it  is  not  necessary  to  make  Rh  -*  0.  Such  a  goal  (Pe  -►  0)  can  be  attained  by 
maintaining  R/y  below  C,  the  channel  capacity  (per  second).  The  gist  of  Shannon’s  paper  is 
that  the  presence  of  random  disturbance  in  a  channel  does  not,  by  itself,  define  any  limit  on 
transmission  accuracy.  Instead,  it  defines  a  limit  on  the  information  rate  for  which  an  arbitrarily 
small  error  probability  (Pe  0)  can  be  achieved. 

We  have  been  using  the  phrase  “rate  of  information  transmission”  as  if  information  could 
be  measured.  This  is  indeed  so.  We  shall  now  discuss  the  information  content  of  a  message  as 
understood  by  our  “common  sense”  and  also  as  it  is  understood  in  the  “engineering  sense." 
Surprisingly,  both  approaches  yield  the  same  measure  of  information  in  a  message. 

14.1  MEASURE  OF  INFORMATION 

Commonsense  Measure  of  Information 

Consider  the  following  three  hypothetical  headlines  in  a  morning  paper: 

1 .  There  will  be  daylight  tomorrow. 

2.  United  States  invades  Iran. 

3.  Iran  invades  the  United  States. 
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The  reader  will  hardly  notice  the  first  headline  unless  he  or  she  lives  near  the  North  or  the 
South  Pole.  The  reader  will  be  very,  very  interested  in  the  second.  But  what  really  catches 
the  leader  s  attention  is  the  third  headline.  This  item  will  attract  much  more  interest  than  the 
other  two  headlines.  From  the  viewpoint  of  “common  sense,”  the  first  headline  conveys  hardly 
any  information;  the  second  conveys  a  large  amount  of  information;  and  the  third  conveys  yet 
a  larger  amount  of  information.  If  we  look  at  the  probabilities  of  occurrence  of  these  three 
events,  we  find  that  the  probability  of  occurrence  of  the  first  event  is  unity  (a  certain  event), 
that  of  the  second  is  low  (an  event  of  small  but  finite  probability),  and  that  of  the  third  is 
practically  zero  (an  almost  impossible  event).  If  an  event  of  low  probability  occurs,  it  causes 
greater  surprise  and,  hence,  conveys  more  information  than  the  occurrence  of  an  event  of  larger 
probability.  Thus,  the  information  is  connected  with  the  element  of  surprise,  which  is  a  result 
of  uncertainty,  or  unexpectedness.  The  more  unexpected  the  event,  the  greater  the  surprise, 
and  hence  the  more  information.  The  probability  of  occurrence  of  an  event  is  a  measure  of  its 
unexpectedness  and,  hence,  is  related  to  the  information  content.  Thus,  from  the  point  of  view 
of  common  sense,  the  amount  of  information  received  from  a  message  is  directly  related  to  the 
uncertainty  or  inversely  related  to  the  probability  of  its  occurrence.  If  P  is  the  probability  of 
occurrence  of  a  message  and  /  is  the  information  gained  from  the  message,  it  is  evident  from 
the  preceding  discussion  that  when  P  — ►  1 ,  /  — ►  0  and  when  P  —*■  0,  /  — ►  oo,  and,  in  general 
a  smaller  P  gives  a  larger  /.  This  suggests  the  following  information  measure: 

I 

/~  log-  (14.1) 


Engineering  Measure  of  Information 

We  now  show  that  from  an  engineering  point  of  view,  the  information  content  of  a  message 
is  consistent  with  the  intuitive  measure  [Eq.  (14.1)].  What  do  we  mean  by  an  engineering 
point  of  view?  An  engineer  is  responsible  for  the  efficient  transmission  of  messages.  For  this 
service  the  engineer  will  charge  a  customer  an  amount  proportional  to  the  information  to  be 
transmitted.  But  in  reality  the  engineer  will  charge  the  customer  in  proportion  to  the  time  that 
the  message  occupies  the  channel  bandwidth  for  transmission.  In  short,  from  an  engineering 
point  of  view,  the  amount  of  information  in  a  message  is  proportional  to  the  (minimum)  time 
required  to  transmit  the  message.  We  shall  now  show  that  this  concept  of  information  also 
leads  to  Eq.  (14.1).  This  implies  that  a  message  with  higher  probability  can  be  transmitted  in  a 
shorter  time  than  that  required  for  a  message  with  lower  probability.  This  fact  may  be  verified 
by  the  example  of  the  transmission  of  alphabetic  symbols  in  the  English  language  using  Morse 
code  This  code  is  made  up  of  various  combinations  of  two  symbols  (such  as  a  dash  and  a  dot 
in  Morse  code  or  pulses  of  height  4  and  -A  volts).  Each  letter  is  represented  by  a  certain 
combination  of  these  symbols,  called  the  codeword,  which  has  a  certain  length.  Obviously, 
for  efficient  transmission,  shorter  codewords  are  assigned  to  the  letters  e,  t,  a, .  and  o.  wh.ch 
occur  more  frequently.  The  longer  codewords  are  assigned  to  letters  x,k.q,  and  i,  wh.ch  occur 
less  frequently  Each  letter  may  be  considered  to  be  a  message.  It  is  obvious  that  the  letters  that 
occur  more  frequently  (with  higher  probability  of  occurrence)  need  a  shorter  t.me  to  transmit 
(shorter  codewords)  than  those  with  smaller  probability  of  occurrence.  We  shall  now  show 
that  on  the  average,  the  time  required  to  transmit  a  symbol  (or  a  message)  w.th  probability  of 

occurrence  P  is  indeed  proportional  to  log  (l//1). 

For  the  sake  of  simplicity,  let  us  begin  with  the  case  of  binary  messages  m,  and  m2,  which 
are  equally  likely  to  occur.  We  may  use  binary  digits  to  encode  these  messages,  representing 

consider  !he  cas^of  the  four  equiprobable  messages  mi .  m,.  m3,  and  mj.  If  drese  messages  are 


INTRODUCTION  TO  INFORMATION  THEORY 


encoded  in  binary  form,  we  need  a  minimum  of  two  binary  digits  per  message.  Each  binary 
digit  can  assume  two  values.  Hence,  a  combination  of  two  binary  digits  can  form  the  four 
codewords  00,  01,  10,  11,  which  can  be  assigned  to  the  four  equiprobable  messages  m\,  m2, 
m3,  and  m4,  respectively.  It  is  clear  that  each  of  these  four  messages  takes  twice  as  much 
transmission  time  as  that  required  by  each  of  the  two  equiprobable  messages  and,  hence, 
contains  twice  as  much  information.  Similarly,  we  can  encode  any  one  of  eight  equiprobable 
messages  with  a  minimum  of  three  binary  digits.  This  is  because  three  binary  digits  form  eight 
distinct  codewords,  which  can  be  assigned  to  each  of  the  eight  messages.  It  can  be  seen  that, 
in  general,  we  need  log2  n  binary  digits  to  encode  each  of  n  equiprobable  messages.*  Because 
all  the  messages  are  equiprobable,  P,  the  probability  of  any  one  message  occurring,  is  \/n. 
Hence,  to  encode  each  message  (with  probability  P),  we  need  log2(  1 7^)  binary  digits.  Thus, 
from  the  engineering  viewpoint,  the  information  /  contained  in  a  message  with  probability  of 
occurrence  P  is  proportional  to  log2  (1  //*), 


(14.2) 


where  k  is  a  constant  to  be  determined.  Once  again,  we  come  to  the  conclusion  (from  the  engi¬ 
neering  viewpoint)  that  the  information  content  of  a  message  is  proportional  to  the  logarithm 
of  the  reciprocal  of  the  probability  of  the  message. 

We  shall  now  define  the  information  conveyed  by  a  message  according  to  Eq.  ( 14.2).  The 
proportionality  constant  is  taken  as  unity  for  convenience,  and  the  information  is  then  in  terms 
of  binary  units,  abbreviated  bit  (binary  unit). 


(14.3) 


1  =  lo§2  p  bits 


According  to  this  definition,  the  information  /  in  a  message  can  be  interpreted  as  the 
minimum  number  of  binary  digits  required  to  encode  the  message.  This  is  given  by  log2  (l/P)> 
where  P  is  the  probability  of  occurrence  of  the  message.  Although  here  we  have  shown  this 
result  for  the  special  case  of  equiprobable  messages,  we  shall  show  in  the  next  section  that  it 
is  true  for  nonequiprobable  messages  also. 

Next,  we  shall  consider  the  case  of  r-ary  digits  instead  of  binary  digits  for  encoding.  Each 

of  the  r-ary  digits  can  assume  r  values  (0,  1,2 _ ,r  -  1).  Each  of  n  messages  (encoded  by 

r-ary  digits)  can  then  be  transmitted  by  a  particular  sequence  of  r-ary  signals.  Because  each 
r-ary  digit  can  assume  r  values,  k  r-ary  digits  can  form  a  maximum  of  r*  distinct  codewords. 
Hence,  to  encode  each  of  the  n  equiprobable  messages,  we  need  a  minimum  of  k  =  logr  n  r-ary 
digits. '  But  n  =  1  / P.  where  P  is  the  probability  of  occurrence  of  each  message.  Obviously, 
we  need  a  minimum  of  logr(l/P)  r-ary  digits.  The  information  /  per  message  is  therefore 


(14.4) 


/  =  logr  —  r-ary  units 
From  Eqs.  (14.3)  and  (14.4)  it  is  evident  that 


I  =  log2  —  bits  =  log,  —  r-ary  units 


*  Here  we  are  assuming  that  the  number  n  is  such  that  log2  n  is  an  integer.  Later  on  we  shall  observe  that  this 
restriction  is  not  necessary. 


Here  again  we  are  assuming  that  n  is  such  that  logr  n  is  an  integer.  As  we  shall  see  later,  this  restriction  is  not 


necessary. 
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Hence,* 


1  r- ary  unit  =  log2  r  bits 


(14.5) 


A  Note  on  the  Unit  of  Information:  Although  it  is  tempting  to  use  the  r- ary  unit  as  a 
general  unit  of  information,  the  binary  unit  bit  (r  =  2)  is  commonly  used  in  the  literature. 
There  is,  of  course,  no  loss  of  generality  in  using  r  —  2.  These  units  can  always  be  converted 
into  any  other  units  by  using  Eq.  (14.5).  Henceforth,  unless  otherwise  stated,  we  shall  use  the 
binary  unit  (bit)  for  information.  The  bases  of  the  logarithmic  functions  will  be  omitted,  but 
will  be  understood  to  be  2. 

Average  Information  per  Message:  Entropy  of  a  Source 

Consider  a  memoryless  source  m  emitting  messages  m\,  m2,  . . . ,  mn  with  probabilities  P 1, 

P2 . P/i,  respectively  (P\  +  P2  H - f-  P„  =  1).  A  memoryless  source  implies  that  each 

message  emitted  is  independent  of  the  previous  message(s).  By  the  definition  in  Eq.  (14.3  )  [or 
Eq.  (14.4)],  the  information  content  of  message  m,-  is  /,,  given  by 


//  =  log  —  bits 

*  1 


(14.6) 


The  probability  of  occurrence  of  m ,  is  P,.  Hence,  the  mean,  or  average,  information  per  message 
emitted  by  the  source  is  given  by  £"=i  PJi  bits.  The  average  information  per  message  of  a 
source  m  is  called  its  entropy,  denoted  by  tf(m).  Hence, 


n 


//(m)  =  £/>,/,  bits 


1=1 


1 

=  Pi  log  —  bits 


(14.7a) 


n 


=  -  Pj  log  P,  bits 


(14.7b) 


1=1 


The  entropy  of  a  source  is  a  function  of  the  message  probabilities.  It  is  interesting  to  find  the 
message  probability  distribution  that  yields  the  maximum  entropy.  Because  the  entropy  is  a 
measure  of  uncertainty,  the  probability  distribution  that  generates  the  maximum  uncertainty 
will  have  the  maximum  entropy.  On  qualitative  grounds,  one  expects  entropy  to  be  maximum 
when  all  the  messages  are  equiprobable.  We  shall  now  show  that  this  is  indeed  true. 


* 


In  general, 


1  r-ary  unit  =  logv  r  5-ary  units 


The  10-ary  1 
(along  with  1 
foundations 


1  hartley  =  log2  10  =  3.32  bits 


Sometimes  the  unit  nat  is  used. 


1  nat  =  log2  e  =  1 .44  bits 
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Because  H( m)  is  a  function  of  PUP2 . Pn,  the  maximum  value  of  W(m)  is  found  from 

.  _  o  for  i  =  1,  2,  . . . ,  n,  with  the  constraint  that 

(14.8) 


the  equation  dH{m)ldPi  =  0  for  /  =  1,  2 . n,  with  the  constraint  that 

1  =  Pi  +  P2  -\ - F  Pn-l  +  Pn 


Because  the  function  for  maximization  is 

n 

H(m  )  = -^  Pi  log  Pi 

i=i 

need  to  use  the  Lagrangian  to  form  a  new  function 

/(Pi,  Pi,...,  Pn)  =  -^PilogPi  +  k(P\  +P2  +  ---  +  Pn-\  +Pn~D 


(14.9) 


we 


/=! 


Hence, 


|  log  e  -  log  Pj  +  X 


=  -  log  Pj  +  k-  log  e  j  =  1,2,  . 


Setting  the  derivatives  to  zero  leads  to 

2X 

p,  =  p2  =  .  ■  ■  =  P„  =  — 

By  invoking  the  probability  constraint  of  Eq.  (14.8),  we  have 


...  n 


Thus, 


p,  =  p2  = . . .  =  pn  =  - 


(14.10) 


To  show  that  Eq.  (14.10)  yields  [W(m)]max  and  not  L//(m)]mjn,  we  note  that  when  Pi  1  J 
p2  =  p3  =  . . .  =  P„  =  0,  H( m)  =  0,  whereas  the  probabilities  in  Eq.  (14.10)  yie 

"  1  1 

H  (m)  =  —  -  log  -  =  log  n 

'  n  n 


•  Bar* 

The  Intuitive  (Commonsense)  and  the  Engineering  Interpretations  of  Entropy^  ^me 

Her  we  observed  that  both  the  intuitive  and  the  engineering  viewpoints  lead  to  ^  ^ 
definition  of  the  information  associated  with  a  message.  The  conceptual  bases,  oW^  reta. 
entirely  different  for  the  two  points  of  view.  Consequently,  we  have  two  physica  ,n^tent  Qf 
tions  of  information.  According  to  the  engineering  point  of  view,  the  information  c  ^ 
any  message  is  equal  to  the  minimum  number  of  digits  required  to  encode  the  mes 
therefore,  the  entropy  H(m)  is  equal  to  the  minimum  number  of  digits  per  message 


K  ^ 
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on  the  average,  for  encoding.  From  the  intuitive  standpoint,  on  the  other  hand,  information  is 
thought  of  as  being  synonymous  with  the  amount  of  surprise,  or  uncertainty,  associate  wit 
the  event  (or  message).  A  smaller  probability  of  occurrence  implies  more  uncertainty  about  the 
event  Uncertainty  is,  of  course,  associated  with  surprise.  Hence  intuitively,  the  information 
associated  with  a  message  is  a  measure  of  the  uncertainty  (unexpectedness)  of  the  message 
Therefore,  log  (1/P,)  is  a  measure  of  the  uncertainty  of  the  message  »i(,  and  2-/=i  r'  log  ' 
is  the  average  uncertainty  (per  message)  of  the  source  that  generates  messages/?! 

m„  with  probabilities  . P,  Both  these  interpretations  prove  useful  in  the  qualitative 

understanding  of  the  mathematical  definitions  and  results  in  information  theory.  En  ropy  y 
also  be  viewed  as  a  function  associated  with  a  random  variable  m  that  assumes  values  m,, 
m2,  . . . ,  >nn  with  probabilities  P(m\),P(m2),  P(mn)' 


W(m)  =  E  P(mi)  log  jL)  =  £  Pi  108  7, 

i=i  '=' 


Thnc  «/p  ran  associate  an  entropy  with  every  discrete  random  variable. 

If  the  source  is  not  memoryless  (i.e.,  in  the  event  that  a  message  emitted ^  at  an) aimers 
dependent  of  the  previous  messages  emitted),  then  the  source  entropy  will  be  less  than  Him) 
in  Eq.  (1 4.9).  This  is  because  the  dependence  of  a  message  on  previous  messages  reduce. 

uncertainty. 
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•  •  mkr-r  nf  himrv  dieits  required  to  encode  a  message  was  shown  to  be  equal  to 

The  minimum  nu  the  messages  of  Ihe  source  are  equiprobable  (each  message 

Ihe  source  Hze  this  res„l,  the  case  of  nonequiprobable  messages. 

^aSw  imthe“mge  number  of  binary  digits  per  message  required  for  encoding  ,s 

Sa^itc^rrseqnence.  Then  according  to  the  relative  frequency  interpretation 
(or  law  of  large  numbers), 

lim 

N  — ►  00  N 

KIP  in  a  seauence  of  N  messages  (provided  N  ->■  oo). 

Thus,  the  message  m,  occun >  >  ^  ^  w,  wi„  occur  APi  times,  m2  will  occur 

Therefore,  in  a  typical  seq  A11  other  compositions  are  extremely  unlikely  to 

NPl  times,  ;quence  (where  N  -  oo)  has  the  same  proportion  of  the 

occur  (P  -*■  0).  Thus,  ay  yp  different.  We  shall  assume  a  memoryless 

n  messages,  although  in  ge"era  message  is  emitted  from  the  source  independently  of  the 
source;  that  is,  we  a*der  now  a  typical  sequence  SN  of  N  messages  from  the  source, 
previous  messages.  ^hnbilitvFi  Pi  ?n)  occur  NP\ ,  NP2 ,  •  ••*  NPn  times,  and 

l^urc^c^ 

is  given  by 
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Because  all  possible  sequences  of  N  messages  from  this  source  have  the  same  composition, 
all  the  sequences  (of  N  messages)  are  equiprobable,  with  probability  P(S,\).  We  can  consider 
these  long  sequences  as  new  messages  (which  are  now  equiprobable).  To  encode  one  such 
sequence  we  need  L,\>  binary  digits,  where 


Ln  =  log 


binary  digits 


(14.12) 


Substituting  Eq.  (14.1 1)  into  Eq.  (14.12),  we  obtain 


1 

Ln  =nJ2  pi  log  =  Wtffm)  binary  digits 

i=l  ' 

Note  that  Ln  is  the  length  (number  of  binary  digits)  of  the  codeword  required  to  encode  N 
messages  in  sequence.  Hence,  L,  the  average  number  of  digits  required  per  message,  is  Ln/N 
and  is  given  by 


L=-^-=H( m)  binary  digits  (14.13) 

Thus,  by  encoding  N  successive  messages,  it  is  possible  to  encode  a  sequence  ot  source 
messages  using,  on  the  average,  H( m)  binary  digits  per  message,  where  H (m)  is  the  entropy 
of  the  source  message  (in  bits).  Moreover,  one  can  show  that  H( m)  is  indeed,  on  the  average, 
the  minimum  number  of  digits  required  to  encode  this  message  source.  It  is  impossible  to  fin 
any  uniquely  decodable  code  whose  average  length  is  less  than  5 


Huffman  Code 

The  source  encoding  theorem  says  that  to  encode  a  source  with  entropy  H( m),  we  need,  on  t  e 
average,  a  minimum  of  H (m)  binary  digits  per  message.  The  number  of  digits  in  the  codewor 
is  the  length  of  the  codeword.  Thus,  the  average  word  length  of  an  optimum  code  is  H( m). 
Unfortunately,  to  attain  this  length,  in  general,  we  have  to  encode  a  sequence  of  N  messages 
(jV  -»■  oo)  at  a  time.  If  we  wish  toencode  each  message  directly  without  using  longer  sequences, 
then,  in  general,  the  average  length  of  the  codeword  per  message  will  be  greater  than  H (m  • 
In  practice,  it  is  not  desirable  to  use  long  sequences,  since  they  cause  transmission  delay  an 
add  to  equipment  complexity.  Hence,  it  is  preferable  to  encode  messages  directly,  even  l 
price  has  to  be  paid  in  terms  of  increased  word  length.  In  most  cases,  the  price  turns  out  to 
small.  The  following  is  a  procedure,  given  without  proof,  for  finding  the  optimum  sourceco 
called  the  Huffman  code.  The  proof  that  this  code  is  optimum  can  be  found  elsewhere. 

We  shall  illustrate  the  procedure  with  an  example  using  a  binary  code.  We  first  arraa^ 
the  messages  in  the  order  of  descending  probability,  as  shown  in  Table  14.1.  Here  we 
six  messages  with  probabilities  0.30,  0.25,  0.15,  0.12,  0.08,  and  0.10,  respectively.  Weno^ 
aggregate  the  last  two  messages  into  one  message  with  probability  Ps  +  Pt  =  ' 

leaves  five  messages  with  probabilities,  0.30,  0.25,  0.18,  0.15,  and  0.12.  These  messages ^ 
now  rearranged  in  the  second  column  in  the  order  of  descending  probability.  We  rePeat.  m 
procedure  by  aggregating  the  last  two  messages  in  the  second  column  and  rearranging  ^ 
in  the  order  of  descending  probability.  This  is  done  until  the  number  of  messages  is  re 
tvo  (reduced)  messages  are  now  assigned  0  and  I  as  their  first  digits  in 


to  two.  These  two  (reduced)  messages  ; 
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TABLE  14.1 


Original  Source 


Reduced  Sources 


Messages  Probabilities  S\  S  2  S  3  54 


mi 

m2 

m3 

m4 

m5 

m6 


0.30 

0.25 

0.15 

0.12 

0.08 

0.10 


0.30 

0.25 

>0.18 

0.15 

0.12 


0.30 

0.27 

0.25 

0.18 


0.43 

0.30 

0.27 


I — >0.57 
0.43 


TABLE  14.2 


Original  Source 

Code 

Reduced  Sources 

Messages  Probabilities 

Si 

S2 

S3 

s4 

m  i 

0.30 

00 

0.30 

00 

0.30 

00 

r>0.43 

1 

r>0.57 

0 

i 

m2 

0.25 

10 

0.25 

10 

r>0.27 

01 

0.30 

00 

_J  0.43 

1 

m3 

0.15 

010 

— >  0. 1 8 

11 

0.25 

10 

0.27 

01 

tri4 

0.12 

on 

0.15 

010 

0.18 

11 

m5 

0.08 

110 

0.12 

on  _ 

m6 

0.10 

111 

sequence.  We  now  go  back  and  assign  the  numbers  0  and  1  to  the  second  digit  for  the  two 
messages  that  were  aggregated  in  the  previous  step.  We  keep  regressing  in  this  way  until  the 
first  column  is  reached.  The  code  finally  obtained  (for  the  first  column)  can  be  shown  to  be 
optimum.  The  complete  procedure  is  shown  in  Tables  14.1  and  14.2. 

The  optimum  (Huffman)  code  obtained  this  way  is  also  called  a  compact  code.  The 
average  length  of  the  compact  code  in  the  present  case  is  given  by 

l=J2  piL‘ = °-3(2) + °-25(2) + 0I5(3) + 012(3) + a,(3) + 008(3) 

1=1 

=  2.45  binary  digits 

The  entropy  tf(m)  of  the  source  is  given  by 


— ,  1 
W(m)  =  T  Pilogz- 

M  1 

=  2.418  bits 


Hence  the  minimum  possible  length  (attained  by  an  infinitely  long  sequence  of  messages) 
k  2  418  binary  digits.  By  using  direct  coding  (the  Huffman  code),  it  is  possible  to  attain  an 
average  length  of  2.45  bits  in  the  example  given.  This  is  a  close  approximation  of  the  optimum 
performance  atlainable.  Thus,  little  is  gained  by  complex  coding  of  a  number  of  messages  m 

this  case. 
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The  merit  of  any  code  is  measured  by  its  average  length  in  comparison  to  H{ m)  (the 
average  minimum  length).  We  define  the  code  efficiency  i)  as 

H(  m) 


where  L  is  the  average  length  of  the  code.  In  our  present  example. 


n  = 


2.418 

2.45 


=  0.976 


The  redundancy  y  is  defined  as 


Y  =  1  -  n 
=  0.024 

Even  though  the  Huffman  code  is  a  variable  length  code,  it  is  uniquely  decodable.  It 
we  receive  a  sequence  of  Huffman-coded  messages,  it  can  be  decoded  only  one  way,  that  is, 
without  ambiguity.  For  instance,  if  the  source  in  this  exercise  were  to  emit  the  message  sequence 
mimsmmirnmsmt ....  it  would  be  encoded  as  (>0110100001 1010111 ....  The  reader  may 
verify  that  this  message  sequence  can  be  decoded  only  one  way.  viz.  m\rn$m2m\ .... 
even  if  there  is  no  demarcation  between  individual  messages.  This  uniqueness  is  assured  by 
the  special  property  that  no  codeword  is  a  prefix  of  another  (longer)  codeword. 

A  similar  procedure  is  used  to  find  a  compact  r-ary  code.  In  this  case  we  arrange  t  e 
messages  in  descending  order  of  probability,  combine  the  last  r  messages  into  one  message, 
and  rearrange  the  new  set  (reduced  set)  in  the  order  of  descending  probability.  We  repeat  t  e 
procedure  until  the  final  set  reduces  to  r  messages.  Each  of  these  messages  is  now  assigne 

one  of  the  r  numbers  0.  1,2 . r  -  1.  We  now  regress  in  exactly  the  same  way  as  in  t  e 

binary  case  until  each  of  the  original  messages  has  been  assigned  a  code. 

For  an  r-ary  code,  we  will  have  exactly  r  messages  left  in  the  last  reduced  set  i  .an 
only  if.  the  total  number  of  original  messages  is  r  +  k(r  -  1),  where  k  is  an  integer,  is 
is  because  each  reduction  decreases  the  number  of  messages  by  r  —  1 .  Hence,  it  there  isa 
total  of  k  reductions,  the  total  number  of  original  messages  must  be  r  +  k(r  —  1).  In  ca^e 
original  messages  do  not  satisfy  this  condition,  we  must  add  some  dummy  messages  wit  2ero 
probability  of  occurrence  until  this  condition  is  fulfilled.  For  example,  if  r  =  4  and  the  num  er 
of  messages  n  is  6,  then  we  must  add  one  dummy  message  with  zero  probability  of  occurrence 
to  make  the  total  number  of  messages  7,  that  is,  [4  +  1(4—  1 )],  and  proceed  as  usua  • 
procedure  is  illustrated  in  Example  14. 1 . 


Example  1 4. 1 


A  memoryless  source  emits  six  messages  with  probabilities  0.3, 0.25, 0. 1 5, 0. 1 2, 0. 1 ,  and 
Find  the  4-ary  (quaternary)  Huffman  code.  Determine  its  average  word  length,  the  e  cie 
and  the  redundancy. 


f 

In  this  case,  we  need  to  add  one  dummy  message  to  satisfy  the  required  condition  ^ 
r  +  k(r  —  1)  messages  and  proceed  as  usual.  The  Huffman  code  is  found  in  Tab  e 
The  length  L  of  this  code  is 

L  —  0.3(1)  +  0.25(1 )  +  0.15(1)  +  0.1 2(2)  +  0. 1  (2)  +  0.08(2)  +  0(2) 

=  1.3  4-ary  digits 
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Example  1 4.2 


TABLE  14.3 


Original  Source 


Messages 

Probabilities 

Code 

Reduced  Sources 

m  i 

0.30 

0 

0.30  0 

mj 

0.25 

2 

—  >  0.30  1 

m$ 

0.15 

3 

0.25  2 

W4 

0.12 

10 

0.15  3 

m$ 

0.10 

11 

mb 

0.08 

12 

mi 

0.00 

13 

Also, 


W4(m)  =  ~YhPi  loS4  Pi 
1=1 

=  1.209  4-ary  units 


The  code  efficiency  t?  is  given  by 


„  =  f^=0.93 

'  1.3 


The  redundancy  y  =  1  —  t]  =  0.07. 


To  achieve  code  efficiency  if  -  1,  we  need  N  -  oo.  The  Huffman  code  uses  N  =  I ,  but 
its  efficiency  is,  in  general,  less  than  1.  A  compromise  exists  between  these  two  extremes  of 
N  =  1  and  N  =  oo  We  can  encode  a  group  of  N  =  2  or  3  messages.  In  most  cases,  the  use 
of  N  =  2  or  3  can  yield  an  efficiency  close  to  1 ,  as  the  following  example  shows. 

A  memoryless  source  emits  messages  m,  and  m2  with  probabilities  0.8  and  0.2  respectively. 

Find  .he  optimum  (Huffman)  binary  code  for  (his  source  as  well  as  fonts  second-  and  (htrd. 
order  extensions  (i.e„  for  N  =  2  and  3).  Determine  the  code  efficiencies  in  each  ease. 

The  Huffman  code  for  the  source  is  simply  0  and  1.  giving  L=  1,  and 

H(m)  =  -(0.8  log  0.8  +  0.2  log  0.2) 

=  0.72  bit 


Hence, 


if  =  0.72 


.  .  nr(ler  extension  of  the  source  (N  =  2),  there  are  four  possible  composite 

mes^s  Zt',. «,«.  and  oteo^with  pmbahilities  0.64.  0.16.  0.16.  and  0.04. 

respectively.  The  Huffman  code  is  obtained  in  Table  . 


846 


INTRODUCTION  TO  INFORMATION  THEORY 


TABLE  14.4 


Original  Source 

Messages  Probabilities  Code 

Reduced  Source 

m\m\  0.64  0 

mi  m2  0.16  11 

mjm  1  0.16  100 

m.2m.2  0.04  101 

0.64  0 

- >  0.20  10" 

0.16  11 

0.64  0 

l - >  0.36  1 

TABLE  14.5 


Messages 

Probabilities 

Code 

mimjmi 

0.512 

0 

mi  mi  m2 

0.128 

100 

mim2m] 

0.128 

101 

m2mimi 

0.128 

110 

m\m2fn2 

0.032 

11100 

m2  w  1  m2 

0.032 

11101 

m2m2ffi\ 

0.032 

11110 

n\2m2m2 

0.008 

11111 

In  this  case  the  average  word  length  L'  is 

L'  =  0.64(  1 )  +  0. 1 6(2)  +  0. 1 6(3)  +  0.04(3) 

=  1.56 

This  is  the  word  length  for  two  messages  of  the  original  source.  Hence  L,  the  word  length 
per  message,  is 


and 


L=-=  0.78 
2 


0.72 

n=  —  =  0.923 
0.78 


If  we  proceed  with  N  =  3  (the  third-order  extension  of  the  source),  we  have  eight  possible 
messages,  and  following  the  Huffman  procedure,  we  find  the  code  as  shown  in  Table  14.5. 
The  word  length  L"  is 


L"  =  (0.5 1 2)  1  +  (0. 1 28  +  0. 1 28  +  0. 1 28)3 
+  (0.032  +  0.032  +  0.032)5  +  (0.008)5 
=  2.184 


Then, 


L" 

L=  —  =  0.728 
3 
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0.72 

0.728 


0.989 


14.3  ERROR-FREE  COMMUNICATION  OVER  A 
NOISY  CHANNEL 

As  seen  in  the  previous  section,  messages  of  a  source  with  entropy  Him)  can  be  encoded  by 
using  an  average  of  H( m)  digits  per  message.  This  encoding  has  zero  redundancy.  Hence, 
if  we  transmit  these  coded  messages  over  a  noisy  channel,  some  of  the  information  will  be 
received  erroneously.  There  is  absolutely  no  possibility  of  error-free  communication  over  a 
noisy  channel  when  messages  are  encoded  with  zero  redundancy.  The  use  of  redundancy,  in 
general  helps  combat  noise.  This  can  be  seen  from  a  simple  example  of  a  single  parity  check 
code  in  which  an  extra  binary  digit  is  added  to  each  codeword  to  ensure  that  the  total  number 
of  Is  in  the  resulting  codeword  is  always  even  (or  odd).  If  a  single  error  occurs  in  the  received 
codeword  the  parity  is  violated,  and  the  receiver  requests  retransmission.  This  is  a  rather 
simple  example  to  demonstrate  the  utility  of  redundancy.  More  complex  coding  procedures, 
which  can  correct  up  to  n  digits,  will  be  discussed  in  the  next  chapter. 

The  addition  of  an  extra  digit  increases  the  average  word  length  to  W(m)  +  I,  giving 
„  _  H(m)/[H(m)  +  1],  and  the  redundancy  is  1  -  t]  =  \/\H(m)  +  11.  Thus,  the  addition  ol 
an  extra  check  digit  increases  redundancy,  but  it  also  helps  combat  noise.  Immunity  against 
channel  noise  can  be  increased  by  increasing  the  redundancy.  Shannon  has  shown  that  it  is 
possible  to  achieve  error-free  communication  by  adding  sufficient  redundancy. 


Transmission  over  Binary  Symmetric  Channels 

We  consider  a  binary  symmetric  channel  (BSC)  with  an  error  probability  then  for  error- 
free  communication  over  this  channel,  messages  from  a  source  with  entropy  H( m)  must  be 
encoded  by  binary  codes  with  a  word  length  of  at  least  H  (m)/Cs,  where 


=  !-[/<,  log  1 


+  <!-/•,>  log  r^r] 


(14.14) 


,„rr  ,r  <  n  is  called  the  channel  capacity  (to  be  discussed  next  in  Sec.  14.4). 
Because  of  .'he  intentional  addition  of  redundancy  fo,  error  protection,  the  efficiency  of 
the ofcodes  it  always  below  C,  <  I .  If  a  certain  binary  channel  has  C,  =  0.4.  a  code  that 
Tan  Lhre'e  enor-free  communication  must  have  a.  leas.  2.5  Him)  btntuy  dtgtts  pe,  message, 
can  achieve  required  for  coding  without  redundancy.  This  means 

2?  “ : 2,  5  ’Z) ^Ltt  bits  per  message  bit.  Thus,  on  the  average,  for  every  2.5  dig,, s 
transmitted,  one  digit  is  the  information  dig,,  and  1.5  digits  are  redundant,  or  check,  dig  . 

giving  a  reduitdancy  of  I-  ,his  chapKr,  p„  the  error  probability  of  binary  signaling. 

■  M  d”SSl£  hence  to  make  P,  -  II.  either  S, -  oo  or  R„  -  0.  Because  S,  must  be 
varies  as  e  ■  Q  Bu,  sha„no„s  results  slate  that  it  is  really  not  necessary  to 

F'  error-free  communication  over  bandwidth  B.  All  that  is  required  is  to  hold  Kt 

c,  thechannel  capacity  per  second  (C  =  2  BCJ.  Where  is  the  dtscrepancy?  To  answer 
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Figure  14.1 

Three- 
dimensional 
cube  in 

Hamming  space. 


001 


Oil 


010 


this  question  let  us  investigate  carefully  the  role  of  redundancy  in  error-free  communication. 
Although  the  discussion  here  is  with  reference  to  a  binary  scheme,  it  is  quite  general  and  can 
be  extended  to  the  M -ary  case. 

Consider  a  simple  method  of  reducing  Pc  by  repeating  a  given  digit  an  odd  number  of 
times.  For  example,  we  can  transmit  0  and  1  as  000  and  111.  The  receiver  uses  the  majority 
rule  to  make  the  decision;  that  is,  if  at  least  two  out  of  three  digits  are  1,  the  decision  is  1,  and 
if  at  least  two  out  of  three  digits  are  0,  the  decision  is  0.  Thus,  if  fewer  than  two  of  the  three 
digits  are  in  error,  the  information  is  received  error-free.  Similarly,  to  correct  two  errors,  we 
need  five  repetitions.  In  any  case,  repetitions  cause  redundancy  but  improve  Pe  (Example  8.8). 

It  will  be  instructive  to  understand  the  situation  just  described  from  a  graphic  point  of 
view.  Consider  the  case  of  three  repetitions.  We  can  show  all  eight  possible  sequences  ot 
three  binary  digits  graphically  as  the  vertices  of  a  cube  (Fig.  14.1).  It  is  convenient  to  map 
binary  sequences  as  shown  in  Fig.  14.1  and  to  talk  in  terms  of  what  is  called  the  Hamming 
distance  between  binary  sequences.  If  two  binary  sequences  of  the  same  length  differ  iny  places 
(j  digits),  then  the  Hamming  distance  between  the  sequences  is  considered  to  bey.  Thus,  the 
Hamming  distance  between  000  and  010  (or  001  and  101)  is  1,  and  is  3  between  000  and  111 
In  the  case  of  three  repetitions,  we  transmit  binary  1  by  111  and  binary  0  by  000.  The  Hamming 
distance  between  these  two  sequences  is  3.  Observe  that  of  the  eight  possible  vertices,  we  are 
occupying  only  two  (000  and  111)  for  transmitted  messages.  At  the  receiver,  however,  because 
of  channel  noise,  we  are  liable  to  receive  any  one  of  the  eight  sequences.  The  majority  decision 
rule  can  be  interpreted  as  a  rule  that  decides  in  favor  of  the  message  (000  or  111)  that  is  at 
the  closest  Hamming  distance  to  the  received  sequence.  Sequences  000.  001.  010,  and  100  are 
within  1  unit  of  the  Hamming  distance  from  000  but  are  at  least  2  units  away  from  111  -  Hence, 
when  we  receive  any  one  of  these  four  sequences,  our  decision  is  binary  0.  Similarly,  when 
any  one  of  the  sequences  110.  Ill,  011,  or  101  is  received,  the  decision  is  binary  1. 

We  can  now  see  why  the  error  probability  is  reduced  in  this  scheme.  Of  the  eight  possi 
vertices,  we  have  used  only  two,  which  are  separated  by  3  Hamming  units.  If  we  draw  a 
Hamming  sphere  of  unit  radius  around  each  of  these  two  vertices  (000  and  111)*  the  two 
Hamming  spheres*  will  be  nonoverlapping.  The  channel  noise  can  cause  a  distance  between 
the  received  sequence  and  the  transmitted  sequence,  and  as  long  as  this  distance  is  equ* 
or  less  than  1  unit,  we  can  still  detect  the  message  without  error.  In  a  similar  way,  the  case 
of  five  repetitions  can  be  represented  by  a  hypercube  of  five  dimensions.  The  transnu 
sequences  00000  and  11111  occupy  two  vertices  separated  by  five  units,  and  the  Hamminr 


*  Note  that  the  Hamming  sphere  is  not  a  true  geometrical  hypersphere  because  the  Hamming 
geometrical  distance  (e.g.,  sequences  001.  010.  and  100  lie  on  a  Hamming  sphere  centered  at 
radius  2). 


distance  is  not  a  true 
111  and  having  a 
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spheres  of  2-unit  radius  drawn  around  each  of  these  two  vertices  would  be  nonoverlapping. 
In  this  case,  even  if  channel  noise  causes  two  errors,  we  can  still  detect  the  message  correctly. 
Hence,  the  reason  for  the  reduction  in  error  probability  is  that  we  have  not  used  all  the  available 
vertices  for  messages.  Had  we  occupied  all  the  available  vertices  for  messages  (as  is  the  case 
without  redundancy,  or  repetition),  then  if  channel  noise  caused  even  one  error,  the  received 
sequence  would  occupy  a  vertex  assigned  to  another  transmitted  sequence,  and  we  would 
inevitably  make  a  wrong  decision.  Precisely  because  we  have  left  the  neighboring  vertices 
of  the  transmitted  sequence  unoccupied,  are  we  able  to  detect  the  sequence  correctly,  despite 
channel  errors  within  a  certain  limit.  The  smaller  the  fraction  of  vertices  used,  the  smaller  the 
error  probability.  It  should  also  be  remembered  that  redundancy  (or  repetition)  is  what  makes 
it  possible  to  have  unoccupied  vertices. 


Repetition  Is  Inefficient 

If  we  continue  to  increase  /?,  the  number  of  repetitions,  we  will  reduce  Pe,  but  we  will  also 
reduce  Rh  by  the  factor  n.  But  no  matter  how  large  we  make  n,  the  error  probability  never 
becomes  zero.  The  trouble  with  this  scheme  is  that  it  is  inefficient  because  we  are  adding 
redundant  (or  check)  digits  to  each  information  digit.  To  give  an  analogy,  redundant  (or  check) 
digits  are  like  guards  protecting  the  information  digit.  To  hire  guards  for  each  information  digit 
is  somewhat  similar  to  a  case  of  families  living  on  a  certain  street  that  has  been  hit  by  several 
burglaries.  Each  family  panics  and  hires  a  guard.  This  is  obviously  expensive  and  inefficient. 
A  better  solution  would  be  for  all  the  families  on  the  street  to  hire  one  guard  and  share  the 
expense  One  guard  can  check  on  all  the  houses  on  the  street,  assuming  a  reasonably  short 
street.  If  the  street  is  too  long,  it  might  be  necessary  to  hire  a  team  of  guards.  But  it  is  certainly 
not  necessary  to  hire  one  guard  per  house.  In  using  repetitions,  we  had  a  similar  situation. 
Redundant  (or  repeated)  digits  were  used  to  help  (or  check  on)  only  one  message  digit.  Using 
the  clue  from  the  preceding  analogy,  it  might  be  more  efficient  if  we  used  redundant  digits 
not  to  check  (guard)  any  one  individual  transmitted  digit  but,  rather,  a  block  of  digits.  Herein 
lies  the  key  to  our  problem.  Let  us  consider  a  group  of  information  digits  over  a  certain  time 
interval  of  T  seconds,  and  let  us  add  some  redundant  digits  to  check  on  all  these  digits. 

Suppose  we  need  to  transmit  a  binary  information  digits  per  second.  Then  over  a  period 
of  T  seconds  we  have  a  block  of  aT  binary  information  digits.  If  to  this  block  of  information 
digits  we  add  (6  -  a)T  check  digits  (i.e.,  0-a  check  digits,  or  redundant  digits,  per  second), 
then  we  need  to  transmit  >  a)  digits  for  every  aT  information  digits.  Therefore  over  a 

T-second  interval,  we  have 


aT  =  information  digits 
pT  =  total  transmitted  digits  {ft  >  a) 
(/)  _  a)T  =  check  digits 


(14.15) 


Thus  instead  of  transmitting  one  binary  digit  every  I  /O  second  we  let  «T  digits  accumulate 
over  V  seconds  Now  consider  this  as  a  message  to  be  transmuted.  There  are  a  total  of 
over  /  second  j  seconds  we  need  to  transmit  one  of  the  2"7  possible 

such  supermess  ge  ^  are  transmitted  by  a  sequence  of  pT  binary  digits.  There 

supermessag  cgs  q{ ^  binary  digits,  and  they  can  be  represented  as  vertices  of 

are  in  all  _  P  .  Recause  we  have  only  2aT  messages  to  be  transmitted,  whereas 

a ^-dimensional  "Bemuse we^ha ^  ^  ^  ^  ^  rf  the  ^ 

2?  vertices  are  aval  a  •  ^  ,hat  we  have  reduced  the  transmission  rate  by  a  factor  of 

dimensional  hypercu .  is  independent  of  T.  The  fraction  of  the  vertices  occupied 

1 — -  “"d  - bc  - sma" as 
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simply  by  increasing  T.  In  the  limit  as  T  ->  oo.  the  occupancy  factor  approaches  0.  This  will 
make  the  error  probability  go  to  0,  and  we  have  the  possibility  of  error-free  communication. 

One  important  question,  however,  still  remains  unanswered.  What  must  be  the  rate  reduc¬ 
tion  ratio  a/P  for  this  dream  to  come  true?  To  answer  this  question,  we  observe  that  increasing 
T  increases  the  length  of  the  transmitted  sequence  ( pT  digits).  If  Pe  is  the  digit  error  proba¬ 
bility,  then  it  can  be  seen  from  the  relative  frequency  definition  (or  the  law  of  large  numbers) 
that  as  T  -*■  oo,  the  total  number  of  digits  in  error  in  a  sequence  of  PT  digits  ( PT  -*■  oo)  is 
exactly  PTPe.  Hence,  the  received  sequences  will  be  at  a  Hamming  distance  of  pTPe  from  the 
transmitted  sequences.  Therefore,  for  error-free  communication,  we  must  leave  all  the  vertices 
unoccupied  within  spheres  of  radius  pTPe  drawn  around  each  of  the  2al  occupied  vertices. 
In  short,  we  must  be  able  to  pack  laT  nonoverlapping  spheres,  each  of  radius  pTPe ,  into 
the  Hamming  space  of  dimensions  pT.  This  means  that  for  a  given  p,  a  cannot  be  increased 
beyond  some  limit  without  causing  overlap  in  the  spheres  and  the  consequent  failure  of  the 
error  correction  scheme.  Shannon's  theorem  states  that  tor  this  scheme  to  work.  a/P  must  be 
less  than  the  constant  (channel  capacity)  Cs,  which  physically  is  a  function  of  the  channel  noise 
and  the  signal  power: 


-  <  Cs  (14-16) 

P 

It  must  be  remembered  that  such  perfect,  error-free  communication  is  not  practical.  In 
this  system  we  accumulate  the  information  digits  for  T  seconds  before  encoding  them,  and 
because  T  — ►  oo,  for  error-free  communication  we  would  have  to  wait  until  eternity  to  start 
encoding.  Hence,  there  will  be  an  infinite  delay  at  the  transmitter  and  an  additional  delay  ot 
the  same  amount  at  the  receiver.  Second,  the  equipment  needed  for  the  storage,  encoding, 
and  decoding  sequence  of  infinite  digits  would  be  monstrous.  Needless  to  say,  the  dream  ot 
error-free  communication  cannot  be  achieved  in  practice.  Then  what  is  the  use  of  Shannon  s 
result?  For  one  thing,  it  indicates  the  upper  limit  on  the  rate  of  error-free  communication  that 
can  be  achieved  on  a  channel.  This  in  itself  is  monumental.  Second,  it  indicates  that  we  can 
reduce  the  error  probability  below  an  arbitrarily  small  level  by  allowing  only  a  small  reduction 
in  the  rate  of  transmission  of  information  digits.  We  can  therefore  seek  a  compromise  between 
error-free  communication  with  infinite  delay  and  virtually  error-free  communication  with  a 
finite  delay. 


14.4  CHANNEL  CAPACITY  OF  A  DISCRETE 
MEMORYLESS  CHANNEL 

This  section  treats  discrete  memoryless  channels.  Consider  a  source  that  generates  a  message 
that  contains  r  symbols  x\,  x%,  . . . ,  xr.  The  receiver  receives  symbols  yj,  yj,  •  •  •  >  Ts-  The  set 
of  symbols  { vjt }  may  or  may  not  be  identical  to  the  set  fx* ),  depending  on  the  nature  of  the 
receiver.  If  we  use  receivers  of  the  types  discussed  in  Chapter  1 1,  the  set  of  received  symbols 
will  be  the  same  as  the  set  transmitted.  This  is  because  the  optimum  receiver,  upon  receiving  a 
signal,  decides  which  of  the  r  symbols  x\ ,  xj,  ....  x,  has  been  transmitted.  Here  we  shall  be 
more  general  and  shall  not  constrain  the  set  {y* )  to  be  identical  to  the  set  {x* ). 

If  the  channel  is  noiseless,  then  the  reception  of  some  symbol  Yj  uniquely  determines 
the  message  transmitted.  Because  of  noise,  however,  there  is  a  certain  amount  of  uncertainty 
regarding  the  transmitted  symbol  when  yj  is  received.  If  P(x,|y/)  represents  the  conditiona 
probabilities  that  x,  was  transmitted  when  y/  is  received,  then  there  is  an  uncertainty  o 
log  [1/P(x,|y,)]  about  x,  when  yj  is  received.  When  this  uncertainty  is  averaged  over  a 
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Xi  and  yjy  we  obtain  H(x |y),  which  is  the  average  uncertainty  about  the  transmitted  symbol  x 
when  a  symbol  y  is  received.  Thus, 


h uiy)  =  E  E p{Xi • »)  '°g  bits  i*1-  symbo1 


(14.17) 


For  noiseless  channels,  the  uncertainty  would  be  zero.*  Obviously,  this  uncertainty,  H(\ |y), 
is  caused  by  channel  noise.  Hence,  it  is  the  average  loss  of  information  about  a  transmitted 
symbol  when  a  symbol  is  received.  We  call  tf(x|y)  the  conditional  entropy  of  x  given  y  (i.e., 

the  amount  of  uncertainty  about  x  once  y  is  known). 

Note  that  PiyjM  represents  the  a  priori  probability  that  yj  is  received  when  x{  is  trans¬ 
mitted.  This  is  a  characteristic  of  the  channel  and  the  receiver.  Thus,  a  given  channel  (with  its 
receiver)  is  specified  by  the  channel  matrix. 


Inputs 


Outputs 

y' 

y2 

ys 

X\ 

X2 

Ffyiki) 
P(y\  \X2) 

P(yi\x\)  •• 
P(y2\x2)  •• 

•  P(ys\x\) 
■  P(ys\x2) 

Xr 

P(y\ W 

P(yi\xr)  •  ■ 

•  P(y's\Xr) 

We  can  use  Bayes’  rule  to  obtain  the  a  posteriori  (or  reverse)  conditional  probabilities  P(xi\yj): 


=  PW 

_  P(yj\x,)P(xi) 
P(yj\xi)P(xi) 


ZlP(xl)P(yj\xi) 


(14. 1 8a) 
(14.18b) 
(14.18c) 


Thus  if  the  input  symbol  probabilities  P(Xi)  and  the  channel  matrix  are  known  the  a  posteriori 
Thus,  if  the  input  symp  uted  from  Eqs.  (14.18).  The  a  postenon  conditional 

conditional  pro  _  hahilitv  that  x  was  transmitted  when  yj  is  received, 

probability  P(x,|y>)  is  ep  average* amount  of  information  received  would  be  W(x) 

For  a  nolsftfE,^r  received  symbol.  Note  that  ff(x)  is  the  average  information 
bits  (entropy  of  the  s  )  P  B^ause  of  channel  noise,  even  when  receiving  y  we 

transmitted  over  in  lhe  average  amount  of  W(x|y)  bits  of  information  per 

Stm  bTTrefore  in  this  transaction  of  receiving  y,  the  amount  of  information  the  receiver 

receives  te!  on  ^  average.  ««»>  P"  "«i«d  Wh‘re 

/(x;y)  =  //(x)-//(x|y)  bits  per  symbol  (14.19) 

- ^  f  r  o  nnivu»less  channel  all  the  probabilities  in  Eq.  (14.17)  are  either  0  or  1 

*  This  can  be  verified  o°and  if  P(Xi\yj)  =  0,  then  P(xhyj)  =  PiyjWxty)  =  °  ™s  shows  tha‘ 

If  P(Xj\yj)  =  K  then  logll 
H(x  |y)=0. 
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/(x;  y)  is  called  the  mutual  information  of  x  and  y.  Because 


//(x)  =  £>(*,)  log 

I 


1 

m) 


bits 


we  have 


i (x; y)  =  X p(Xi) lo§ ^  “El] **<•*»») ,0s 


P(xi\yj) 


Also  because 


P(xi,yj)  =  />(*,) 
j 


We  have 


/(x;y) 


X  X! '°g 

<  j 


1 


X  X  Pu'’»)  los 
•  ; 


1 

miv,) 


X  X  ^yy)  ,og 

«  j 


nxj\yj) 

PM 


X  X  ^yy)  log 

«  j 


P(xj,yj) 

P(xi)P(yj) 


(14.20a) 

(14.20b) 


Alternatively,  by  using  Bayes’  rule  in  Eq.  (14.20a),  we  can  express  /(x;  y)  as 

;<x;y>  =  ££p(*,,;>l„g^>  ,14.20c) 

i  j  Cy/ 

or  we  may  substitute  Eq.  (14.18c)  into  Eq.  (14.20a): 


7  (x;  y)  =  X  X  p(*/)p(yy  ta)  |o§ 

i  j 


P(yj\xi ) 

Ei  p(*<)p(yyl*/) 


(14.20d) 


Equation  ( 14.20d)  expresses  / (x;  y)  in  terms  of  the  input  symbol  probabilities  and  the  channel 
matrix. 

The  units  of  /(x;  y)  should  be  carefully  noted.  Since  /(x;  y)  is  the  average  amount  of 
information  received  per  symbol  transmitted,  its  units  are  bits  per  symbol.  If  we  use  binary 
digits  at  the  input,  then  the  symbol  is  a  binary  digit,  and  the  units  of  /(x;  y)  are  bits  per  binary 
digit. 

Because  /(x;  y)  in  Eq.  ( 14.20b)  is  symmetrical  with  respect  to  x  and  y,  it  follows  that 

(14.21a) 
(14.21b) 


/(x;y)  =  /(y;x) 

=  //(y)-//(y|x) 
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The  quantity  H( y|x)  is  the  conditional  entropy  of  y  given  x  and  is  the  average  uncertainty 
about  the  received  symbol  when  the  transmitted  symbol  is  known.  Equation  ( 14.21b)  can  be 
rewritten  as 


H(\)  -  H(\ |y)  =  H( y)  -  H( y|x)  (14.21c) 

From  Eq.  ( 14.20d)  it  is  clear  that  /(x;  y)  is  a  function  of  the  transmitted  symbol  probabilities 
P(Xi)  and  the  channel  matrix.  For  a  given  channel,  /(x;y)  will  be  maximum  tor  some  set  of 
probabilities  P(jcz).  This  maximum  value  is  the  channel  capacity  Cv, 

Cs  =  max  /(x;  y)  bits  per  symbol  (14.22) 

P(Xi) 

Thus,  because  we  have  allowed  the  channel  input  to  choose  any  symbol  probabilities  P(*z), 
Cs  represents  the  maximum  information  that  can  be  transmitted  by  one  symbol  over  the 
channel.  These  ideas  will  become  clear  from  the  following  example  of  a  binary  symmetric 
channel  (BSC). 


Example  1  4.3  Find  the  channel  capacity  of  the  BSC  shown  in  Fig.  14.2. 


Figure  14.2 

Binary  symmetric 
channel. 


Let  P(jct)  =  a  and  P(xi )  =  a  =  (1  -  a).  Also, 

P(yi\x2)  =  P(y2 1*1 )  =  Pe 
PCyiljq)  =  Piyito)  =  Pe  =  \  -  Pe 

Substitution  of  these  probabilities  into  Eq.  (14.20d)  gives 

/(x;y)  =  aP, log  (  gr^fr  )  +  "P" log  («/>,  + 5 P, ) 

+  “p' log  (3^ r. :)  +  “pr  108  {jtfT+ap' ) 

=  (0P,  +  oP,)log(s-^?-) 

+  ,o.P,  +  «P,)log(--^) 

_(p,logi  +  fMogJ-) 
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Figure  14.3 

Plot  of  p(z). 


If  we  define 


p(z)  =  z  log  -  +  z  log  3 
z  z 


with  z  =  \  —  z,  then 


J(x;y)  =  p(otPe+aPe)  -  p(Pe) 


The  function  p(z)  vs.  z  is  shown  in  Fig.  14.3  It  can  be  seen  that  p(z)  is 
Z  =  5.  (Note  that  we  are  interested  in  the  region  0  <  z  <  1  only.)  For  a  gi\ 
is  fixed.  Hence  from  Eq.  (14.23)  it  follows  that  /(x;  y)  is  maximum  when  p 
is  maximum.  This  occurs  when 


oePe  +  aPe  =  0.5 


or 


a/^  +  0  -<*)(1  -Pe)  =  0.5 
This  equation  is  satisfied  when 


a  =  0.5 


14.4  Channel  Capacity  of  a  Discrete  Memoryless  Channel  855 


(14.23) 


p(z)  is  maximum  at 
For  a  given  Pe,  p(P <■) 
i  when  p(aPe  +  otP e) 


(14.24) 


(14.25) 


Figure  14.4 

Binary  symmetric 
channel  capacity 
as  a  function  of 
error 

probability  Pe- 


From  Fig.  14.4,  which  shows  Cs  vs.  Pe>  it  follows  that  the  maximum  value  of  C.v  is 
unity.  This  means  we  can  transmit  at  most  I  bit  of  information  per  binary  digit.  This  is  the 
expected  result,  because  one  binary  digit  can  convey  one  of  the  two  equiprobable  messages. 
The  information  content  of  one  of  the  two  equiprobable  messages  is  log2  2  =  1  bit.  Second, 
we  observe  that  Cs  is  maximum  when  the  error  probability  Pe  =  0  or  Pe  =  1 .  When  the 
error  probability  Pe  =  0,  the  channel  is  noiseless,  and  we  expect  C,  to  be  maximum.  But 
surprisingly,  Cs  is  also  maximum  when  Pe  =  1 .  This  is  easy  to  explain,  because  a  channel 
that  consistently  and  with  certainty  makes  errors  is  as  good  as  a  noiseless  channel.  All  we 
have  to  do  to  have  error-free  reception  is  reverse  the  decision  that  is  made;  that  is,  if  0  is 
received,  we  decide  that  1  was  actually  sent,  and  vice  versa.  The  channel  capacity  Cs  is 
zero  (minimum)  when  Pe  =  If  the  error  probability  is  then  the  transmitted  symbols 
and  the  received  symbols  are  statistically  independent.  If  we  received  0,  for  example, 
either  1  or  0  is  equally  likely  to  have  been  transmitted,  and  the  information  received 

is  zero. 


Channel  Capacity  per  Second 

The  channel  capacity  C5  in  Eq.  (14.22)  gives  the  maximum  possible  information  transmitted 
when  one  symbol  (digit)  is  transmitted.  If  K  symbols  are  being  transmitted  per  second,  then  the 
maximum  rate  of  transmission  of  information  per  second  is  KCS.  This  is  the  channel  capacity 
in  information  units  per  seconds  and  will  be  denoted  by  C  (in  bits  per  second): 

C  =  KCS 


A  Comment  on  Channel  Capacity:  Channel  capacity  is  the  property  of  a  partic¬ 
ular  nhvsical  channel  over  which  the  information  is  transmitted.  This  is  true  provided 
the  term  channel  is  correctly  interpreted.  A  channel  means  not  only  the  transmission 
medium  it  also  includes  the  specifications  of  the  kind  of  signals  (binary,  /-ary,  etc.,  or 
orthogonal,  simplex,  etc.)  and  the  kind  of  receiver  used  (the  receiver  determines  the  error 
probability)  All  these  specifications  are  included  in  the  channel  matrix.  A  channel  matrix 
completely  specifies  a  channel.  If  we  decide  to  use.  for  example.  4-ary  digits  instead  of 
u-2v  digits  over  the  same  physical  channel,  the  channel  matrix  changes  (it  becomes  a 
4x4  matrix)  as  does  the  channel  capacity.  Similarly,  a  change  in  the  receiver  or  the 
signal  power  or  noise  power  will  change  the  channel  matrix  and.  hence,  the  channel 

capacity. 
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Measuring  Channel  Capacity 

The  channel  capacity  Cs  is  the  maximum  value  of  H(\)  -  H(\ |y);  naturally,  Cs  <  max//(x) 
[because  //(x|y)  >  0].  But  H(\)  is  the  average  information  per  input  symbol.  Hence,  Cs 
is  always  less  than  (or  equal  to)  the  maximum  average  information  per  input  symbol.  If 
we  use  binary  symbols  at  the  input,  the  maximum  value  of  H(\)  is  1  bit,  occurring  when 
P(x\)  =  P(x 2)  =  2-  Hence,  for  a  binary  channel,  Cs  <  1  bit  per  binary  digit.  If  we  use 
r-ary  symbols,  the  maximum  value  of  //r(x)  is  1  r-ary  unit.  Hence,  Cs  <  1  r- ary  unit  per 
symbol. 


Verification  of  Error-Free  Communication  over  a  BSC 

We  have  shown  that  over  a  noisy  channel,  Cs  bits  of  information  can  be  transmitted  per  symbol. 
If  we  consider  a  binary  channel,  this  means  that  for  each  binary  digit  (symbol)  transmitted, 
the  received  information  is  Cs  bits  (Cs  <  1).  Thus,  to  transmit  1  bit  of  information,  we 
need  to  transmit  at  least  \/Cs  binary  digits.  This  gives  a  code  efficiency  Cs  and  redundancy 
1  —  Cv.  Here,  the  transmission  of  information  means  error-free  transmission,  since  7(x;  y) 
was  defined  as  the  transmitted  information  minus  the  loss  of  information  caused  by  channel 
noise. 

The  problem  with  this  derivation  is  that  it  is  based  on  a  certain  speculative  definition  of 
information  [Eq.  (14.1)].  And  based  on  this  definition,  we  defined  the  information  lost  during 
the  transmission  over  the  channel.  We  really  have  no  direct  proof  that  the  information  lost  over 
the  channel  will  oblige  us  in  this  way.  Hence,  the  only  way  to  ensure  that  this  whole  speculative 
structure  is  sound  is  to  verify  it.  If  we  can  show  that  Cs  bits  of  error-free  information  can  be 
transmitted  per  symbol  over  a  channel,  the  verification  will  be  complete.  A  general  case  will 
be  discussed  later.  Here  we  shall  verify  the  results  for  a  BSC. 

Let  us  consider  a  binary  source  emitting  messages  at  a  rate  of  a  digits  per  second.  We  accu¬ 
mulate  these  information  digits  over  T  seconds  to  give  a  total  of  aT  digits.  Because  aT  digits 
form  2aT  possible  combinations,  our  problem  is  now  to  transmit  one  of  these  2uT  supermes¬ 
sages  every  T  seconds.  These  supermessages  are  transmitted  by  a  code  of  word  length  pT  digits, 
with  p  >  a  to  ensure  redundancy.  Because  fiT  digits  can  form  2^J  distinct  patterns  (vertices  of 
a  pT -dimensional  hypercube),  and  we  have  only  2aT  messages,  we  are  utilizing  only  a  2-(^  a) 
fraction  of  the  vertices.  The  remaining  vertices  are  deliberately  unused,  to  combat  noise.  If  we 
let  T  — ►  00,  the  fraction  of  vertices  used  approaches  0.  Because  there  are  pT  digits  in  each 
transmitted  sequence,  the  number  of  digits  received  in  error  will  be  exactly  PTPe  when  T  00. 

We  now  construct  Hamming  spheres  of  radius  pTPe  each  around  the  2aT  vertices  used  for  the 


messages.  When  any  message  is  transmitted,  the  received  message  will  be  in  the  Hamming 
sphere  surrounding  the  vertex  corresponding  to  that  message.  We  use  the  following  decision 


rule:  If  a  received  sequence  falls  inside  or  on  a  sphere  surrounding  message  m,,  then  the  decision 
is  “w/  is  transmitted.”  If  T  00,  the  decision  will  be  without  error  if  all  the  2aT  spheres  are 
nonoverlapping. 

Of  all  the  possible  sequences  of  PT  digits,  the  number  of  sequences  that  differ  from  a 


given  sequence  by  exactly  j  digits  is  (see  Example  8.6).  Hence,  K ,  the  total  number  of 


sequences  that  differ  from  a  given  sequence  by  less  than  or  equal  to  pTPe  digits,  is 


(14.26) 
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Here  we  use  an  inequality  often  used  in  information  theory:4, 7 


£  (PT)  <  2<)TpiP')  Pe  <  0.5 

y=0  '  J  ' 


Hence, 


K  <  2pTp{P,,) 


(14.27) 


with  the  definition  that 


1  1 

p(Pe)  =  Pe  log  —  +  ( 1  -  Pe)  log 


From  the  2^T  possible  vertices  we  choose  2al  vertices  to  be  assigned  to  the  supermessages. 
How  shall  we  select  these  vertices?  From  the  decision  procedure  it  is  clear  that  if  we  assign 
a  particular  vertex  to  a  supermessage,  then  none  of  the  other  vertices  lying  within  a  sphere 
of  radius  pTPe  can  be  assigned  to  another  supermessage.  Thus,  when  we  choose  a  vertex  tor 
m\  the  corresponding  K  vertices  [Eq.  ( 14.26)]  become  ineligible  for  consideration.  We  must 
choose  from  the  remaining  2^  -  K  vertices,  another  vertex  for  m2.  We  proceed  in  this  way 
until  all  the  2^  vertices  have  been  exhausted.  This  is  a  rather  tedious  procedure.  Let  us  see 
what  happens  if  we  choose  the  required  2af  vertices  randomly  from  the  2  vertices.  In  this 
procedure  there  is  the  danger  of  selecting  more  than  one  vertex  lying  within  a  distance  pTPe.  If, 
however,  a/p  is  sufficiently  small,  the  probability  of  making  such  a  choice  is  extremely  small 
as  t  -»•  oo.  The  probability  of  choosing  any  particular  vertex  s\  as  one  of  the  _  vertices 

from  2^  vertices  is  2“r/2^  =  2-^~a)T. 

Remembering  that  K  vertices  lie  within  a  distance  of  PTP„  digits  from  s , ,  the  probability 
that  we  may  also  choose  another  vertex  s2  that  is  within  the  distance  pTPe  from  each  of  these 
K  vertices  (that  form  the  Hamming  sphere  around  s, )  is 


From  Eq.  (14.27)  it  follows  that 

P  <  2-imi-p(^)i-«i7' 

Hence,  as  T  —*■  oo,  P  0  if 

0[  1  -  p(Pe)]  >  « 


that  is,  if 


-  <  1  -  p(Pe) 

P 


(14.28a) 


But  1 


-  piP.)  is  C„  the  Channel  capacity  of  a  BSC  |Eq.  (14.25)].  Therefore. 


a  r 

r  * 


( 1 4.28b) 


U  hnhilitv  of  choosing  two  sequences  randomly  within  a  distance  pTPe  approaches 

Hence,  tePi°bparSe°d  a/p<  C»  and  we  have  error-free  communication.  We  can  choose 

=  cs  -  where  e  is  arbitrarily  small. 
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1 4.5  CHANNEL  CAPACITY  OF  A  CONTINUOUS 
MEMORYLESS  CHANNEL 


For  a  discrete  random  variable  x  taking  on  values  x\,  xi . x„  with  probabilities  P(x i), 

P(x 2), _ P(x„),  the  entropy  H(x)  was  defined  as 


rt 


H(x)  =  -  ^2  P(Xi)  log  P(Xj) 


(14.29) 


For  analog  data,  we  have  to  deal  with  continuous  random  variables.  Therefore,  we  must  extend 
the  definition  of  entropy  to  continuous  random  variables.  One  is  tempted  to  state  that  H (x)  for 
continuous  random  variables  is  obtained  by  using  the  integral  instead  of  discrete  summation 
in  Eq.  (14.29)*: 


1 


(14.30) 


H(x)=  /  p(x)  log  — -dx 

J- 00  Pto 


We  shall  see  that  Eq.  (14.30)  is  indeed  the  meaningful  definition  of  entropy  for  a  continu¬ 
ous  random  variable.  We  cannot  accept  this  definition,  however,  unless  we  show  that  it  has 
the  meaningful  interpretation  as  uncertainty.  A  random  variable  x  takes  a  value  in  the  range 
(nAx,  ( n  +  1 )  Ax)  with  probability  p(nAx)  Ax  in  the  limit  as  Ax  — *■  0.  The  error  in  the  approx¬ 
imation  will  vanish  in  the  limit  as  Ax  — *■  0.  Hence  H(x),  the  entropy  of  a  continuous  random 
variable  x,  is  given  by 


(14.31) 


In  the  limit  as  Ax  —*■  0,  log  Ax  — >  —00.  It  therefore  appears  that  the  entropy  of  a  continuous 
random  variable  is  infinite.  This  is  quite  true.  The  magnitude  of  uncertainty  associated  with 
a  continuous  random  variable  is  infinite.  This  fact  is  also  apparent  intuitively.  A  continuous 


random  variable  assumes  an  uncountable  infinite  number  of  values,  and,  hence,  the  uncertainty 
is  on  the  order  of  infinity.  Does  this  mean  that  there  is  no  meaningful  definition  of  entropy  tor 


is  on  the  order  of  infinity.  Does  this  mean  that  there  is  no  meaningful  definition  of  entropy  tor 
a  continuous  random  variable?  On  the  contrary,  we  shall  see  that  the  first  term  in  Eq.  (14.31 ) 
serves  as  a  meaningful  measure  of  the  entropy  (average  information)  of  a  continuous  random 


variable  x.  This  may  be  argued  as  follows.  We  can  consider  J'  p(x)  log  [  1  /p(x)]  dx  as  a  relative 
entropy  with  -  log  Ax  serving  as  a  datum,  or  reference.  The  information  transmitted  over  a 
channel  is  actually  the  difference  between  the  two  terms  H(x)  and  //(x|y).  Obviously,  if  we 
have  acommon  datum  for  both  H(x)  and  H(x |y),  the  difference  H(x) -H(x\y)  will  be  the  same 


*  Throughout  this  discussion,  the  PDF  px(x)  will  be  abbreviated  as  p(x):  this  practice  causes  no  ambiguity  and 
improves  the  clarity  of  the  equations. 
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as  the  difference  between  their  relative  entropies.  We  are  therefore  justified  in  considering  the 
first  term  in  Eq.  (14.31)  as  the  differential  entropy  of  x.  We  must,  however,  always  remember 
that  this  is  a  relative  entropy  and  not  the  absolute  entropy.  Failure  to  realize  this  subtle  point 
generates  many  apparent  fallacies,  one  of  which  will  be  given  in  Example  14.4. 

Based  on  this  argument,  we  define  //(x),  the  differential  entropy  of  a  continuous  random 
variable  x,  as 


r°°  l 

H(\)=  /  p(x)  log  —  dx  bits 


(14.32a) 


- dx  bits 

p(x) 


00 


(14.32b) 


p(x)  log  p(x)  dx  bits 


-00 


Although  H(x)  is  the  differential  (relative)  entropy  of  x,  we  shall  call  it  the  entropy  of  random 
variable  x  for  brevity. 


Example  14.4  a  signal  amplitude  x  is  a  random  variable  uniformly  distributed  in  the  range  (-1  1).  This  signal 

_  .  .  t-n  _ o  rru«  ^..trvnt  \i  tc  alcr*  ck  rtnrlnm  VJiriimlP.  iinitormlV 


is  passed  through  an  amplifier  of  gain  2.  The  output  y  is  also  a  random  variable,  uniformly 
distributed  in  the  range  (-2, 2).  Determine  the  (differential)  entropies  H(x)  and  H( y). 


We  have 


2 


W  <  1 


P(x)  = 


0 


otherwise 


4 


\y\  <2 


P(y)  = 


o 


otherwise 


Hence, 
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(in  the  limit  as  A.v,  Ay  -*■  0), 


R  |  =  lim  —  log  A.v 


/?2  =  lim  —  log  Ay 


A.y->-0 


and 


R\  —  R~>  =  lim 

A.r.A.v->-0 


=  log 


(S) 


=  log  2  =  1  bit 


Thus,  R i ,  the  reference  entropy  of  x,  is  higher  than  the  reference  entropy  R2  for  y.  Hence, 
if  x  and  y  have  equal  absolute  entropies,  their  differential  (relative)  entropies  must  differ 
by  1  bit. 


Maximum  Entropy  for  a  Given  Mean  Square  Value  of  x 

For  discrete  random  variables,  we  observed  that  entropy  was  maximum  when  all  the  outcomes 
(messages)  were  equally  likely  (uniform  probability  distribution).  For  continuous  random 
variables,  there  also  exists  a  PDF  p(x )  that  maximizes  H(\)  in  Eqs.  (14.32).  In  the  case  of  a 
continuous  distribution,  however,  we  may  have  additional  constraints  on  x.  Either  the  maximum 
value  of  x  or  the  mean  square  value  of  x  may  be  given.  We  shall  find  here  the  PDF  p(x)  that 
will  yield  maximum  entropy  when  x2  is  given  to  be  a  constant  a2.  The  problem,  then,  is  to 
maximize  H{\ ): 


H(\)=  /  p(x)  log  dx 
J- 00  P(x) 


(14.33) 


with  the  constraints 


00 


p(x)  dx  =  1 


(14.34a) 


J  — OO 
'00 


(14.34b) 


To  solve  this  problem,  we  use  a  theorem  from  the  calculus  of  variation.  Given  the  integral  /, 


b 


I  = 


(14.35) 
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subject  to  the  following  constraints: 


<p\(x,p)dx  =  X\ 


<P2(x,p)dx  =  A.  2 


(pk(x,p)dx  =  Xk 


(14.36) 


where  Xj,  k2,  . .  • ,  X*  are  given  constants.  The  result  from  the  calculus  of  variation  states  that 
the  form  of  p(x)  that  maximizes  /  in  Eq.  (14.35)  with  the  constraints  in  Eq.  ( 14.36)  is  found 
from  the  solution  of  the  equation 


3 F  d<p\  d<p2  . 

—  +0f!—  +«2—  + 
dp  dp  dp 


+  a*  —  =  0 
dp 


(14.37) 


The  quantities  a,.  . . «*  are  adjustable  constants,  called  undetermined  multipliers. 

which  can  be  found  by  substituting  the  solution  of  p(x)  [obtained  from  Eq.  (14.37)]  in 
Eq.  (14.36).  In  the  present  case, 

1 

F(p,x)  —  p  log  ~ 


<p\(x,p)=p 
<p2(X,p)  =  x2p 


Hence,  the  solution  for  p  is  given  by 


d_ 

dp 


(plo4) 


d  2  n 
+  ai  +a2— x  p  =  0 
dp 


or 

-(1  +  log  p)  +  ai  +a2x2  =  0 


Solving  for  p ,  we  have 

p  =  *<«<-' 

Substituting  Eq.  ( 14.38)  into  Eq.  (14.34a).  we  ha.e 

1  =  f°°  ea'-'ea2x2dx 

J-OQ 

r°°  2 

=  2eofl'~1  /  **2X  dx 

-4m 


(14.38) 
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provided  a2  is  negative,  or 


Next  we  substitute  Eqs.  (14.38)  and  (14.39)  into  Eq.  (14.34b): 


1 

2c*2 


or 


and 


1 


a2  =  - 


2  a2 


e 


ori-l 


Substituting  Eqs.  (14.40)  into  Eq.  (14.38),  we  have 


P(x)  = 


1  e-*>/2o> 

(jy/2jr 


We  therefore  conclude  that  for  a  given  mean  square  value,  the  maximum  entropy  (or 
uncertainty)  is  obtained  when  the  distribution  of  x  is  Gaussian.  This  maximum  e 
uncertainty,  is  given  by 


Note  that 


Hence, 


H(\)=  f  p(x)  log2  -3—  dx 

J- 00  PW 


log  =  log  (•v/2jror2eJc2/2'T‘) 


1  x2 

=  -  log  (lira2)  +  ^2  l°ge 


H(x) 


=  J  p(x)  ^  log  (2 no2)  +  ^  log  dx 

1  ~  log  e  r°°  o 

=  -  log  (2na~)  J  p(x)  dx  +  J  x2p(x)  dx 


(14.39) 


(14.40a) 


(14.40b) 


(14.41) 


um  entropy  (or  maximum 
lis  maximum  entropy,  or 


(14.42a) 


2p( x)  dx 
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1  ,  log  e  j 

=  -  log  (27TOT2)  +  ° 

=  ^  log  (2rtea2)  ( 14.42b) 

=  ^  log  (17.1cr2)  (14.42c) 

To  reiterate,  for  a  given  mean  square  value  x2,  the  entropy  is  maximum  lor  a  Gaussian 
distribution,  and  the  corresponding  entropy  is  |  log  (Inea-). 

The  reader  can  similarly  show  (Prob.  14.5-4)  that  if  x  is  constrained  to  some  peak  value 
M  (_A/  <  x  <  M),  then  the  entropy  is  maximum  when  x  is  uniformly  distributed: 

— M  <  x  <  M 

otherwise 

Entropy  of  a  Band-Limited  White  Gaussian  Noise 

Consider  a  band-limited  white  Gaussian  noise  n(r)  with  power  spectral  density  (PSD)  A/72. 
Because 


p(x)  = 


1 

2M 

0 


Rn(r)  =  A fB  sine  (IttBt) 

we  know  that  sine  (2nBr)  is  zero  at  r  =  ±k/2B  (k  integer).  Therefore, 

Rn{h)=°  k  =  ±U±2'± 3’--- 


Hence, 


''"(4)  =  n(,>n('+^)_0  ‘_±l'±2"" 

/x  A  J.  (k  -  ±1  ±2,  . . .)  are  Nyquist  samples  of  n(f),  it  follows  that  all 
^afuLrelated.  Because  n,„  is  Gaussian.  uncormlatedness  imp.ies 
dependence.  Hence,  all  Nyquist  samples  of  n(l>  are  independent.  Note  that 

n2  =  R„(0)  =  MB 

ence  the  variance  of  each  Nyquist  sample  is  MB.  From  Eq.  ,  14.42b,  it  follows  that  the 
aropy  H( n)  of  each  Nyquist  sample  of  n(0  is 

W(n)  =  I  log  (2 neAfB)  bits  per  sample  (14.43a) 

cnecified  bv  2 B  Nyquist  samples  per  second,  the  entropy  per  sec- 
nTroW^heTmlopy  of 2  B  Nyquist  samples.  Because  all  the  samples  are  independent. 
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knowledge  of  one  sample  gives  no  information  about  any  other  sample.  Hence,  the  entropy  of 
2 B  Nyquist  samples  is  the  sum  of  the  entropies  of  the  2 B  samples,  and 

H\ n)  =  £log  (IneAfB)  bit/s  (14.43b) 

where  H'( n)  is  the  entropy  per  second  of  n(f). 

From  the  results  derived  thus  far,  we  can  draw  one  significant  conclusion.  Among  all 
signals  band-limited  to  B  Hz  and  constrained  to  have  a  certain  mean  square  value  a2,  the 
white  Gaussian  band-limited  signal  has  the  largest  entropy  per  second.  To  understand  the 
reason  for  this,  recall  that  for  a  given  mean  square  value,  Gaussian  samples  have  the  largest 
entropy;  moreover,  all  the  2 B  samples  of  a  Gaussian  band-limited  process  are  independent. 
Hence,  the  entropy  per  second  is  the  sum  of  the  entropies  of  all  the  2 B  samples.  In  pro¬ 
cesses  that  are  not  white,  the  Nyquist  samples  are  correlated,  and,  hence,  the  entropy  per 
second  is  less  than  the  sum  of  the  entropies  of  the  2 B  samples.  If  the  signal  is  not  Gaussian, 
then  its  samples  are  not  Gaussian,  and,  hence,  the  entropy  per  sample  is  also  less  than  the 
maximum  possible  entropy  for  a  given  mean  square  value.  To  reiterate,  for  a  class  of  band- 
limited  signals  constrained  to  a  certain  mean  square  value,  the  white  Gaussian  signal  has 
the  largest  entropy  per  second,  or  the  largest  amount  of  uncertainty.  This  is  also  the  reason 
why  white  Gaussian  noise  is  the  worst  possible  noise  in  terms  of  interference  with  signal 
transmission. 


Mutual  Information  /(x;  y) 

The  ultimate  test  of  any  concept  is  its  usefulness.  We  shall  now  show  that  the  relative  entropy 
defined  in  Eqs.  (14.32)  does  lead  to  meaningful  results  when  we  consider  /(x;y),  the  mutual 
information  of  continuous  random  variables  x  and  y.  We  wish  to  transmit  a  random  variable 
x  over  a  channel.  Each  value  of  x  in  a  given  continuous  range  is  now  a  message  that  may  be 
transmitted,  for  example,  as  a  pulse  of  height  x.  The  message  recovered  by  the  receiver  will 
be  a  continuous  random  variable  y.  If  the  channel  were  noise  free,  the  received  value  y  would 
uniquely  determine  the  transmitted  value  jc.  But  channel  noise  introduces  a  certain  uncertainty 
about  the  true  value  of  jc.  Consider  the  event  that  at  the  transmitter,  a  value  of  x  in  the  interval 
(jc,  x  +  Ajc)  has  been  transmitted  (A*  — ►  0).  The  probability  of  this  event  is  p(x) Ax  in  the 
limit  Ajc  — ►  0.  Hence,  the  amount  of  information  transmitted  is  log  [l/p(x)Ax].  Let  the  value 
of  y  at  the  receiver  be  y  and  let  p(x\y)  be  the  conditional  probability  density  of  x  when  y  =  >’■ 
Then  p(x\y)Ax  is  the  probability  that  x  will  lie  in  the  interval  (jc,  jc  +  Ajc)  when  y  =  y  (provided 
Ax  — ►  0).  Obviously,  there  is  an  uncertainty  about  the  event  that  x  lies  in  the  interval  U, 
jc  +  Ajc).  This  uncertainty,  log  [\/p(x\y) Ajc],  arises  because  of  channel  noise  and  therefore 
represents  a  loss  of  information.  Because  log  [  1  /p(x)  Ax]  is  the  information  transmitted  and 
log  [  1  /p(x\y)Ax]  is  the  information  lost  over  the  channel,  the  net  information  received  is  /(*»> ) 
given  by 


'(^)=log[^]-log[*^] 


=  log 


p(x\y) 

p(x) 


(14.44) 


Note  that  this  relation  is  true  in  the  limit  Ajc  -*■  0.  Therefore,  /(.r;y),  represents  the  infor¬ 
mation  transmitted  over  a  channel  if  we  receive  y  (y  =  y)  when  .v  is  transmitted  (x  =  *)•  f 
are  interested  in  finding  the  average  information  transmitted  over  a  channel  when  some  x  is 
transmitted  and  a  certain  y  is  received.  We  must  therefore  average  /(jc; y)  over  all  values  o 
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x  and  y.  The  average  information  transmitted  will  be  denoted  by  /( x;  y),  where 


/oo  poo 

/  p(x,y)I(x\y)  dx  dy 

-00  J —oo 


-00  J-OO 

,00  roo  p(jc|y) 


dx  dy 


Hence, 


p(x\y) 

~LL*'*'*n* 

/oo  po o  ]  f00 

/  d(jc,  y)  log - dxdy  +  /  /  p(*.  y )  log  pUly) 

-00  J —oc  P(*)  •'-00  •'-oo 

/oo  /-oo  |  /*°°  r°° 

/  p(Jc)p(yW  log  —  dx  dy  +  /  /  p(*. y)  log  P(*ly>  *  ^ 

-oo  J — 00  W  J-OO  J-OO 

/oo  1  POO  POO  POO 

n(jt)  log  — /  P(y w  <fy  +  /  /  P(*.y>  logptxly)  ^  dy 

-oo  p(*)  ./-oo  J-ooJ- 


(14.45a) 

(14.45b) 


-00  J-OO 


Note  that 


/.«  r00  .  1  j 

j  p(y\x)dy  =  1  and  J^p(x)\og—dx  = 


=  H(\) 


=  H(x) 


POO  POO 

/  /  p(x,  y)  log  p(x\y)  dxdy 

J-oo  J-oo 

(14.46a) 

POO  poo  1 

LLAy)lo|p(r»"’ 

(14.46b) 

The  integral  on  the  right-hand  side  is  the  average  over  x  and  y  of  log  [l/p(*|y)].  But 
i  1 1  tnfx lv)l  represents  the  uncertainty  about  *  when  y  is  received.  This,  as  we  have  seen 
log  U/pWv)]  P  channel.  The  average  of  log  [  1  /p(*|y)]  is  the  average  loss  of 

r„Sr“  ImeTis  —  and  so.e  ,  is  received.  1*,  by  dehnition. ,  «<*ly>. 

the  conditional  (differential)  entropy  of  x  given  y. 


/OO  poo  1 

.oJV0‘-y)'OgPM'> 


dx  dy 


(14.47) 


Hence, 


/(X;y)  =  tf(x)-W(x|y) 


(14.48) 


i  of  x  is  transmitted  and  some  value  of  y  is  received,  the  average 
Thus,  when  some  value  of  x  .  (14.48).  We  can  define  the 

of  information  that  can  be  transmitted,  on  the 

average,  per  sample  or  per  value  transmitted. 


Cs  =  max  /(x;  y) 


(14.49) 
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For  a  given  channel,  /(x;  y)  is  a  function  of  the  input  probability  density  p(x)  alone.  This  can 
be  shown  as  follows: 


p(x,y)  =  p(x)p(y\x) 
p(x\y)  _  p(.v|.v) 
p(x)  p(y) 

_  piy\x) 

~  I-ooP^y^ 

=  r(y\x) 

f-ooP(x)p(y\x)dx 

Substituting  Eqs.  (14.50)  and  (14.51)  into  Eq.  (14.45b),  we  obtain 


(14.50) 


(14.51) 


/(x;y) 


/OO  rOO  / 

/  p(x)p(y\x)  log  I 
-oo  J — oo  \ 


p(y\x) 


\f—ooP(x)p(y\x)  dX  j 


dx  dy 


(14.52) 


The  conditional  probability  density  p(y\x)  is  characteristic  of  a  given  channel.  Hence,  for  a 
given  channel  specified  by  p(y\x),  /(x;y)  is  a  function  of  the  input  probability  density  p(x) 
alone.  Thus, 


Cs  =  max  /(x;y) 
p(x) 

If  the  channel  allows  the  transmission  of  K  values  per  second,  then  C,  the  channel  capacity 
per  second,  is  given  by 


C  =  KCS  bit/s  (14.53) 

Just  as  in  the  case  of  discrete  variables,  /(x;y)  is  symmetrical  with  respect  to  x  and  y  tor 
continuous  random  variables.  This  can  be  seen  by  rewriting  Eq.  (14.45b)  as 

/(x;y)=  f  f  p(x,y )  log  —  ^  -  dxdy  (14.54) 

J — oo  J- oo  p(x)p(y) 

This  equation  shows  that  /(x;y)  is  symmetrical  with  respect  to  x  and  y.  Hence, 

/(x;y)  =  /( y;x) 


From  Eq.  (14.48)  it  now  follows  that 

/(x;y)  =  H(x)  -  H(x\y)  =  H( y)  -  //(y|x)  (14.55) 

Capacity  of  a  Band-Limited  AWGN  Channel 

The  channel  capacity  C  is,  by  definition,  the  maximum  rate  of  information  transmission  over 
a  channel.  The  mutual  information  /(x;y)  is  given  by  Eq.  (14.55): 

/(x;y)  =  //(y)-//(y|x)  <l4’56) 

The  channel  capacity  C  is  the  maximum  value  of  the  mutual  information  /(x;  y)  per  second. 
Let  us  first  find  the  maximum  value  of  /(x;y)  per  sample.  We  shall  find  here  the  capacity 
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of  a  channel  band-limited  to  B  Hz  and  disturbed  by  a  white  Gaussian  noise  of  PSD  M /2.  In 


addition,  we  shall  constrain  the  signal  power  (or  its  mean  square  value)  to  S.  The  disturbance 
is  assumed  to  be  additive;  that  is,  the  received  signal  y(/)  is  given  by 


(14.57) 


y(t)  =  x(t)  +  n(/) 


Because  the  channel  is  band-limited,  both  the  signal  x(t)  and  the  noise  n(/)  are  band-limited 
to  B  Hz.  Obviously,  y(f)  is  also  band-limited  to  B  Hz.  All  these  signals  can  therefore  be 
completely  specified  by  samples  taken  at  the  uniform  rate  of  2 B  samples  per  second.  Let  us 
find  the  maximum  information  that  can  be  transmitted  per  sample.  Let  x,  n,  and  y  represent 
samples  of  x(f),  n(r),  and  y(f),  respectively.  The  information  /(x;  y)  transmitted  per  sample  is 
given  by  Eq.  (14.56): 

/(x;y)  =  H(y)-H(y\x) 

We  shall  now  find  H( y|x).  By  definition  [Eq.  (14.47)], 


jix)dx  LpW‘"ogM) dy 


Because 


y  =  x  +  n 


for  a  given  *,  v  is  equal  to  n  plus  a  constant  (x).  Hence,  the  distribution  of  y  when  x  has  a  given 


value  is  identical  to  that  of  n  except  for  a  translation  by  *.  If  />„(•)  represents  the  PDF  of  noise 


sample  n,  then 


p(y\x)  =  pn(y  ~  *) 


(14.58) 


Letting  y  —  x  =  z,  we  have 


The  right-hand  side  is  the  entropy  H( n)  of  the  noise  sample  n.  Hence 


J  —no 


00 


=  H(  n) 


(14.59) 


made  no  assumptions  about  the  noise.  Hence,  Eq.  (14.59)  is  very 


in  an  additive  fashion.  Thus, 

/(x;y)  =  //(y)-//(n)  bits  per  sample 


(14.60) 
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We  have  assumed  that  the  mean  square  value  of  the  signal  \(t)  is  constrained  to  have  a  value 
S,  and  the  mean  square  value  of  the  noise  is  TV.  We  shall  also  assume  that  the  signal  \{t)  and 
the  noise  n(f )  are  independent.  In  such  a  case,  the  mean  square  value  of  y  will  be  the  sum  of 
the  mean  square  values  of  x  and  n.  Hence, 

y i  =  S  +  N 

For  a  given  noise  [given  H( n)],  /(x;y)  is  maximum  when  H( y)  is  maximum.  We  have  seen 
that  for  a  given  mean  square  value  of  y  (y2  =  5  +  N),  H( y)  will  be  maximum  if  y  is  Gaussian, 
and  the  maximum  entropy  Hmdx(y)  is  then  given  by 

/W(y)  =  \  log  [2ne(S  +  N)]  (14.61) 


Because 


y  =  x  +  n 


and  n  is  Gaussian,  y  will  be  Gaussian  only  if  x  is  Gaussian.  As  the  mean  square  value  of  x  is 
5,  this  implies  that 


p(x)  = 


y/2nS 


and 


Anax(*9  y)  —  ^max(y)  H  (n) 


=  l-log[2ne(S+N)]  -//( n) 
For  a  white  Gaussian  noise  with  mean  square  value  N, 


H{ n)  =  ~  log  2 neN  N  =  A fB 

Cs  —  Anax(x,y)  —  2  1°8  ^ ^ 

(14.62a) 

=il06(l+£) 

(14.62b) 

The  channel  capacity  per  second  will  be  the  maximum  information  that  can  be  transmitted 
per  second.  Equations  (14.62)  represent  the  maximum  information  transmitted  per  sample. 
If  all  the  samples  are  statistically  independent,  the  total  information  transmitted  per  second 
will  be  2 B  times  Cs.  If  the  samples  are  not  independent,  then  the  total  information  will  be 
less  than  2 BCS.  Because  the  channel  capacity  C  represents  the  maximum  possible  information 
transmitted  per  second. 


c=“[Mi+£) 

=  Blog(l  +  |) 


bit/s 


(14.63) 
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The  samples  of  a  band-limited  Gaussian  signal  are  independent  if  and  only  if  the  signal 
power  spectral  density  (PSD)  is  uniform  over  the  band  (Example  9.2  and  Prob.  9.2-4).  Obvi¬ 
ously,  to  transmit  information  at  the  maximum  rate  [Eq.  ( 14.63)J,  the  PSD  of  signal  y(/)  must 
be  uniform.  The  PSD  of  y  is  given  by 


Sy(f)  =  Sx(f)  +  Sn(f) 


Because  Sn(f)  =  J\T / 2,  the  PSD  of  \(t)  must  also  be  uniform.  Thus,  the  maximum  rate  of 
transmission  (C  bit/s)  is  attained  when  x(/)  is  also  a  white  Gaussian  signal. 

To  recapitulate,  when  the  channel  noise  is  additive,  white,  and  Gaussian  with  mean  square 
value  N  (N  =  NB),  the  channel  capacity  C  of  a  band-limited  channel  under  the  constraint  of 
a  given  signal  power  S  is  given  by 


bit/s 


where  B  is  the  channel  bandwidth  in  hertz.  The  maximum  rate  of  transmission  (C  bit/s)  can 
be  realized  only  if  the  input  signal  is  a  white  Gaussian  signal. 

Capacity  of  a  Channel  of  Infinite  Bandwidth 

Superficially,  Eq.  (14.63)  seems  to  indicate  that  the  channel  capacity  goes  to  oo  as  the  channel’s 
bandwidth  £  goes  to  oo.  This,  however,  is  not  true.  For  white  noise,  the  noise  power  N  =  A fB. 
Hence,  as  B  increases,  N  also  increases.  It  can  be  shown  that  in  the  limit  as  B  -*•  oo,  C 
approaches  a  limit: 


This  limit  can  be  found  by  noting  that 


Hence, 


(14.64) 


Thus,  foi 
as  B  -*■ 
be  made 

powers,  the  channel  capacity  always  rer 


1C 

„  the  channel  capacity  always  remains  nnite. 
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Figure  14.5 

Channel  cap¬ 
acity  vs. 

bandwidth  for  a 
channel  with 
white  Gaussian 
noise  and  fixed 
signal  power. 


t 


Figure  14.6 

(a)  Signal  space 
representation  of 
transmitted  and 
received  signals 
and  noise  signal. 

(b)  Choice  of 
signals  for 
error-free 
communication. 


(a) 


(b) 


Verification  of  Error-Free  Communication  over 
a  Continuous  Channel 

Using  the  concepts  of  information  theory,  we  have  shown  that  it  is  possible  to  transmit  error- 
free  information  at  a  rate  of  B  log2  (1  +  S/N)  bit/s  over  a  channel  band-limited  to  B  Hz.  The 
signal  power  is  S,  and  the  channel  noise  is  white  Gaussian  with  power  N.  This  theorem  can  be 
verified  in  a  way  similar  to  that  used  for  the  verification  of  the  channel  capacity  of  a  discrete 
case.  This  verification  using  signal  space  is  so  general  that  it  is  in  reality  an  alternate  proof  ot 
the  capacity  theorem. 

Let  us  consider  M -ary  communication  with  M  equiprobable  messages  m\,  m2,  —  mM 
transmitted  by  signals  si  (/),  52(f),  sM  ( t ).  All  signals  are  time-limited  with  duration  T  and 

have  an  essential  bandwidth  B  Hz.  Their  powers  are  less  than  or  equal  to  S.  The  channel  is 
band-limited  to  B,  and  the  channel  noise  is  white  Gaussian  with  power  N. 

All  the  signals  and  noise  waveforms  have  2 BT  +  1  dimensions.  In  the  limit  we  shall  let 
T  — >  00.  Hence  2 BT  1,  and  the  number  of  dimensions  will  be  taken  as  2BT  in  our  future 
discussion.  Because  the  noise  power  is  N,  the  energy  of  the  noise  waveform  of  T-second 
duration  is  NT.  Given  signal  power  S,  the  maximum  signal  energy  is  ST.  Because  signals  and 
noise  are  independent,  the  maximum  received  energy  is  ( S  +  N)T.  Hence,  all  the  receive 
signals  will  lie  in  a  2Z?7’-dimensional  hypersphere  of  radius  V(5  +  N)T  (Fig.  14.6a).  A  typical 
received  signal  s,(f)  +  n(t)  has  an  energy  (5,  +  N)T,  and  the  point  r  representing  this  signa 
lies  at  a  distance  of  V(S; +  N)T  from  the  origin  (Fig.  14.6a).  The  signal  vectors,,  the  noise 
vector  n,  and  the  received  vector  r  are  shown  in  Fig.  14.6a.  Because 


|s/l  =  y/SJ,  |/j|  =  s/nT, 


|r|  =  y/(Si  +  N)T 


(14.65) 
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it  follows  that  vectors  s;,  /i,  and  r  form  a  right  triangle.  Also,  n  lies  on  the  sphere  of  radius 


\//V7\  centered  at  s,-.  Note  that  because  n  is  random,  it  can  lie  anywhere  on  the  sphere  centered 
at  Si* 


We  have  M  possible  transmitted  vectors  located  inside  the  big  sphere.  For  each  possible 


spheres,  the  center  of  that  sphere  is  the  transmitted  waveform.  If  we  pack  the  big  sphere  with 
M  nonoverlapping  and  nontouching  spheres,  each  of  radius  \J~NT  (Fig.  14.6b),  and  use  the 
centers  of  these  M  spheres  for  the  transmitted  waveforms,  we  will  be  able  to  detect  all  these 
M  waveforms  correctly  at  the  receiver  simply  by  using  the  maximum  likelihood  receiver. 
The  maximum  likelihood  receiver  looks  at  the  received  signal  point  r  and  decides  that  the 
transmitted  signal  is  that  one  of  the  M  possible  transmitted  points  that  is  closest  to  r  (smallest 
error  vector).  Every  received  point  r  will  lie  on  the  surface  of  one  of  the  M  nonoverlapping 
spheres,  and  using  the  maximum  likelihood  criterion,  the  transmitted  signal  will  be  chosen 
correctly  as  the  point  lying  at  the  center  of  the  sphere  on  which  r  lies. 

Hence,  our  task  is  to  find  out  how  many  such  nonoverlapping  small  spheres  can  be  packed 
into  the  big  sphere.  To  compute  this  number,  we  must  determine  the  volume  of  a  sphere  of  D 
dimensions. 


Volume  of  a  D-Dimensional  Sphere 

A  D-dimensional  sphere  is  described  by  the  equation 


x ^  4-  x\  +  •  •  ■  +  Xp  —  R~ 


where  R  is  the  radius  of  the  sphere.  We  can  show  that  the  volume  V  (R)  of  a  sphere  of  radius 
R  is  given  by 


V(R)=RnV(  1) 


(14.66) 


where  V(1 )  is  the  volume  of  a  D-dimensional  sphere  of  unit  radius  and.  thus,  is  constant.  To 
prove  this,  we  have  by  definition 


dx  i  dx 2  ■  •  dxo 

Jt5+JC2  +-+xD-R2 

Letting  y/  =  Xj/R,  we  have 


V(R)  =  R°  

y\+yl+-+y2D<\ 

=  RdV(\) 

Hence,  the  ratio  of  the  volumes  of  two  spheres  of  radii  R  and  R  is 


ViR)  = 
V(R) 


D 


noise  power,  the  energy  over  an  interval  T  is  NT  +  (.  where  < 


0  as  T  -►  oc.  Hence. 


*  Because  N  is  the  average  noise  power, 
we  can  assume  that  n  lies  on  the  sphere. 
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As  direct  consequence  of  this  result,  when  D  is  large,  almost  all  of  the  volume  of  the  sphere 
is  concentrated  at  the  surface.  This  is  because  if  R/R  <  1,  then  (R/R)l)  ->  0  as  D  oo. 
This  ratio  approaches  zero  even  if  R  differs  from  R  by  a  very  smallamount  A  (Fig.  14.7).  This 
means  that  no  matter  how  small  A  is,  the  volume  within  radius  R  is  a  negligible  fraction  of 
the  total  volume  within  radius  R  if  D  is  large  enough.  Hence,  for  a  large  D,  almost  all  of  the 
volume  of  a  D-dimensional  sphere  is  concentrated  at  the  surface.  Such  a  result  sounds  strange, 
but  a  little  reflection  will  show  that  it  is  reasonable.  This  is  because  the  volume  is  proportional 
to  the  Dth  power  of  the  radius.  Thus,  for  large  D,  a  small  increase  in  R  can  increase  the  volume 
tremendously,  and  all  the  increase  comes  from  a  tiny  increase  in  R  near  the  surface  of  the 
sphere.  This  means  that  most  of  the  volume  must  be  concentrated  at  the  surface. 

The  number  of  nonoverlapping  spheres  of  radius  <JNT  that  can  be  packed  into  a  sphere 
of  radius  y/(S  +  AO 7  is  bounded  by  the  ratio  of  the  volume  of  the  signal  sphere  to  the  volume 
of  the  noise  sphere.  Hence, 


[V(S  +  Apr]2*7  v(i) 

(Vnt) 


(14.67) 


Each  of  the  M -ary  signals  carries  the  information  of  log2  M  binary  digits.  Hence,  the  trans¬ 
mission  of  one  of  the  M  signals  every  T  seconds  is  equivalent  to  the  information  rate  C 
given  by 


c  =  -  fli°g  (i  +  j}')  bit/s 


(14.68) 


This  equation  gives  the  upper  limit  of  C. 

To  show  that  we  can  actually  receive  error-free  information  at  a  rate  of  B  log  (1  +  5/^)- 
we  use  the  argument  proposed  by  Shannon.8  Instead  of  choosing  the  M  transmitted  messages 
at  the  centers  of  nonoverlapping  spheres  (Fig.  14.6b),  Shannon  proposed  selecting  the  - 
points  randomly  located  in  the  signal  sphere  Is  of  radius  \/ST  (Fig.  14.8).  Consider  one 
particular  transmitted  signal  s*.  Because  the  signal  energy  is  assumed  to  be  <  5,  point  s* 
will  lie  somewhere  inside  the  signal  sphere  ls  of  radius  s/sf.  Because  all  the  M  signals  are 
picked  randomly  from  this  sphere,  the  probability  of  finding  a  signal  within  a  volume  A  1S 
min(  1,  M  A  V/Vs),  where  Vv  is  the  volume  of  Is.  But  because  for  large  D  all  of  the 
of  the  sphere  is  concentrated  at  the  surface,  all  M  signal  points  selected  randomly  would  >e 
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Figure  14.8 

Derivation  of 

channel 

capacity. 


near  the  surface  of  Is.  Figure  14.8  shows  the  transmitted  signal  sk,  the  received  signal  r ,  and 
the  noise  n.  We  draw  a  sphere  of  radius  JNT  with  r  as  the  center.  This  sphere  intersects  the 
sphere  Is  and  forms  a  common  lens-shaped  region.  The  signal  sk  lies  on  the  surface  of  both 
spheres.  We  shall  use  a  maximum  likelihood  receiver.  This  means  that  when  r  is  received, 
we  shall  make  the  decision  that  “s*  was  transmitted,’'  provided  none  of  the  remaining  M  -  1 
signal  points  are  closer  to  r  than  sk.  The  probability  of  finding  any  one  signal  in  the  lens  is 
Viens/V,.  Hence  Pe,  the  error  probability  in  the  detection  of  sk  when  r  is  received,  is 


From  Fig.  14.8,  we  observe  that  V,ens  <  V(h),  where  V(h)  is  the  volume  of  the  D- 
dimensional  sphere  of  radius  h.  Because  r.  sk,  and  n  form  a  right  triangle. 


Figure  14.8 

Derivation  of 

channel 

capacity. 


Hence, 


Also, 


Vs  =  (ST)BrV(  1) 


and 


If  we  choose 
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then 


Pe  <  [k]BT 


If  we  let  k  =  1  -  A,  where  A  is  a  positive  number  chosen  as  small  as  we  wish,  then 


BT 


0  as 


oc 


This  means  that  Pe  can  be  made  arbitrarily  small  by  increasing  7\  provided  M  is  chosen 
arbitrarily  close  to  (1  +  S/N)BT .  Thus, 


C  =  ~  log2  M 


(14.69) 


bit/s 


where  6  is  a  positive  number  chosen  as  small  as  we  please.  This  leads  to  k  =  2  (I  and  proves 


the  desired  result.  A  more  rigorous  derivation  of  this  result  can  be  found  in  Wozencraft  and 
Jacobs.9 

Because  the  M  signals  are  selected  randomly  from  the  signal  space,  they  tend  to  acquire 
the  statistics  of  white  noise8  (i.e.,  a  white  Gaussian  random  process). 

Comments  on  Channel  Capacity 

According  to  the  result  derived  in  this  chapter,  theoretically  we  can  communicate  error-tree 
up  to  C  bit/s.  There  are,  however,  practical  difficulties  in  achieving  this  rate.  In  proving  the 
capacity  formula,  we  assumed  that  communication  is  effected  by  signals  of  duration  T.  This 
means  we  must  wait  T  seconds  to  accumulate  the  input  data  and  then  encode  it  by  one  of  the 
waveforms  of  duration  T.  Because  the  capacity  rate  is  achieved  only  in  the  limit  as  T  ->  oo, 
we  have  a  long  wait  at  the  receiver  to  get  the  information.  Moreover,  because  the  number  of 
possible  messages  that  can  be  transmitted  over  interval  T  increases  exponentially  with  7,  the 
transmitter  and  receiver  structures  increase  in  complexity  beyond  imagination  as  T  — >  oo. 

The  channel  capacity  indicated  by  Shannon’s  equation  [Eq.  ( 14.69)]  is  the  maximum  error- 
free  communication  rate  achievable  on  an  optimum  system  without  any  restrictions  (except 
for  bandwidth  /?,  signal  power  5,  and  Gaussian  white  channel  noise  power  N).  If  we  have 
any  other  restrictions,  this  maximum  rate  will  not  be  achieved.  For  example,  if  we  consider 
a  binary  channel  (a  channel  restricted  to  transmit  only  binary  signals),  we  will  not  be  able 
to  attain  Shannon’s  rate,  even  if  the  channel  is  optimum.  In  Sec.  14.9,  MATLAB  Computer 
Exercise  14.2  supplies  numerical  confirmation.  The  channel  capacity  formula  [Eq.  (14.63)1 
indicates  that  the  transmission  rate  is  a  monotonically  increasing  function  of  the  signal  power  5. 
If  we  use  a  binary  channel,  however,  we  know  that  increasing  the  transmitted  power  beyon 
a  certain  point  buys  very  little  advantage  (Fig.  10. 15).  Hence,  on  a  binary  channel,  increasing 
5  will  not  increase  the  error-free  communication  rate  beyond  some  value.  This  does  not  mean 
that  the  channel  capacity  formula  has  failed.  It  simply  means  that  when  we  have  a  large 
amount  of  power  (with  a  finite  bandwidth)  available,  the  binary  scheme  is  not  the  optimum 
communication  scheme. 

One  last  comment:  Shannon’s  results  tell  us  the  upper  theoretical  limit  of  error-tree  com 
munication.  But  they  do  not  tell  us  precisely  how  this  can  be  achieved.  To  quote  the  words  o 
Abramson,  written  in  1963;  ‘"[This  is  one  of  the  problems]  which  has  persisted  to  mock  intor 
mation  theorists  since  Shannon’s  original  paper  in  1948.  Despite  an  enormous  amount  of  et  o 
spent  since  that  time  in  quest  of  this  Holy  Grail  of  information  theory,  a  deterministic  met 
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of  generating  the  codes  promised  by  Shannon  is  still  to  be  found.”4  Amazingly,  30  years  later, 
the  introduction  of  turbo  codes  and  the  rediscovery  of  the  low-density  parity  check  (LDPC) 


Figure  14.9 

Ideal  exchange 
between  SNR 
and  bandwidth. 


codes  would  completely  alter  the  landscape.  We  shall  introduce  these  codes  in  Chapter  15. 


14.6  PRACTICAL  COMMUNICATION  SYSTEMS  IN 
LIGHT  OF  SHANNON'S  EQUATION 


It  would  be  instructive  to  determine  the  ideal  law  for  the  exchange  between  the  SNR  and 
the  transmission  bandwidth  by  using  the  channel  capacity  equation.  Consider  a  message  of 
bandwidth  B  that  is  used  for  modulation  (or  coding),  with  the  resulting  modulated  signal  of 
bandwidth  Br.  This  signal  is  received  at  the  input  of  an  ideal  demodulator  with  signal  and  noise 
powers  of  S,  and  N,-,  respectively*  (Fig.  14.9).  The  demodulator  output  bandwidth  isfi,  and  the 
SNR  is  S0/N0.  Because  an  SNR  S/N  and  a  bandwidth  B  can  transmit  ideally  B  log  ( 1  +S/N) 
bits  of  information,  the  ideal  information  rates  of  the  signals  at  the  input  and  the  output  of  the 
demodulator  are  BT  log  ( 1  +  St/Nt)  bits  and  B  log  <  1  +  S„/N„)  bits,  respectively.  Because  the 
demodulator  neither  creates  nor  destroys  information,  the  two  rates  should  be  equal,  that  is, 


Figure  14.9 

Ideal  exchange 
between  SNR 
and  bandwidth. 


Bj  log 


and 


(14.70a) 


In  practice,  for  the  majority  of  systems,  S0/N0  as  well  as  Si/Nt  »  1  -  and 


(14.70b) 


Also, 


-  s‘ 
N,  MBt 


Si 

Y  MB 


*  An  additive 


white  Gaussian  channel  noise  is  assumed. 
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Figure  14.10 

Ideal  behavior  of 
SNR  vs.  y  for 
various  ratios  of 
Bt  to  B. 


Hence,  Eqs.  (14.70)  become 


(14.71a) 

(14.71b) 


Equations  (14.70)  and  (14.71)  give  the  ideal  law  of  exchange  between  the  SNR  and  the  band¬ 
width.  The  output  SNR  S0/N()  is  plotted  in  Fig.  14. 10  as  a  function  of  y  for  various  values  ot 

Bt/b- 

The  output  SNR  increases  exponentially  with  the  bandwidth  expansion  factor  Bt/B •  Thls 
means  that  to  maintain  a  given  output  SNR,  the  transmitted  signal  power  can  be  reduced 
exponentially  with  the  bandwidth  expansion  factor.  Thus,  for  a  small  increase  in  bandwidth, 
we  can  cut  the  transmitted  power  considerably.  On  the  other  hand,  for  a  small  reduction  in 
bandwidth,  we  need  to  increase  the  transmitted  power  considerably. 

Let  us  now  investigate  various  systems  studied  thus  far  and  see  how  they  fare  in  comparison 
to  the  ideal  system. 


AM 

For  baseband  and  SSB-SC  systems,  Bj/B  =  1,  and  Eqs.  (14.71)  yield 


5, 

No  V 


(14.72) 


which  agrees  exactly  with  the  performance  of  the  SSB-SC  system  [Eqs.  (9.52)  and  (10.8)1- 
For  DSB-SC,  Bj/B  =  2,  and  Eqs.  (14.71)  predict 


(14.73) 

N0  4 


Thus  DSB-SC,  for  which  S0/N0  =  y,  falls  short  of  ideal  performance.  If.  however,  we  conf*  ^ 
that  quadrature  multiplexing  can  be  used  to  transmit  two  DSB  signals  simultaneously,  wit 
effective  bandwidth  per  signal  as  B  rather  than  2 B.  DSB-SC  has  ideal  performance.  Becaus 
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Figure  14.11 

Comparison  of 
ideal  behavior 
and  FM  system 
behavior. 


Ideal 


for  AM  quadrature  multiplexing  is  not  used,*  AM  performance  |Eq.  ( 10. 10)1  lads  considerably 
short  of  the  ideal  performance  [Eq.  (14.73)]. 

The  ideal  performance  of  the  baseband  and  SSB-SC  or  DSB-SC  systems  is  really  an  empty 
boast,  because  these  systems  do  not  exchange  SNR  tor  bandwidth. 


FM 

For  FM.  we  have  [Eq.  (10.32b)] 


Because  Bt /B  =  2(/J  +  1)> 


(14.74) 


„.  14  11  nlots  S  /N  for  Bt/B  =  8  and  20,  assuming  mj/m2  =  2.  The  ideal  S„/N„  plots 

fof  7/B  =  8  and  20  are  also  shown  for  comparison.  FM  performance  falls  far  below  ideal 
°  .  7  /  c  if  „  „a:n  of  13  dB  resulting  from  preemphasis  and  deemphasis  is  added  to 
£■ “fS  i  s  s veS  infenor  mthc  ideal  ca.es.  The  comparison  he,ween  FM 

these  plots,  FM  sively  worse  as  y  increases.  At  the  threshold,  however,  the 

and  the  'dealsysK  E  P  a  „  can  be  shown  that  for  FM.  when  optimum  demodulation 
££^1^^* ofhe  SNR  a,  the  threshold  „he  light  dashed  line* 

in  Fig.  14.11)  is  closer  to  ideal.10 


- -  •  ~ha«.  reference  at  the  receiver,  it  defeats  the  purpose  of  AM. 

*  Because  this  requires  a  phase  re  frequency  discriminator.  For  optimum  demodulation  by 
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PCM 

As  seen  earlier.  M- ary  PCM  shows  a  saturation  effect  unless  we  go  to  higher  values  of  M  as 
y  increases.  If  the  message  signal  is  quantized  in  L  levels,  then  each  sample  can  be  encoded 
by  logM  L  number  of  M  -ary  pulses.  If  B  is  the  bandwidth  of  the  message  signal,  we  need  to 
transmit  2 B  samples  per  second.  Consequently,  R.u .  the  number  of  M  -ary  pulses  per  second,  is 

Rm  =  2Blogw  L 

Also,  the  transmission  bandwidth  Bj  is  half  the  number  of  (M -ary)  pulses  per  second.  Hence, 

Bj  =  =  B  \ogM  L  (14.75a) 

From  Eq.  ( 1 1 .98a),  the  power  5,  is  found  as 

Si  =  '^±EpRM  (14.75b) 

Also, 

Nj  =  XBt  =  ( 1 4-76) 

Each  of  the  M  -ary  pulses  carries  the  information  of  log2  M  bits,  and  we  are  transmitting 
2BlogM  L  number  of  M  -ary  pulses  per  second.  Hence,  we  are  transmitting  information  at  a 
rate  of  R /;  bits,  where 


Rb  =  (2 B  logw  L)(log2  M ) 

=  2 Bj  log2  M 
=  Br  log2  M 2  bit/s 

Substitution  of  Eqs.  (14.75b)  and  (14.76)  into  this  equation  yields 

Rb  =  Bj  log2  ^1  +  ^  bit/s  (14.77) 

We  are  transmitting  the  information  equivalent  of  Rb  binary  digits  per  second  over  the  AT -ary 
PCM  channel.  The  reception  is  not  error  free,  however.  The  pulses  are  detected  with  an  error 
probability  PeM  given  in  Eq.  (1 1 ,99c).  If  PeM  is  on  the  order  of  10-6,  we  could  consider  the 
reception  to  be  essentially  error  free.  From  Eq.  ( 1 1 .99c), 

PeM  ~  2  Q  =  10-6  A 1  »  1 

This  gives 

2 Ep 

AT  ~ 


24 
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Substitution  of  this  value  in  Eq.  (14.77)  gives 

R„  =  Bt\  ogz(l  +  ^)  bit/s  <14-78) 

Thus,  over  a  channel  of  bandwidth  Bj  with  an  SNR  of  Sj/Ni ,  a  PCM  system  can  transmit 
information  at  a  rate  of  Rb  in  Eq.  (14.78).  The  ideal  channel  with  bandwidth  Bj  and  SNR 
Si/Ni  transmits  information  at  a  rate  of  C  bit/s,  where 

C  =  Bt  log2  bit/s  (14.79) 

It  follows  that  PCM  uses  roughly  eight  times  (9  dB)  as  much  power  as  the  ideal  system.  This 
performance  is  still  much  superior  to  that  of  FM.  Figure  14.12  shows  Rb/Br  as  a  function  of 
Si/Nj.  For  the  ideal  system,  we  have 

^  ^  /+M  (14.80) 

Bt  Bt  2V  NiJ 

PCM  at  the  threshold  is  9  dB  inferior  to  the  ideal  curve. 

When  PCM  is  in  saturation,  the  detection  error  probability  approaches  0.  Each  M  -ary 
pulse  transmits  log2  M  bits,  and  there  are  2 BT  pulses  per  second.  Hence, 


Rb  =  2 Bt  log2  M 


(14.81) 


or 


— -  =  2  log2  M 

DJ 


This  is  clearly  seen  in  Fig.  14.12  (solid  horizontal  lines). 


Figure  14.12 

Comparison  of 
ideal  system 
behavior  to  that 
of  PCM. 
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(14.82) 
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Orthogonal  Signaling 

We  have  already  shown  that  [Eq.  (11.122)]  for  M  -ary  orthogonal  signaling,  the  error-free 
communication  rate  is 


Rh  <  1.44  Yr  bit/s 


(14.83) 


A 


We  showed  in  Eq.  (14.64)  that  this  is  precisely  the  rate  of  error-free  communication  over 
an  ideal  channel  with  infinite  bandwidth.  Therefore,  as  M  -*■  oo,  the  bandwidth  of  an  M-ary 
scheme  also  approaches  infinity,  and  its  rate  of  communication  approaches  that  of  an  ideal 
channel. 


1 4.7  FREQUENCY-SELECTIVE  CHANNEL  CAPACITY 


Thus  far,  we  have  limited  the  discussion  of  capacity  to  distortionless  channels  of  finite  band¬ 
width  under  white  Gaussian  noise.  Such  a  channel  model  is  suitable  for  application  when 
channels  are  either  flat  or  flat  fading.  In  reality,  we  often  face  many  types  of  complex  channels. 
In  particular,  we  have  shown,  in  Chapter  13,  that  most  wireless  communication  channels  in 
the  presence  of  significant  multipath  tend  to  be  frequency-selective  channels.  We  now  take  a 
look  at  the  capacity  of  frequency-selective  channels  that  do  not  exhibit  a  distortionless  (flat) 
spectrum. 

First,  consider  a  band-limited  AWGN  channel  whose  random  output  is 


y  =  H  •  x  +  n 


This  channel  has  a  constant  gain  of  H  across  the  bandwidth.  Based  on  Eq.  (14.63)  this  band- 
limited  (low-pass)  AWGN  channel  with  bandwidth  B  has  capacity 


(14.84) 


bit/s 


in  which  S  and  N  are  the  signal  power  and  the  noise  power,  respectively.  Furthermore,  in 
Chapter  4  and  Chapter  9,  we  have  demonstrated  the  equivalence  of  baseband  and  passband 
channels  through  modulation.  Therefore,  given  the  same  noise  spectrum  and  bandwidth. 
AWGN  low-pass,  band-limited  channels  and  AWGN  bandpass  channels  possess  identical 
channel  capacity.  We  are  now  ready  to  describe  the  capacity  of  frequency-selective  channels. 

Consider  a  bandpass  channel  of  infinitesimal  bandwidth  A /  centered  at  a  frequency/- 
Within  this  small  band,  the  channel  gain  is  //(/}),  the  signal  power  spectral  density  (PSD)  is 
5X(/1),  and  the  Gaussian  noise  PSD  is  5n(/}).  Since  this  small  bandwidth  is  basically  a  band 
limited  AWGN  channel,  according  to  Eq.  (14.63),  its  capacity  is 


C(A)  =  A/  •  log  [l  +  |//(/i)|2M^l 
L  Sn(fi)Af  J 


bit/s 


(14.85) 


This  means  that  we  can  divide  a  frequency-selective  channel  H(f)  into  small  disjoint 
AWGN  bandpass  channels  of  bandwidth  A/.  Thus,  the  sum  channel  capacity  is  simply 


channel  capacity  is  simply 
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approximated  by 


In  fact,  the  practical  OFDM  (or  DMT)  system  discussed  in  Chapter  13  is  precisely  such  a 
system,  which  consists  of  a  bank  of  parallel  flat  channels  with  different  gains.  This  capacity 
is  an  approximation  because  the  channel  response,  the  signal  PSD,  or  the  noise  PSD,  may 
not  be  constant  over  a  nonzero  A/.  By  taking  A /  — ►  0,  we  can  determine  the  total  channel 
capacity  as 


(14.86) 


Maximum  Capacity  Power  Loading 

In  Eq.  (14.86),  we  have  established  that  the  capacity  of  a  frequency-selective  channel  with 
response  H{f)  under  colored  Gaussian  noise  of  power  spectral  density  (PSD)  Sn(f)  depends 
on  the  input  PSD  Sx(f).  For  the  transmitter  to  utilize  the  full  channel  capacity,  we  now  need 
to  find  the  optimum  input  power  spectral  density  (PSD)  Sx{f)  that  can  further  maximize  the 

integral  capacity 


To  do  so,  we  have  noted  that  it  would  not  be  fair  to  consider  arbitrary  input  PSD  Sx{f)  because 


Hiff.-n/nnwer  snectral  densities  may  lead  to  different  values  of  total  input  power.  Given  two 


To  solve  this  optimization  problem,  we 
narrow  flat  channels  centered  at  (/i,  i  = 


again  partition  the  channel  (of  bandwidth  B)  into  K 
1,  2,  . . . ,  A")  of  bandwidth  A /  =  B/K.  By  denoting 


K\  of  bandwidth  A/  =  B/K.  By  denoting 


narrow 


/V,  =  Sn(fi)  A/ 
Hi  =  H(fi) 

Si  =  Sx(fi)  A/ 
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the  optimization  problem  becomes  a  discrete  problem  of 


K 

max  ^  log 
{•S/ 1  . 

i=l 

(14.88a) 

K 

subject  to 

M 

Co 

II 

*0 

(14.88b) 

i=l 


The  problem  of  finding  the  N  optimum  power  values  {5/}  is  the  essence  of  the  optimum  power 
loading  problem. 

This  problem  can  be  dealt  by  introducing  a  standard  Lagrange  multiplier  X  to  form  a 
modified  objective  function 


G(SU  S2 . SK)  =  £  log  (l  +  A/  +  X  • 


(14.89) 


Taking  a  partial  derivative  of  G(Si,  . . . ,  Sk)  with  respect  to  Sj  and  setting  it  to  zero,  we  have 


A/  \Hj\2 
l  +  \HjpSj/Nj  Nj 


-  A  In  2  =  0 


j  =  1,2 . K 


We  rewrite  this  optimality  condition  into 


A/  Nj 
A.  In  2  \Hj\2  1 


j  =  1.2 . K 


By  defining  a  new  variable  IT  =  (A.  In  2)  *,  we  ensure  that  the  optimum  power  allocation 
among  the  K  subchannels  is 


Si  =  W  •  A/  -  i=\,2,...,K  (14.90a) 

l«*l 

such  that  =  p  (14.90b) 

The  optimum  power  loading  condition  of  Eq.  (14.90)  is  not  quite  yet  complete  because  some 
5,  may  become  negative  if  no  special  care  is  taken.  Therefore,  we  must  further  constrain  the 
solution  to  ensure  that  5,  >  0  via 

Si  =  max  (w  •  A/  -  o)  i=l,2,  ...,K  04-91a) 

such  that  Sj  =  P  (14.91b) 

The  two  relationships  in  Eq.  (14.91)  describe  the  solution  of  the  power  loading  optimization 
problem.  We  should  note  that  there  remains  an  unknown  parameter  W  that  needs  to  be  specified. 
By  enforcing  the  total  power  constraint  £  5/  =  P,  we  can  finally  determine  the  unknown 
parameter  W. 
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Finally,  we  take  the  limit  as  A/  -►  0  and  K  — ►  oo.  Since  5/  =  Sx(/i)A/  and  /V,  = 
Sn(//)A/,  the  optimum  input  signal  PSD  becomes 


s„(0  =  max (iV -  o)  (14.92a) 

We  note  again  that  there  is  no  closed-form  solution  given  for  the  optimum  constant  W .  Instead, 
the  optimum  W  is  obtained  from  the  total  input  power  constraint 

/oo 

Sx{f)df  =  P  (14.92b) 

-OO 


or 


J{f:W-Sn(f)/\H(f)\2>0)  \  / 


df 


(14.92c) 


Substituting  the  optimum  PSD  Eq.  (14.92)  into  the  capacity  formula  will  lead  to  the  maximum 
channel  capacity  value  of 


1, 


[f:W-Sn(f)/\H{f)\2>0) 


log 


/VVW)I2\ 
V  sn<f )  ) 


df 


(14.93) 


Water-Pouring  Interpretation  of  Optimum  Power  Loading 

The  optimum  channel  input  PSD  must  satisfy  the  power  constraint  Eq.  ( 44.92c).  Once  the 
constant  W  has  been  determined,  the  transmitter  can  adjust  its  transmission  PSD  to  Eq.  (14.  92a), 
which  will  maximize  the  channel  capacity.  This  optimum  solution  to  the  channel  input  PSD 
optimization  problem  is  known  as  the  water-filling  or  water-pouring  solution. 

The  literal  water-pouring  interpretation  of  optimum  PSD  design  is  illustrated  by  Fig.  1 4. 1 3. 
First  plot  the  frequency  response  Sn(f)/ \H  (f) |2.  This  curve  is  viewed  as  shaped  like  the  bottom 
of  a  water  container.  Consider  the  total  power  as  a  bucket  ol  water  with  total  volume  P  Wc 
can  then  pour  the  entire  bucket  of  water  into  the  container  to  achieve  equal  water  level.  The 
final  water  level  will  be  raised  to  W  when  the  bucket  is  empty.  The  depth  of  the  water  lor  every 

/  dean*  optimum  PSD  l«v=i  S.OT  «  in  Eq.  (14.92a,.  Cleary 

when  the  noise  PSD  is  large  such  that  Sn(f  )/\H(f  )\2  is  high  for  some/,  then  there  may  be 

zero  water  poured  at  those^ln  ^^^^a  .Tv^oTchaTneTsNR 

r  oSvJU.  when  Sn(f  )/\H(f)\2  b  low  or  SNR  is  high  the  optimum  PSD 

i  crn  should  be  kept  high.  In  short,  water-pouring  power  loading  allocates  more  signal 
value  *  Phic®  the  channel  SNR  \H(f)\2/Sn(f)  is  high  and  allocates  little  or 

power  to  trequen  ^  ^  which  ^  channel  SNR  \H(f)\2/Sn(f  )  is  low. 

n°  STWs  solution  is  similar,  but  not  the  same  with  the  transmitter  power  loading  tor  maximum 
receiver  SNR  in  the  DMT  system  discussed  in  Sec.  13.8. 

Optimum  Power  I -oading  ^  OTO1WDMT  to  find  a  closed-form  expression  of  W . 

As  the  water-filling  l0"  iterative  water-filling  algorithm  can  be  used  to  eventually 

Once  P  has  been* loading  PSD  Sx«).  Of  course,  the  approach  in 

pracficeto  water  level  W  is  by  numerically  solving  for  W.  The  numerical  solution 
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Figure  14.13 

Illustration  of 
water-pouring 
power  allocation 
for  maximizing 
frequency- 
selective  channel 
capacity. 


requires  dividing  the  entire  channel  bandwidth  into  sufficiently  small  nonoverlapping  bands 
of  width  A/. 

Indeed,  for  practical  OFDM  or  DMT  communication  systems,  the  iterative  water-filling 
algorithm  is  tailor-made  to  achieve  maximum  channel  capacity.  Maximum  capacity  can  be 
realized  for  OFDM  channels  by  allocating  different  powers  5,  to  the  different  orthogonal 
subcarriers.  In  particular,  the  power  allocated  to  subcarrier  fj  should  be 

Si  =  A/  •  max  ( W - 0^ 

such  that  5/  =  P.  This  optimum  power  allocation  or  power  loading  can  be  solved  by  adding 
incremental  power  to  the  subcarriers  one  at  a  time  until  ^2  $  =  P 


1 4.8  MULTIPLE-INPUT-MULTIPLE-OUTPUT 
COMMUNICATION  SYSTEMS 


In  the  past  decade,  one  of  the  important  breakthroughs  in  wireless  communications  is  the 
advent  of  multiple-input-multiple-output  (MIMO)  technologies.  In  fact,  both  the  Wi-Fi  (IEEE 
802.1  In)  standard  and  the  WiMAX  (IEEE  802. 16e)  standard  have  incorporated  MIMO  trans¬ 
mitters  and  receivers  (or  transceivers).  The  key  advantage  of  MIMO  wireless  communication 
systems  lies  in  their  ability  to  significantly  increase  wireless  channel  capacity  without  either 
requiring  additional  bandwidth  or  substantially  increasing  the  signal  power  at  the  transmitter. 
Interestingly,  the  MIMO  development  originates  from  the  fundamentals  of  information  theory. 
We  shall  explain  this  connection  here. 


14.8.1  Capacity  of  MIMO  Channels 

Whereas  earlier  only  a  single  signal  variable  was  considered  for  transmission,  we  now  deal 
with  input  and  output  signal  vectors.  In  other  words,  each  signal  vector  consists  of  multiple 
data  symbols  to  be  transmitted  or  received  concurrently  in  MIMO  systems.  Consider  a  random 
signal  vector  x  =  (X|,  X2,  . . . ,  \\)T .  If  the  random  signal  vector  is  discrete  with  probabilities 


Pi  =  P(X  =  Xi) 


i=  1,2, ... 
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then  the  entropy  of  x  is  determined  by 

H(X)  =  -^/7/log/J/ 


(14.94) 


Similarly,  when  x  is  continuously  distributed  with  probability  density  function  p(x i, 
JC2,  . . . ,  jcjv),  its  differential  entropy  is  defined  by 


H(x) 


=  -  /  •••  j  p(x  1,  X2 ,  . . Xn)  \ogp(X\,  X2,  xu)dx\y  dx  2 . dXN  (14.95) 


Consider  a  real-valued  random  vector  x  consisting  of  N  i.i.d.  Gaussian  random  variables. 
Let  x  have  (vector)  mean  p  and  covariance  matrix 

Cx  =  E[(x-  fi)(x-  fi)T) 


Its  differential  entropy  can  be  found"'  to  be 


1 


H(x)=  -  [N  •  log  (2ne)  +  log det  (Cx)] 


(14.96) 


Clearly,  the  entropy  of  a  random  vector  is  not  affected  by  the  mean  fi.  It  is  therefore  convenient 
to  consider  only  the  random  vectors  with  zero  mean.  From  now  on,  we  will  assume  that 

li  =  E{x}=  0 

Among  all  the  real-valued  random  variable  vectors  that  have  zero  mean  and  satisfy  the  condition 

Cx  =  Cov(x,  x)  =  E\xx' ) 


we  have11 

max  H(x)  =  Un-  log(2^e)  +  log  det  (Cx)] .  (14.97) 

px[X):  Cov(x,  xT)=C% 

This  means  that  Gaussian  vector  distribution  has  maximum  entropy  among  all  real  random 

vectors  of  the  same  covariance  matrix.  .  „  ..  .  .  ,. 

Now  consider  a  flat  fading  MIMO  channel  with  matrix  gam  H .  The  N  x  M  channel  matrix 

H  connects  the  M  x  1  input  vector  x  and  N  x  1  output  vector  y  such  that 

y  =  H  x  +  w  (14.98) 

where  w  is  the  iV  x  1  additive  white  Gaussian  noise  vector  with  zero  mean  and  covariance 
matrix  Cw.  As  shown  in  Fig.  14.14.  a  MIMO  system  consists  of  M  transmit  antennas  at  the 


Figure  14.14 

MIMO  system 
with  M  transmit 
antennas  and  N 
receive 
antennas. 
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transmitter  end  and  N  receive  antennas  at  the  receiver  end.  Each  transmit  antenna  can  transmit 
to  all  N  receive  antennas.  Given  a  fixed  channel  H  of  dimensions  N  x  M  (i.e.,  M  transmit 
antennas  and  N  receive  antennas),  the  mutual  information  between  the  channel  input  and  output 
vectors  is 


/(x,  y)  =  H(y)  -  H(y\x) 

=  //(y)  —  //(H  •  x  +  w|x) 


(14.99a) 

(14.99b) 


Recall  that  under  the  condition  that  x  is  known.  H  •  x  is  a  constant  mean.  Hence,  the  conditional 
entropy  of  y  given  x  is 


(14.100) 


H( y|x)  =  H(H  •  x  +  w|x)  =  H( w) 


and 


(14.101a) 


/(x,  y)  =  H (y)  —  H (w) 


=  H(y)  -  -  [N  •  log2  (2 7ie)  +  log  det  (Cw )] 


(14.101b) 


As  a  result,  we  can  use  the  result  of  Eq.  (14.97)  to  obtain 


max/(x,  y)  =  max//(y)  -  -  [N  ■  log2  (2ne)  +  log  det  (Cw)]  (14.102a) 

=  ^  [N  '  *°g2  (2?re)  +  log  det  (Cy )]  —  -  [W  •  log2  (2  ne)  +  logdet(Cw)] 


(14.102a) 


(14.102b) 


=  ^  [log  det  (Cy)  -  log  det  (Cw)] 


Since  the  channel  input  x  is  independent  of  the  noise  vector  w,  we  have 

Cy  =  Cov(y,  y )=H  CxHr  +  Cw 
Thus,  the  capacity  of  the  channel  per  vector  transmission  is 

Cs  =  max  /  (x,  y) 

p(x ) 


(14.103) 


Given  a  symmetric  low-pass  channel  with  B  Hz  bandwidth,  2 B  samples  of  x  can  be 
transmitted  to  yield  provide  channel  capacity  of 


C(H)  =  Blog  det  (l  +  HCxHtC~'} 
=  B  log  det  (7  +CxHtC~'h) 


(14.104) 
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where  we  have  invoked  the  equality  that  for  matrices  A  and  B  of  appropriate  dimensions, 
det(/  +  A  •  B)  =  det(7  +  B  A).  We  clearly  can  see  from  Eq.  (14.104)  that  the  channel 
capacity  depends  on  the  covariance  matrix  Cx  of  the  Gaussian  input  signal  vector.  This  result 
shows  that,  given  the  knowledge  of  the  MIMO  channel  (H1  C~]H)  at  the  transmitter,  an 
optimum  input  signal  can  be  determined  by  designing  Cx  to  maximize  the  overall  channel 
capacity  C(H). 

We  now  are  left  with  two  scenarios  to  consider:  (1)  MIMO  transmitters  without  the 
MIMO  channel  knowledge  and  (2)  MIMO  transmitters  with  channel  knowledge  that  allows 
Cx  to  be  optimized.  We  shall  discuss  the  MIMO  channel  capacity  in  these  two  separate 
cases. 


14.8.2  Transmitter  without  Channel  Knowledge 

For  transmitters  without  channel  knowledge,  the  input  covariance  matrix  Cx  should  be  chosen 
without  showing  any  preference.  As  a  result,  the  default  Cx  =  o~I  should  be  selected.  In  this 
case,  the  MIMO  system  capacity  is  simply 


(14.105) 


Consider  the  eigendecomposition  of 


HrC~'H  =  UDUh 


where  V  is  a  N  x  N  square  unitary  matrix  such  that  U  ■  U"  =  In  and  D  is  a  diagonal  matrix 
with  nonnegative  diagonal  elements  in  descending  order: 


D  =  Diag(d|,  di . dr,  0,  0) 


Notice  that  dr  > 


INotice  inai  ar  ,  aiiHh 

r  <  min(N,  M ).  Because  det  (/  +AB)  =  det  (/  +  BA)  and  L  U 


0  is  the  smallest  nonzero  eigenvalue  of  HrCjH  whose  rank  is  bounded  by 
</  +AB)  =  det  (/  +  BA)  and  UHU  =/,  we  have 


C  =  Blog  det  (/  +°l  ■  UDU") 
=  Blog  det  (/  +  o'  •  DU11 


(14.106a) 


=  Blog[~[(l  +o*dt) 


(14.106b) 


r 


=  B  ^  log(  1  +  cf\di) 


i=i 


(14.106c) 
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Figure  14.15 

r-Channel 
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equivalent  to  a 
MIMO  system 
without  channel 


In  the  special  case  of  channel  noise  that  is  additive,  white,  and  Gaussian,  then  Cw  =  o^l 
and 


UH  (14.107) 


htc~'h  =  —HtH  =  —V 

"  it2  n- 

UW  VJ 


0 


0 


where  y,  is  the  ;'th  largest  eigenvalue  of  H7 H.  which  is  assumed  to  have  rank  r.  Consequently, 
the  channel  capacity  for  this  MIMO  system  is 


(14.108) 


In  short,  this  channel  capacity  is  the  sum  of  the  capacity  of  r  parallel  AWGN  channels.  Each 
subchannel  SNR  is  a*  ■  Yi/^w-  Figure  14. 15  demonstrates  the  equivalent  system  that  consists 
of  r  parallel  AWGN  channels  with  r  active  input  signals  xj ,  . . . ,  \r. 

In  the  special  case  when  the  MIMO  channel  is  so  well  conditioned  that  all  its  nonzero 
eigenvalues  are  identical  y,  =  y,  the  channel  capacity  is 


(14.109) 


Compared  with  the  single-input-single-output  channel  for  which  H  is  a  scalar  such  that  r  =  1, 
the  SISO  channel  capacity  is  simply 


(14.110) 


Therefore,  by  applying  MIMO  transceivers,  the  channel  capacity  is  increased  to  r  times 
the  capacity  of  the  original  single-input-single-output  channel.  This  result  strongly  demon¬ 
strates  the  significant  advantages  of  MIMO  technology  in  providing  much-needed  capacity 
improvement  for  wireless  communications. 


Figure  14.15 
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14.8.3  Transmitter  with  Channel  Knowledge 


In  a  number  of  wireless  communication  systems,  the  transmitter  may  acquire  the  knowledge 
of  the  MIMO  channel  H 1  C^H  through  a  feedback  mechanism.  In  this  case,  the  transmitter 
can  optimize  the  input  signal  covariance  matrix  Cx  to  maximize  the  MIMO  system  capacity. 

First,  we  observe  that  the  channel  capacity  of  Eq.  (14.104)  can  be  increased  simply  by 
scaling  the  matrix  Cx  with  a  large  constant  k.  Of  course,  doing  so  would  be  effectively  increas¬ 
ing  the  transmission  power  k  times  and  would  be  untair.  This  means  that  to  be  lair,  the  design 
of  optimum  covariance  matrix  Cx  must  be  based  on  some  practical  constraint.  In  a  typi¬ 
cal  communication  system,  we  know  that  a  transmitter  with  higher  signal  power  will  lead  to 
higher  SNR  and,  hence,  larger  capacity.  Therefore,  similar  to  the  water-pouring  PSD  design  for 
frequency-selective  channels,  we  should  constrain  the  total  transmission  power  of  the  MIMO 
transmitter  by  the  transmitter  power  threshold  P. 

To  show  how  this  power  constraint  would  affect  the  input  covariance  matrix  C\,  we  first 
need  to  introduce  the  “trace”  (Tr)  operator  of  square  matrices.  Consider  an  M  x  M  square 
matrix  F  whose  element  on  the  /th  row  and  the  jth  column  is  denoted  by  Fjj.  Then  the  trace 
of  the  matrix  F  is  the  sum  of  its  diagonal  elements 


M 

Tr  (F)  =  £>,,•  04.111) 

i=i 

Since  the  trace  operator  is  linear,  it  follows  from  the  property  ot  the  expectation  operator  /.  { • ) 
[Eq.  (8.59)]  that 

£{Tr(F)}  =Tr(£[F))  (14.112) 

We  now  introduce  a  very  useful  property  of  the  trace  operator.  If  matrix  products  A  B  and 
BA  are  both  square  matrices  of  appropriate  sizes,  then  they  both  have  the  same  trace,  that  is, 

Tr  (AB)  =  Tr(F/t)  (14.113) 


This  equality  turns  out  to  be  very  important.  By  applying  Eq.  (14.1 13),  we  know  that  for 
vector  x 

xrx  =  Tr[xrx]  (14.114a) 

=  Tr[xxr]  (14.114b) 

For  the  signal  vector  x  =  (x,.  x2,  ....  xw),  we  can  apply  Eqs.  (14.1 12)  and  (14.1 14)  to 
show  that  the  average  sum  power  of  the  signal  vector  x  is 


=£lTrK]l 

=  Tr[£KI] 

=  Tr|Cx] 


(14.115a) 


(14.115b) 
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As  a  result,  we  have  established  that  the  power  constraint  translates  into  the  trace  constraint 

Tr  (Cx)  <  P 

Therefore,  given  the  knowledge  of  HTC~[H  at  the  transmitter,  the  optimum  input  signal 
covariance  matrix  to  maximize  the  channel  capacity  is  defined  by 

max  B  logdet  (7  +  CxHrC~]H )  (14.116) 

Cx:  Tr(Cx)<P  "  '  ; 

This  optimization  problem  is  henceforth  well  defined. 

To  find  the  optimum  Cx,  recall  the  eigendecomposition 

htc~'h  =  uduh 

By  applying  the  trace  property  of  Eq.  (14.1 13),  we  can  rewrite  the  optimum  covariance  design 
problem  into 

max  B  logdet  (/  +  CXUDUH\  =  max  B log det  (7  +  UhCxUd)  (14.117) 

Tr  (CX)<P  '  '  TnCx)</>  '  ' 

Because  covariance  matrices  are  positive  semidefinite  (Appendix  D.7),  we  can  define  a  new 
positive  semidefinite  matrix 


CX  =  UHCXU  (14.118) 

According  to  Eq.  (14.1 13),  we  know  that 

Tr[Cx]=Tr[t/wCxt/] 

=  Tr[cxt/t/"J 

=  Tr[Cx/] 

=  Tr  [Cx]  (14.119) 

In  fact,  Eq.  ( 14. 1 19)  states  that  the  traces  of  Cx  and  Cx  are  identical.  This  equality  allows 
us  to  simplify  the  capacity  maximization  problem  into 

c  =  filogdet(/  +  UhCxUd) 

Cx:Tr(C*)</>  '  > 

=  _  max  B  log  det  (/  +  CXD)  (14.120a) 

Cx-  Tr (C%)<p 

=  max  Blogdet  (/  +  Dl/2CxD]/2^\  (14.120b) 

C«:Tr<cx)<p  V  ) 

The  problem  of  Eq.  (14.120b)  is  simpler  because  D  is  a  diagonal  matrix.  Furthermore, 
we  can  invoke  the  help  of  a  very  useful  tool  often  used  in  matrix  optimization  known  as  the 

Hadamard  Inequality. 
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Hadamard  Inequality:  Let  aij  be  the  element  of  complex  n  x  n  matrix  A  on  the  ith  row 
and  the  jth  column.  A  is  positive  semidefinite  and  Hermitian ,  that  is,  (conj{A))r  =  A.  Then 
the  following  inequality  holds : 


n 

del  (A)  <  PI  au 

i=i 


with  equality  if  and  only  if  A  is  diagonal. 

We  can  easily  verify  that  I  +  Dl/2CxDl/2  is  positive  semidefinite  because  Cx  is  positive 
semidefinite  (Prob.  14.8-3).  By  invoking  Hadamard  inequality  in  Eq.  ( 14. 1 2()b),  it  is  clear  that, 
for  maximum  channel  capacity  we  need 

D''2CxD]'2  =  diagonal 


In  other  words,  the  optimum  channel  input  requires  that 

Cx  =£>-1/2  •  diagonal  D~x/2  =  diagonal  (14.121) 

Equation  (14.121)  establishes  that  the  optimum  structure  of  Cx  is  diagonal.  This  result  greatly 
simplifies  the  capacity  maximization  problem.  Denote  the  optimum  structure  covariance 

matrix  as 


Cx  =  diagonal  (ci,  C2,  cm) 


Then  the  capacity  is  maximized  by  a  positive  semidefinite  matrix  Cx  according  to 

C=  max  Blogdeti/  +D,/2CxD,/~)  (14.122a) 

cyTr(C„)</> 

M 

=  max  B  log  (1  +  c,d,-)  (14.122b) 

££,*•</>.  c>o  ,-=l 


In  other  words,  our  job  is  to  find  the  optimum  positive  elements  {c, )  to  maximize  Eq.  ( 1 4. 1 22b) 

subject  to  the  constraint  51/ ci  —  ...  ,  ,  .  c 

Taking  the  Lagrangian  approach,  we  define  a  modified  objective  function 


M 


g(c\ 


,c2,...,CM)  =  B^og(\+Cidi)  +  X 


i=l 


("I") 


(14.123) 


Taking  derivative  of  the  modified  objective  function  with  respect  to  Cj  (j  =  \,  2 . A/)and 

setting  them  to  zero,  we  have 

log e1di_x  =  0  j  =  1,2 . M 

1  +  Cjdj 


c>  =  [j^2 


j=  1,  2,  ....  M 


or 
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The  optimum  diagonal  elements  {c/}  are  subject  to  the  constraints 


M 


Cj  >  0  j  =  1 ,  . . . ,  M 


Similar  to  the  problem  of  colored  Gaussian  noise  channel  power  loading,  we  can  define  a 
water  level  W  =  B/(\  In  2).  By  applying  the  same  iterative  water-pouring  procedure,  we  can 
find  the  optimum  power  loading  (on  each  eigenvector)  to  be 


1,  2 . M 


(14.124a) 


with  the  total  power  constraint  that 


M 


(14.124b) 


The  water-filling  interpretation  of  the  optimum  power  loading  at  a  MIMO  transmitter  given 
channel  knowledge  can  be  illustrated  (Fig.  14.16). 

The  optimum  input  signal  covariance  matrix  is  therefore  determined  by 


CX  =  U  Diag  (c,  c2,  . . . ,  c,„,  0 . 0)  •  UH 


In  other  words,  the  input  signal  vector  can  be  formed  by  a  unitary  transformation  V  after  we 
have  found  c/  based  on  water  pouring.  In  effect,  c,  is  the  amount  of  power  loaded  on  the  /th 
column  of  (/,  that  is,  the  /th  eigenvector  of  Hl  C~]H . 

Suppose  we  would  like  to  transmit  m  independent  signal  streams  {51 ,  $2*  •  •  •  ^  sm)  °f  zero 
mean  and  unit  variance.  Then  the  optimum  MIMO  channel  input  can  be  formed  via 


*1 

*2 


(14.125) 


x  =  J7i  diagf^cT,  Vq,  • 


Figure  14.16 

Water-filling 
interpretation  of 
MIMO  trans¬ 
mission  power 


loading  based 
on  channel 
knowledge. 


m 


Subchannels  (eigenvectors) 
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Figure  14.17 

Water-pouring 
interpretation  of 
the  optimum 
MIMO 

transmission 
power  loading 
based  on 
channel 
knowledge. 


Noise 


where  U\  are  the  first  m  columns  of  U.  Figure  14.17  is  the  block  diagram  of  this  optimum 
MIMO  transmitter,  which  will  maximize  channel  capacity  based  on  knowledge  of  the  MIMO 
channel.  The  matrix  multiplier  U i  diagf^O,  Jci . at  the  transmitter  is  known  as 

the  optimum  linear  precoder. 


14.9  MATLAB  EXERCISES 

In  this  section,  we  provide  MATLAB  exercises  to  reinforce  the  concepts  of  source  coding  and 
channel  capacity  in  this  chapter. 


COMPUTER  EXERCISE  14.1:  HUFFMAN  CODE 

The  first  program,  huffmancode.m,  is  a  Huffman  encoder  function.  The  user  need  only  supply  a 
probability  vector  that  consists  of  all  the  source  symbol  probabilities.  The  probability  entries  do  not  need 

to  be  ordered. 


function  [huffcode.n] =huf fmancode (p) ; 

%  input  p  is  a  probability  vector  consisting  of 
%  probabilities  of  source  symbols  x_i 

^  ^rror^ 'Negative  element  cannot  be  in  a  probability  vector') 


return 

else  if  abs (sum(p) -1) >l-e'12^ 

error ('Sum  of  input  probability  is  not 


1# ) 


return 


end 


[psort,pord] =sort (p) ; 


n=length(p) ; 

q=p; 

for  i=l:n-l 

[q,  1]  =sort (q) ;  .  n  . 

m(i/  ;)  =  [l(l:n-i+l)  #  zeros  (l,i  Di» 

q=[q(l)+q(2)  ,q(3:end)  ,1]  ; 

end 

Cword=blanks (n  2) , 


Cword (n) =' 0 ' ; 
Cword (2*n) =/ 1 
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for  il=l:n-2 

Ctemp= Cword; 

idxO  =  f ind (m (n-il , : ) = = 1 ) *  n ; 

Cword (1 :n) = [Ctemp (idxO-n+2 : idxO)  '0'] ; 

Cword (n+1 : 2*n) = [Cword (1 :n-l)  ' 1 ' ] ; 

for  i2=2:il+l 

idx2=f ind (m (n-il , : ) ==i2 ) ; 

Cword (i2*n+l : (i2+l) *n) =Ctemp (n* (idx2-l) +1 :n*idx2) ; 

end 

end 

for  i=l:n 

idxl=f ind (m (1, : ) ==i) ; 

huffcode (i, l:n) =Cword(n* (idxl-1) +l:idxl*n) ; 

end 

end 


The  second  program,  huf  f  manEx.  m,  generates  a  very  simple  example  of  Huffman  encoding.  In 
this  exercise,  we  provide  an  input  probability  vector  of  length  8.  The  M  ATLAB  program  huf  f  manEx .  m 
will  generate  the  list  of  codewords  for  all  the  input  symbols.  The  entropy  of  this  source  H  (x)  is  computed 
and  compared  against  the  average  Huffman  codeword  length.  Their  ratio  shows  the  efficiency  of  the 
code. 

%  Mat lab  Program  <huff manEx. m> 

%  This  exercise  requires  the  input  of  a 
%  probability  vector  p  that  list  all  the 
%  probabilities  of  each  source  input  symbol 
clear; 

P= [0.2  0.05  0.03  0.1  0.3  0.02  0.22  0.08];  %Symbol  probability  vector 
[huffcode , n] =huffmancode (p) ;  %Encode  Huffman  code 

entropy=sum ( -log (p) *p' ) /log (2) ;  %Find  entropy  of  the  source 

%  Display  the  results  of  Huffman  encoder 
display ( [ ' symbol 9 , '  -->  ' codeword','  Probability']) 
for  i=l:n 

codeword_Length(i) =n- length (find (abs (huffcode (i, :) )==32) ) ; 

display ( ['x' ,num2str(i) , '  -->  ' , huffcode (i, :) , '  ' , num2str  (p  (i) ) 1 > ; 

end 

codeword_Length 

avg_length=codeword__Length*p '  ; 

display ( ['Entropy  =  ',  num2str (entropy) ] ) 

display ([' Average  codeword  length  =  ',  num2str (avg_length) ] ) 

By  executing  the  program  huf  f  manEx  .m,  we  can  obtain  the  following  results. 


huf f manEx 

symbol 

—  > 

codeword 

Probability 

xl 

-  -  > 

00 

i 

x2 

-  -  > 

10111 

0.05 

x3 

-  -  > 

101101 

0.03 

x4 

-  -  > 

100 

0.1 

x5 

-  -  > 

11 

0.3 

x6 

-  -  > 

101100 

0.02 

x7 

-  -  > 

01 

0.22 

1 4.9  MATLAB  Exercises  895 


x8  -->  1010  0.08 

codeword_Length  = 

25632624 

Entropy  =  2.5705 

Average  codeword  length  =  2.61 


COMPUTER  EXERCISE  14.2:  CHANNEL  CAPACITY  AND  MUTUAL  INFORMATION 

This  exercise  provides  an  opportunity  to  compute  the  single-input-single-output  channel  capacity  under 
additive  white  Gaussian  noise. 

MATLAB  program  mutualinfo.m  contains  a  function  that  can  compute  the  average  mutual 
information  between  two  data  sequences  *  and  y  of  equal  length.  We  use  a  histogram  to  estimate  the  joint 
probability  density  function  pU.  y)  before  calculating  the  mutual  information  according  to  the  definition 

of  Eq.  (14.45a). 


function  tnuinf o__bit—  mutualinf o  (x, y) 

%mutualinf o  Computes  the  mutual  information  of  two 
%  vectors  x  and  y  in  bits 

%  muinfo_bit  =  mutualinfo (X, Y) 

% 

%  output  :  mutual  information 

a  y  y  :  The  1-D  vectors  to  be  analyzed 


minx=min(x) ; 
maxx=max(x); 

deltax= (maxx-minx) / (length (x) -1) ; 

lowerx=minx-deltax/2 ; 
upperx=maxx+deltax/2 ; 

ncellx=ceil (length(x) * (1/3) ) ; 

miny=min (y) ; 

maxy=max(y); 

deltay= (maxy-miny) / (length (y) -  , 

lowery=miny-deltay/2 ; 
uppery=maxy+deltay/2 ; 

ncelly=ncellx; 


rout (l sncellx, 1 sncelly) =0; 


xx = round ( 
yy= round ( 


(x 

(Y 


-lowerx) / (upperx-lowerx) *ncellx  +  1/2  ); 
-lowery) / (uppery-lowery) *ncelly  +  1/2  ); 


for  n= 1 • length (x) 


indexx=xx(n) ; 

indexy=yy(n) • 
if  indexx  >= 
rout ( indexx 


1  &  indexx  <=  ncellx  &  indexy  >= 
, indexy) =rout (indexx, indexy) +1; 


end; 

end; 


1  Sc  indexy  <=  ncelly 
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h=rout ; 

estimate=0 ; 
sigma=0 ; 
count =0 ; 

%  determine  row  and  column  sums 

hy=sum(h) ; 
hx=sum (h' ) ; 

for  nx=l:ncellx 
for  ny=l:ncelly 
if  h (nx, ny) ~  =  0 

logf =log (h (nx, ny) /hx (nx) /hy (ny) ) ; 
else 

logf =0 ; 
end; 

count = count +h (nx, ny) ; 
estimate=estimate+h (nx, ny) *logf ; 
sigma=sigma+h (nx, ny) *logf ~2 ; 
end; 
end; 

%  biased  estimate 
estimate=estimate/count ; 

sigma  =sqrt (  ( sigma/count -estimate^ 2 ) / (count-1)  ); 

estimate=estimate+log (count) ; 

nbias  = (ncellx-1) * (ncelly-1) / (2*count) ; 

%  remove  bias 

muinfo_bit=  (estimate -nbias)  /  log  (2)  ,* 

In  the  main  MATLAB  program,  capacity_plot .  m,  we  calculate  AWGN  channel  capacity  for 
S/N  ratio  of  0,  5,  10,  15,  and  20  dB.  The  channel  capacity  under  different  SNRs  is  plotted  in  Fig.  14.18. 
In  addition,  we  can  test  the  mutual  information  /(x,  y)  between  the  channel  input  x  and  the  corresponding 
channel  output  y  under  the  same  SNR  levels. 

In  this  program,  we  estimate  /(x,  y)  for  five  different  zero-mean  input  signals  of  unit  variance: 

•  Gaussian  input 

•  Binary  input  of  equal  probability 

•  PAM-4  input  of  equal  probability 

•  PAM-8  input  of  equal  probability 

•  Uniform  input  in  interval  [— \/3,  \/3] 

The  corresponding  mutual  information  /(x,  y)  is  estimated  by  averaging  over  1,000,000  data  samples. 

%  Matlab  program  <capacity_plot . m> 
clear ; elf ; 

Channel_gain=l ; 

H=Channel_gain;  %  AWGN  Channel  gain 
SNRdb=  0:5:20;  %  SNR  in  dB 
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Figure  14.18 

Channel 
capacity 
compared  with 
mutual 
information 
between  channel 
output  and 
different  input 
signals. 


L=1000000 ; 


SNR=10 . " (SNRdb/10) ; 

%  Compute  the  analytical  channel  capacity 
Capacity=l/2*log (1+H*SNR) /log (2) ; 

%  Now  to  estimate  the  mutual  information  between  the  input 
%  and  the  output  signals  of  AWGN  channels 


for  kk=l : length (SNRdb) , 
noise=randn (L, 1) / sqrt (SNR (kk) ) ; 
x=randn (L, 1) ; 
xl=sign(x); 

x2= (floor (rand(L,l) *4-4. e-10) *2-3) /sqrt (5); 

x3= (floor (rand(L,l) *8-4. e-10) *2-7) /sqrt (21); 
x4  = ( rand ( L , 1 ) - 0 . 5 ) *  sqrt ( 12 ) ; 


muinfovec (kk, 1) =mutualinfo (x, x+noise)  ; 
muinfovec (kk, 2) =mutualinfo (xl,xl+noise) ; 
muinfovec (kk, 3 ) =mutualinf o (x2 , x2+noise  ; 
muinfovec ( kk , 4 ) =mutual info(x3,x3+noise; 

muinfovec (kk, 5) =mutualinfo(x4,x4+noise) ; 


%Gaussian  input 
%  Binary  input  (-1,+1) 

%  4 - PAM  input  (-3,-1, 1,3) 

%  8 -PAM  input  (-7, -5, -3, -1,1, 3, 5, 7) 
%  Uniform  input ( -0 . 5 , 0 . 5) 


end  ,  .  , 

plot (SNRdb, Capacity, 'k-d') ; hold  on 

plot ( SNRdb, muinfovec (:,1) , 'k*0'  > 
plot (SNRdb, muinfovec ( : ,2) 
plot (SNRdb, muinfovec ( : ,3) 
plot ( SNRdb, muinfovec ( : ,  4 ) 

n lot (SNRdb, muinfovec (:, 5) ,  ~  • 

Label  C  SNR  (dBI  ■ )  .ylabel  C '  ™t»al 

lege„d  (•  capacity '.■oauesl.n,Msj), 


'k-s' ) 
'k-v' ) 
'k-x' ) 
'k-*' ) 


(bits/sample) ' ) 
' PAM -8 ' ,  . .  . 


hold  off 


*  1  information  is  plotted  against  the  channel  capacity  under  different  SNR 

tor  ?£SZ£32Z  0»PO— i  («  PAM4,; 
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(4)  8-level  PAM  (or  PAM-8);  (5)  uniform.  All  five  symmetric  distributions  are  scaled  to  have  the  same 
zero  mean  and  unit  power  (variance).  As  shown  in  Figure  14.18.  the  mutual  information  achieved  by 
Gaussian  input  closely  matches  the  theoretical  channel  capacity.  This  result  confirms  the  conclusion 
of  Sec.  14.5  that  Gaussian  channel  input  achieves  channel  capacity.  Fig.  14.18  shows  that  the  mutual 
information  for  all  other  channel  inputs  falls  below  the  mutual  information  achieved  by  the  Gaussian 
input.  Among  the  five  different  distributions,  binary  input  achieves  the  least  mutual  information,  whereas 
the  mutual  information  of  PAM-8  input  is  very  close  to  the  channel  capacity  for  the  SNR  below  20 dB. 
This  observation  indicates  that  higher  mutual  information  can  be  achieved  when  the  distribution  of  the 
channel  input  is  closer  to  Gaussian. 


COMPUTER  EXERCISE  14.3:  MIMO  CHANNEL  CAPACITY 

We  show  in  this  exercise  how  MIMO  channel  capacity  varies  for  different  numbers  of  transmit  antennas 
and  receive  antennas.  The  MATLAB  program  mimocap  .  m  will  calculate  the  theoretical  MIMO  capacity 
of  200  random  MIMO  channels  of  different  sizes  at  an  SNR  of  3  dB.  We  consider  the  case  of  a  transmitter 
that  does  not  have  the  MIMO  channel  knowledge.  Hence,  each  transmit  antenna  is  allocated  the  same 
signal  power  o\.  Additionally,  the  channel  noises  are  assumed  to  be  independent  additive  white  Gaussian 
with  variance  <r^. 

The  entries  in  the  MIMO  channel  matrix  H  are  randomly  generated  from  Gaussian  distribution  of 
zero  mean  and  unit  variance.  Because  the  channels  are  random,  for  M  transmit  antennas  and  N  receive 
antennas,  the  MIMO  capacity  per  transmission  is 

lN  +  ?l-HHT 

Because  the  entries  in  the  MIMO  channel  matrix  H  are  randomly  generated,  its  corresponding  capacity 
is  also  random.  From  the  200  channels,  each  N  x  M  MIMO  configuration  should  generate  200  different 
capacity  values. 

%  Matlab  Program  <mimocap.m> 

%  This  program  calculates  the  capacity  of  random  MIMO  (mxn)  channels 
%  and  plots  the  cumulative  distribution  (CDF)  of  the  resulting 
%  capacity; 

%  Number  of  random  channels:  K=200 

%  Signal  to  noise  ratio:  SNRdb=3dB 

clear 

hold  off 

elf 

K=  200; 

SNRdb=3 ; 

SNR=1(T  (SNRdb/10 )  ; 

m=l;  n=l ;  %  lxl  channels 

for  kk=l : K 

H=randn ( [m  n] ) ;  %Random  MIMO  Channel 

capll (kk) =log (det (eye (n,n)  +SNR*H' *H) ) / (2*log (2) ) ; 

end 

[Nil , Cll] =hist (capll , K/10) ;  %CDF  of  MIMO  capacity 

m=2 ; n=2 ;  %  2x2  channels 

for  kk=l : K 

H=randn([m  n] ) ;  %Random  MIMO  Channel 

cap22 (kk) =log (det (eye (n,n) +SNR*H' *H) ) /  (2*log  (2) )  ; 


end 
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[N22 , C22] =hist (cap22 , K/10) ;  %CDF  of  MIMO  capacity 


m=4 ; n=2 ;  %  4x2  channels 

for  kk=l : K 

H=randn([m  n] ) ;  %Random  MIMO  Channel 

cap42 (kk) =log (det (eye (n, n) +SNR*H' *H) ) / (2*log  (2) )  ; 

end 

[N42 , C42] =hist (cap42 , K/10 ) ;  %CDF  of  MIMO  capacity 


m=2 ; n=4 ;  %  4x2  channels 

for  kk=l : K 

H=randn([m  n] ) ;  %Random  MIMO  Channel 

cap24(kk)=log(det(eye(n,n)+SNR*H'*H) )/(2*log(2) ) ; 

end 

[N24 , C24] =hist (cap24 , K/10) ;  %CDF  of  MIMO  capacity 


m=4 ; n=4 ;  %  4x2  channels 

for  kk=l : K 

H=randn ( [m  n] ) ;  %Random  MIMO  Channel 

cap44 (kk) -log (det (eye (n,n) +SNR*H' *H) ) / (2*log (2) ) ; 

end 

[N44 , C44] =hist (cap44 , K/10) ;  %CDF  of  MIMO  capacity 


m=8;n=4;  %  4x2  channels 

for  kk=l:K 

H=randn([m  n] ) ;  %Random  MIMO  Channel 

cap84 (kk) =log (det (eye (n,n) +SNR*H' *H) ) / (2*log (2) ) ; 

end 

[N84 , C84] =hist (cap84 , K/10) ;  %CDF  of  MIMO  capacity 

m=8;n=8;  %  4x2  channels 

for  kk=l : K 

H=randn([m  n] ) ;  %Random  MIMO  Channel 

cap88 (kk) -log (det (eye (n, n) +SNR*H' *H) ) / (2*log  (2 ) )  ; 

^N88 , C88] =hist (cap88 , K/10) ;  %CDF  of  MIMO  opacity 
%  now  ready  to  plot  the  CDF  of  the  capacity  distribution 
plot (Cll , cumsum (Nil) /K, ' k-x' , C22 , cumsum (N22) /K,  k-o  .... 

C24  cumsum (N2 4 ) /K, ' k-d' , C42 , cumsum (N42) /K,  k-v 
C44  cumsum (N44 ) /K, ' k-s ' , C84 , cumsum (N84 ) /K, ' k- * ' ) ; 
legend  ( ^ 1x1 ' , ' 2x2 ' , ' 2x4 ' , ' 4x2 ' , ' 4x4 '  ( '  8x4 ' , ' Location' , ' SouthEast ' ) ; 

x lab el ( ' Rate  or  Capacity  (bits/sample)  for  SNR=3dB' ) ;ylabel ( ' CDF' ) ; 

%  End  of  the  plot 

In  Fig.  14.19,  we  illustrate  the  cumulative  distribution  function  (CDF)  of  the  channel  capacity 
CmIMO 

Prob(CjviIMO  ^  r) 

-  n  pcti  mated  from  the  200  random  channels.  We  computed  the  CDF  of  channel 
of  each  gL«»:  I  x  1,2x2, 2x4. 4x2, 4x4.  and  8x4.  Hie  real,, 

capacity  for  six  dl  svstems  wjth  more  transmit  and  receive  antennas  will  have  CDF  distributions 

clearly  show  that  MIMU  y  example,  2x2  MIMO  systems  will  have  capacity  below 

concentrated  at  higher  capacity  or  rate,  ror  ” 
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4  bits/sample  with  a  probability  of  1.  However,  for  4  x  4  MIMO  systems,  the  probability  drops  to  only 
0.2.  When  considering  8x4  MIMO  systems,  the  probability  falls  below  0.05.  These  numerical  examples 
clearly  demonstrate  the  higher  capacity  achieved  by  MIMO  technologies. 
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PROBLEMS 


14.1-1  A  message  source  generates  one  of  four  messages  randomly  every  microsecond.  The  probabi  iue 
of  these  messages  are  0.4,  0.3,  0.2,  and  0.1.  Each  emitted  message  is  independent  ot  the  ot 
messages  in  the  sequence. 
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(a)  What  is  the  source  entropy? 

(b)  What  is  the  rate  of  information  generated  by  this  source  (in  bits  per  second)? 


14.1-2  A  standard  television  picture  is  composed  of  approximately  300,000  basic  picture  elements 
(about  600  picture  elements  in  a  horizontal  line  and  500  horizontal  lines  per  frame).  Each  of 
these  elements  can  assume  10  distinguishable  brightness  levels  (such  as  black  and  shades  of 
gray)  with  equal  probability.  Find  the  information  content  of  a  television  picture  frame. 


14.1- 3  A  radio  announcer  describes  a  television  picture  orally  in  1000  words  from  his  vocabulary  of 

10,000  words.  Assume  that  each  of  the  10,000  words  in  the  announcer's  vocabulary  is  equally 
likely  to  occur  in  the  description  of  this  picture  (a  crude  approximation,  but  good  enough  to 
give  an  idea).  Determine  the  amount  of  information  broadcast  by  the  announcer  in  describing 
the  picture.  Would  you  say  the  announcer  can  do  justice  to  the  picture  in  1000  words?  Is  the 
old  adage  “A  picture  is  worth  a  thousand  words”  an  exaggeration  or  an  understatement  of  the 
reality?  Use  data  in  Prob.  14.1-2  to  estimate  the  information  of  a  picture. 

14.1- 4  From  the  town  of  the  Old  North  Church  in  Boston,  Paul  Revere's  friend  was  to  show  one  lantern 

if  the  British  army  began  advancing  overland  and  two  lanterns  if  they  had  chosen  to  cross  the 

bay  in  boats. 

(a)  Assume  that  Revere  had  no  way  of  guessing  ahead  of  time  what  route  the  British  might 
choose.  How  much  information  did  he  receive  when  he  saw  two  lanterns? 

(b)  What  if  Revere  were  90%  sure  the  British  would  march  overland?  Then,  how  much 
information  would  the  two  lanterns  have  conveyed? 


14  1-5  Estimate  the  information  per  letter  in  the  English  language  by  various  methods,  assuming  that 

each  character  is  independent  of  the  others.  (This  is  not  true,  but  is  good  enough  to  get  a  rough 

idea.) 

(a)  In  the  first  method,  assume  that  all  27  characters  (26  letters  and  a  space)  are  equiprobable. 
This  is  a  gross  approximation,  but  good  for  a  quick  answer. 

(b)  In  the  second  method,  use  the  table  of  probabilities  of  various  characters  (Table  PI 4. 1-5). 

(c)  Use  Zipf’s  law  relating  the  word  rank  to  its  probability.  In  English  prose,  if  we  order  words 
according  to  the  frequency  of  usage  so  that  the  most  frequently  used  word  {the)  is  word 
number  f  (rank  1),  the  next  most  probable  word  {of)  is  number  2  (rank  2),  and  so  on.  then 
empirically  it  is  found  that  P(r),  the  probability  of  the  rth  word  (rank  r)  is  very  nearly 

0.1 

P(r)  =  — 
r 


Now  use  Zipf’s  law  to  compute  the  entropy  per  word.  Assume  that  there  are  8727  word. 
The  reason  for  this  number  is  that  the  probabilities  P(r)  sum  to  I  for  r  from  I  to  8727  Zipf 
law  surprisingly,  gives  reasonably  good  results.  Assuming  there  are  5.5  letters  (mclud.ng 
space)  per  word  on  the  average,  determine  the  entropy  or  information  per  letter. 

coven  messages  with  probabilities  1/2,  1/4,  1/8,  1/16,  1/32.  1/64.  and  1/64. 
,4-2'1  the  source.  Obtain  the  compact  binary  code  and  find  the  average 

length  of  the  codeword.  Determine  the  efficiency  and  the  redundancy  of  the  code. 

.,  cpven  messages  with  probabilities  1/3, 1/3, 1/9,  1/9, 1/27.  1/27.  and  l/27,respec- 

14.2-2  A  Qf^^source^Obliiin  the  compile  3-aiy  code  and  find  'he  average  lenglh 

Sfcvord.  DeLmioe  lh=  ef"™"'*  ”d  ““  redundancy  of  the  code. 

.,  onp  nf  four  messages  randomly  every  microsecond.  The  probabilities  of  these 
,4.2-3  A  source  emirs  »“  f ,g  MesslgCs  am  f.nerured  mdependemly. 
messages  are  u.j,  v ^  ^ 
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idDie  r.  i  i  - j 

Probability  of  Occurrence  of  letters  in  the  English 
Language 


Letter 

Probability 

-  log  Pi 

Space 

0.187 

2.46 

E 

0.1073 

3.22 

T 

0.0856 

3.84 

A 

0.0668 

3.90 

0 

0.0654 

3.94 

N 

0.0581 

4.11 

R 

0.0559 

4.16 

I 

0.0519 

4.27 

S 

0.0499 

4.33 

H 

0.04305 

4.54 

D 

0.03100 

5.02 

L 

0.02775 

5.17 

F 

0.02395 

5.38 

C 

0.02260 

5.45 

M 

0.02075 

5.60 

U 

0.02010 

5.64 

G 

0.01633 

5.94 

Y 

0.01623 

5.95 

P 

0.01623 

5.95 

W 

0.01620 

6.32 

B 

0.01179 

6.42 

V 

0.00752 

7.06 

K 

0.00344 

8.20 

X 

0.00136 

9.54 

J 

0.00108 

9.85 

Q 

0.00099 

9.98 

z 

0.00063 

10.63 

(a)  What  is  the  source  entropy? 

(b)  Obtain  a  compact  binary  code  and  determine  the  average  length  of  the  codeword,  the 
efficiency,  and  the  redundancy  of  the  code. 

(c)  Repeat  part  (b)  for  a  compact  ternary  code. 

14.2- 4  For  the  messages  in  Prob.  14.2-1 ,  obtain  the  compact  3-ary  code  and  find  the  average  length  i 

the  codeword.  Determine  the  efficiency  and  the  redundancy  of  this  code. 

14.2- 5  For  the  messages  in  Prob.  14.2-2,  obtain  the  compact  binary  code  and  find  the  average  length 

of  the  codeword.  Determine  the  efficiency  and  the  redundancy  of  this  code. 

14.2- 6  A  source  emits  three  equiprobable  messages  randomly  and  independently. 

(a)  Find  the  source  entropy.  ^ 

(b)  Find  a  compact  ternary  code,  the  average  length  of  the  codeword,  the  code  efficiency, 
the  redundancy. 

(c)  Repeat  part  (b)  for  a  binary  code. 
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Figure 

P.14.4-2 


(d)  To  improve  the  efficiency  of  a  binary  code,  we  now  code  the  second  extension  of  the  source. 
Find  a  compact  binary  code,  the  average  length  of  the  codeword,  the  code  efficiency,  and 
the  redundancy. 

14.4-1  A  binary  channel  matrix  is  given  by 


Outputs 


Inputs 


x\ 


*2 


3 

1 

To 

V 


7\ 

3 

9_ 

To 


This  means  /Wyilx,)  =  2/3,  Py\M*\)  =  'A  etc.  You  are  also  given  that  Px(x,)  =  1/3 
and  px(jC2)  =  2/3.  Determine  tf(x).  tf(x|y),  H( y).  W(y|x).  and  /(x;  y). 


14.4-2  For  the  ternary  channel  in  Fig.  PI  4.4-2,  P  x(*i)  =  P,  PM  =  PM  =  QANote:  P+2Q  -  1.) 


Input 

*1* 


Output 

-V'l 


(a)  Determine  tf(x).  W(x|y),  ff(y).  and  /(x;  y). 

(b)  Show  that  the  channel  capacity  Cs  is  given  by 


14.4-3 


where  p  =  ^ 

Csider  the  binam  synnneiric  channei  shown  in  Fig.  PI4.4-3,.  The  channel  main,  is  given  b, 


M  = 


[ 


-Pe 

Pe 


Figure  P14.4-3b  shows  a  cascade  of  two  such  BSCs. 
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(a)  Determine  the  channel  matrix  for  the  cascaded  channel  in  Fig.  P14.4-3b.  Show  that  this 
matrix  is  Af  . 

(b)  If  the  two  BSC  channels  in  Fig.  P14.4-3b  have  error  probabilities  Pe\  and  Pe 2,  with  channel 
matrices  M\  and  M2,  respectively,  show  that  the  channel  matrix  ot  the  cascade  ot  these  two 
channels  is  M\Mi. 

(c)  Use  the  results  in  part  (b)  to  show  that  the  channel  matrix  for  the  cascade  of  k  identical 
BSCs  each  with  channel  matrix  M  isM*.  Verify  your  answer  for  n  =  3  by  confirming  the 
results  in  Example  8.7. 

(d)  Use  the  result  in  part  (c)  to  determine  the  channel  capacity  for  a  cascade  of  k  identical  BSC 
channels  each  with  error  probability  Pe. 

14.4-4  In  data  communication  using  error  detection  code,  as  soon  as  an  error  is  detected,  an  automatic 
request  for  retransmission  ( ARQ)  enables  retransmission  of  the  data  in  error.  In  such  a  channel, 
the  data  in  error  is  erased.  Hence,  there  is  an  erase  probability  /?,  but  the  probability  of  error  is 
zero.  Such  a  channel,  known  as  a  binary  erasure  channel  (BEC),  can  be  modeled  as  shown 
in  Fig.  P14.4-4.  Determine  //(x),  77(x |y),  and  7(x;  y)  assuming  the  two  transmitted  messages 
equiprobable. 


14.4-5  A  cascade  of  two  channels  is  shown  in  Fig.  P14.4-5.  The  symbols  at  the  source,  at  the  output  of 
the  first  channel,  and  at  the  output  of  the  second  channel  are  denoted  by  x,  y,  and  z.  Show  t  a 


H (x|z)  >  77 (x|y) 


and 


7(x;y)  >  7(x;z) 

This  shows  that  the  information  that  can  be  transmitted  over  a  cascaded  channel  can  be  no  greate 
than  that  transmitted  over  one  link.  In  effect,  information  channels  tend  to  leak  information. 

Hint:  For  a  cascaded  channel,  observe  that 


P(Zk\yj,*i)=P(tk\yj) 
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Hence,  by  Bayes'  rule. 


P(*k\yj>zk)  =  ?(*i\yj) 


Figure 

P.14.4-5 


x 


y 


z 


14.5- 1  For  a  continuous  random  variable  x  constrained  to  a  peak  magnitude  M  (  M  <  x  <  M ), 

show  that  the  entropy  is  maximum  when  x  is  uniformly  distributed  in  the  range 

and  has  zero  probability  density  outside  this  range.  Show  that  the  maximum  entropy  is  given 

by  log  2 M . 

14.5- 2  For  a  continuous  random  variable  x  constrained  to  only  positive  values  0  <  x  <  oo  and  a  mean 

value  A,  show  that  the  entropy  is  maximum  when 


px(jc)  =  -e  X/Au(x) 
A 


Show  that  the  corresponding  entropy  is 

H(\)  =  log^A 


channel  if  the  SNR  at  the  receiver  is  required  to  be  at  least  50  dB. 

14.7-1  In  a  communication  system  over  a  frequency-selective  channel  with  transfer  function 


2 


H(f)  1  +  jn(f  /200) 


the  input  signal  PSD  is 


The  channel  noise  is  AWGN  with  spectrum  Sn{f  )  =  10'2.  Find  the  mutual  information  between 
the  channel  input  and  the  channel  output. 

14.7-2  A  communication  channel  has  transfer  function 


The  channel  noise  is 
P  =  10  W. 


is  AWGN  with  spectrum  Sn{f)  =  10~2.  The  transmitter  power  constraint  is 


(a,  Find  the  optimum  input  signal  PSD  under  the  power  constraint. 
(b)  Find  the  maximum  channel  capacity  under  the  power  constraint. 


14.8-1  A  2  x 


2  flat  fading  channel  matrix  is  given  by 
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The  input  signal  is  Gaussian  with  zero  mean  and  power  constraint  P  =  1  The  channel  i 
Gaussian  with  zero  mean  and  variance  a~  =  0. 1 

(a)  Calculate  the  MIMO  channel  capacity,  assuming  that  the  channel  knowledge  is  un 
to  the  receiver. 

(b)  Compared  with  a  single  channel  H  =  1.  quantify  the  capacity  improvement  of  th< 
MIMO  channel. 

14.8- 2  For  the  2  x  2  MIMO  channel  system  in  Prob.  14.8-1.  find  the  channel  capacity  if  the  < 

knowledge  is  known  to  the  transmitter  for  optimum  power  loading  based  on  water  fillin 

14.8- 3  Show  that  if  B  is  diagonal  and  Q  is  positive  semidetinite.  then  the  matrix 

I +  BQB 


is  also  positive  semidefinite. 


ERROR  CORRECTING 
CODES 


A  seen  from  the  discussion  in  Chapter  14,  the  key  to  achieving  error-free  digital  com¬ 
munication  in  the  presence  of  distortion,  noise,  and  interference  is  the  addition  of 
appropriate  redundancy  to  the  original  data  bits.  The  addition  of  a  single  parity  check 
digit  to  detect  an  odd  number  of  errors  is  a  good  example.  Since  Shannon’s  pioneering  paper,1 
a  great  deal  of  work  has  been  carried  out  in  the  area  of  forward  error  correcting  (FEC)  codes. 
In  this  chapter,  we  will  provide  an  introduction;  readers  can  find  much  more  in-depth  coverage 
of  this  topic  from  the  classic  textbook  by  Lin  and  Costello.2 


15.1  OVERVIEW 


Generally  there  are  two  important  classes  of  FEC  codes:  block  codes  and  convolutional  codes. 
In  block  codes  every  block  of  it  data  digits  is  encoded  into  a  longer  codeword  of  n  digits 
/„  >  *)  Every  unique  sequence  of  k  data  digits  fully  determines  a  unique  codeword  of  n 
digits  In  convolutional  codes,  the  coded  sequence  of  n  digits  depends  not  only  on  the  k  data 
digits  but  also  on  the  previous  N  -  1  data  digits  (N  >  1).  Hence,  the  coded  sequence  for  a 
certain  k  data  digits  is  not  unique  but  depends  also  on  N  -  1  earlier  data  digits.  In  short  t  e 
encoder  has  memory.  In  block  codes,  k  data  digits  are  accumulated  and  then  encoded  into  an 
n-digit  codeword.  In  convolutional  codes,  the  encoding  is  done  on  a  continuous  running  basis 

oMhe  capacity  of  noisy  channels  has  yielded  a  famous  result 
.  "  th„  n .  channel  coding  theorem.  This  result  states  that  for  a  noisy  channel  with  a 

capacity^!* there  exist  codes  of  rate  R  <  C  such  that  maximum  likelihood  decoding  can  lead 

to  error  probability 

p  <  2 -nEb(R)  (15.1) 

1,  jt  (R,  is  the  energy  per  information  bit  defined  as  a  function  of  code  rate  if.  This 
where  Eb(R)  is  the  cy  P  „  probability  can  be  achieved  by  increasing 

remarkable  result  rate  constant.  A  similar  result  for  convolutional 

the  block  code  lengt  NPotegthat  this  result  establishes  the  existence  of  good  codes.  It 

codes  was  also  shown  in  ^  L  N  In  fact>  it  is  not  simply  a  question  of  designing 

in SEES.  *-  -  »f  f“  S00d  codeS  ”'i'h 
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length  n  to  reduce  error  probability,  as  well  as  decoders  that  are  simple  to  implement.  The 
best  results  thus  far  are  the  recent  discovery  of  turbo  codes  and  the  rediscovery  of  low-density 
parity  check  (LDPC)  codes,  to  be  discussed  later.  The  former  are  derived  from  convolutional 
codes,  whereas  the  latter  are  a  form  of  block  code. 

Error  correction  coding  requires  a  strong  mathematical  background.  To  provide  a  suf¬ 
ficiently  detailed  introduction  of  various  important  topics  on  this  subject,  we  organize  this 
chapter  according  to  the  level  of  mathematical  background  necessary  for  understanding.  We 
begin  by  covering  the  simpler  and  more  intuitive  block  codes  that  require  the  least  amount  of 
probability  analysis.  We  then  introduce  the  concepts  and  principles  of  convolutional  codes  and 
their  decoding.  Finally,  we  focus  on  the  more  sophisticated  soft-decoding  concept,  which  lays 
the  foundation  for  the  subsequent  coverage  of  recent  progresses  on  high-performance  turbo 
codes  and  low-density  parity  check  codes. 

15.2  REDUNDANCY  FOR  ERROR  CORRECTION 

In  FEC  codes,  a  codeword  is  a  unit  of  bits  that  can  be  decoded  independently.  The  number  ot 
bits  in  codeword  is  known  as  the  code  length.  If  k  data  digits  are  transmitted  by  a  codeword 
of  n  digits,  the  number  of  check  digits  is  m  =  n  —  k.  The  code  rate  is  R  =  k/n.  Such  a  code 
is  known  as  an  (n,k)  code.  Data  digits  (di,d2,  ...»  d*)  are  a  A' -tuple,  and,  hence,  this  is  a 
&-dimensional  vector  d.  Similarly,  a  codeword  (ci,C2,  . . . ,  cn)  is  an  Az-dimensional  vector  c. 
As  a  preliminary,  we  shall  determine  the  minimum  number  of  check  digits  required  to  detect 
or  correct  t  number  of  errors  in  an  (n,  k)  code. 

If  the  binary  code  length  is  /z,  then  a  total  of  2n  code  words  (or  vertices  of  an  zz-dimensional 
hypercube)  is  available  to  assign  to  2k  data  words.  Suppose  we  wish  to  find  a  code  that  will 
correct  up  to  t  wrong  digits.  In  this  case,  if  we  transmit  a  data  word  dj  by  using  one  ot  the 
codewords  (or  vertices)  c/,  then  because  of  channel  errors  the  received  word  will  not  be  cj  but 
will  be  Cy  If  the  channel  noise  causes  errors  in  t  or  fewer  digits,  then  cj  will  lie  somewhere 
inside  the  Hamming  sphere*  of  radius  t  centered  at  Cj.  If  the  code  is  to  correct  up  to  t  errors, 
then  the  code  must  have  the  property  that  all  the  Hamming  spheres  of  radius  t  centered  at  the 
codewords  are  nonoverlapping.  This  means  that  we  must  not  use  vertices  as  codewords  that  are 
within  a  Hamming  distance  t  from  any  codeword.  If  a  received  word  lies  within  a  Hamming 
sphere  of  radius  t  centered  at  Cj,  then  we  decide  that  the  true  transmitted  codeword  was  cr 
This  scheme  is  capable  of  correcting  up  to  t  errors,  and  dm in,  the  minimum  distance  between 
t  error  correcting  codewords  without  overlapping,  is 

z/min=2f+l  (152) 

Next,  to  find  a  relationship  between  n  and  k ,  we  observe  that  2n  vertices,  or  words,  are 
available  for  2k  data  words.  Thus,  there  are  2n  —  2k  redundant  vertices.  How  many  vertices, 
or  words,  can  lie  within  a  Hamming  sphere  of  radius  tl  The  number  of  sequences  (ot  n  digi s 
that  differ  from  a  given  sequence  by  j  digits  is  the  number  of  possible  combinations  of  n  things 

taken  j  at  a  time  and  is  given  by  ^  rj  ^  [Eq.  (8. 1 6)].  Hence,  the  number  of  ways  in  which  up  to 
t  errors  can  occur  is  given  by 
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Thus  for  each  codeword,  we  must  leave 


go 

vertices  (words)  unused.  Because  we  have  2*  codewords,  we  must  leave  a  total  of 

40 


words  unused.  Therefore,  the  total  number  of  words  must  at  least  be 


2k+2k 


£0-4(5) 

j= 1  KJ/  7=0  XJ/ 


But  the  total  number  of  words,  or  vertices,  available  is  2n.  Thus,  we  require. 


or 


r>2‘£(") 

7=0  v// 

^40 

J=0 


(15.3a) 


Observe 


that  n-k  =  m  is  the  number  of  check  digits.  Hence,  Eq.  ( 1 5.3a)  can  be  expressed  as 


40 

7=0  XJ/ 


(15.3b) 


This  is  known  „s  the  Hamming  bound.  It  should  also  be  remembered  .but  the  Humming  bound 
Th  k  irv  hut  not  a  sufficient  condition  in  general.  However,  for  single-error  correcting 
is  a  necessary  sufficient  condition.  If  some  m  satisfies  the  Hamming  bound,  it  does 

Table 

*-?  is  known  -  *  ^ 
A  C  d  i  •  Ira  the  Hamming  spheres  (about  all  the  codewords)  not  only  are  nonover¬ 
code.  In  such  a  co  2n  vertices  leaving  no  vertex  outside  some  sphere.  An  e-error 

lapping  but  they  exhau  ^  condjtion  that  every  possible  (received)  sequence  is  at  a 

correcting  perfect  codeWord.  Perfect  codes  exist  in  only  a  comparatively  few  cases, 

distance  at  most  e  from  Hamming  codes.  For  a  Hamming 

Rinarv  single-error  correcting,  perreci  coues 
cS  =  !  and  =  3.  aud  hum  Eq.  (15.3b)  we  have 


40- 

7=0  X 


1  +n 


(15.4) 
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TABLE  15.1 

Some  Examples  of  Error  Correcting  Codes 


n 

k 

Code 

Code  Efficiency 
(or  Code  Rate) 

Single-error  correcting,  t  =  1 

3 

1 

(3,  1) 

0.33 

Minimum  code  separation  3 

4 

1 

(4.  1) 

0.25 

5 

2 

(5. 2) 

0.4 

6 

3 

(6,  3) 

0.5 

7 

4 

(7,4) 

0.57 

15 

11 

(15,  11) 

0.73 

31 

26 

(31,26) 

0.838 

Double-error  correcting,  t  =  2 

10 

4 

(10,4) 

0.4 

Minimum  code  separation  5 

15 

8 

(15,  8) 

0.533 

Triple-error  correcting,  t  =  3 

10 

2 

(10,2) 

0.2 

Minimum  code  separation  7 

15 

5 

(15.  5) 

0.33 

23 

12 

(23,  12) 

0.52 

and 


n  =  2m  -  1 

Thus.  Hamming  codes  are  («,  k)  codes  with  n  =  2m  -  1  and  it  =  2m  -  1  -  m  and  minimum 
distance  dm\n  =  m.  In  general,  we  often  write  Hamming  code  as  (2W  -  I,  2m  -  1  -  m,  m) 
code.  One  of  the  most  well-known  Hamming  codes  is  the  (7,  4.  3)  code. 

Another  way  of  correcting  errors  is  to  design  a  code  to  detect  (not  to  correct)  up  to  t  errors. 
When  the  receiver  detects  an  error,  it  can  request  retransmission.  This  mechanism  is  known  as 
automatic  repeat  request  (or  ARQ).  Because  error  detection  requires  fewer  check  digits,  these 
codes  operate  at  a  higher  rate  (efficiency). 

To  detect  t  errors,  codewords  need  to  be  separated  by  a  Hamming  distance  of  at  least 
t  +  I-  Otherwise,  an  erroneously  received  bit  string  with  up  to  t  error  bits  could  be  another 
transmitted  codeword.  Suppose  a  transmitted  codeword  Cj  contains  cr  bit  errors  (a  <  /).  Then 
the  received  codeword  Cj  is  at  a  distance  of  a  from  Cj.  Because  a  <  t,  however,  c  can  never 
be  any  other  valid  codeword,  since  all  codewords  are  separated  by  at  least  t  +  1 .  Thus,  the 
reception  of  cj  immediately  indicates  that  an  error  has  been  made.  Thus,  the  minimum  distance 
dm in  between  t  error  detecting  codewords  is 

^min  =  /  +  1 


In  presenting  coding  theory,  we  shall  use  modulo-2  addition,  defined  by 

101 =000=0 
0  0  1  =  1  ©  o  =  1 

This  is  also  known  as  the  exclusive  OR  (XOR)  operation  in  digital  logic.  Note  that  the  modulo- 
2  sum  of  any  binary  digit  with  itself  is  always  zero.  All  the  additions  in  the  mathematical 
development  of  binary  codes  presented  henceforth  are  modulo-2. 
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15.3  LINEAR  BLOCK  CODES 


A  codeword  consists  of  n  digits  c\ ,  ci,  ....  cn ,  and  a  data  word  consists  of  k  digits  d\,di, 
dk-  Because  the  codeword  and  the  data  word  are  an  n-tuple  and  a  /.-tuple,  respectively,  they 
are  n-  and  /-dimensional  vectors.  We  shall  use  row  vectors  to  represent  these  words: 


C  (C| ,  C2,  ....  Cn) 

d  =  (d\,  di,  ....  dk) 


For  the  general  case  of  linear  block  codes,  all  the  n  digits  of  c  are  formed  by  linear  combinations 
(modulo-2  additions)  of  k  data  digits.  A  special  case  in  which  cm  =  d\,ci  =  di,  . . . ,  c*  =  d * 
and  the  remaining  digits  from  c*+t  to  c„  are  linear  combinations  of  d\ ,  dj,  . . . ,  dk,  is  known 
as  a  systematic  code.  In  a  systematic  code,  the  leading  k  digits  of  a  codeword  are  the  data  (or 
information)  digits  and  the  remaining  m  =  n-k  digits  are  the  parity  check  digits,  formed  by 
linear  combinations  of  data  digits  d\,  di,  .  •  • ,  dk'. 


c\  =  d\ 
ci  =  di 


Ck  =  dk 

c*+i  =  h\\d\  ®  h\idi  ®  •  •  •  ©  h\kdk 
Ck+ 2  =  hi\d\  ©  hudi  ©  •  •  •  ®  fakdk 


(15.5a) 


c„  =  h,„\d\  ©  hmidi  ©  •  •  •  ©  hmkdk 


or 


c=dG 


(15.5b) 


where 


1  0  0  •••  0 

0  1  0  •••  0 


h\\  hi\  •••  hm\ 
h\i  hu  •••  hm2 


(15.6) 


G  = 


:  0 

0  0  0  •••  1 


:  0 

h\l[  hlk  hmk 


P(k  x  m) 


The  k  x  n  matrix  G  is 
into  a  k  x  k  identity  r 


s  called  the  generator  matrix.  For  systematic  codes,  G  can  be  partitioned 
matrix  h  and  a  kxm  matrix  P  The  elements  of  P  are  either  0  or  I .  The 
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codeword  can  be  expressed  as 


c=dG 
=  d[Ik  P ] 

=  [d  dP] 

=  [d  cp]  (15.7) 

where  the  check  digits,  also  known  as  the  checksum  bits  or  parity  bits,  are 

cp=dP  (15.8) 

Thus,  knowing  the  data  digits,  we  can  calculate  the  check  digits  from  Eq.  (15.8)  and  conse¬ 
quently  the  codeword  cp.  The  weight  of  the  codeword  c  is  the  number  of  Is  in  the  codeword. 
The  Hamming  distance  between  two  codewords  ca  and  cb  is  the  number  of  elements  by  which 
they  differ,  or 


d(ca .  cb)  =  weight  of  (ca  ©c*) 


Example  15.1  For  a  (6,  3)  code,  the  generator  matrix  G  is 


G  = 


1  0  0 
0  1  0 
0  0  1 


4 


1  0  1 
0  1  1 
1  1  0 


p 


For  all  eight  possible  data  words,  find  the  corresponding  codewords,  and  verify  that  this  code 
is  a  single-error  correcting  code. 


Table  15.2  shows  the  eight  data  words  and  the  corresponding  codewords  formed  from 
c  =dG. 


TABLE  15.2 


Data  Word  d 

Codeword  c 

111 

111000 

110 

110110 

101 

101011 

100 

100101 

Oil 

011101 

010 

010011 

001 

001110 

000 

000000 

Note  that  the  distance  between  any  two  codewords  is  at  least  3.  Hence,  the  code  can 
correct  at  least  one  error.  The  possible  encoder  for  this  code  shown  in  Fig.  15.1  uses  a 
three-digit  shift  register  and  three  modulo-2  adders. 
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Figure  15.1 

Encoder  for 
linear  block 
codes. 


Linear  Codes 

A  block  code  is  a  linear  block  code  if  for  every  pair  of  codewords  c„  and  c/,  from  the 
block  code. 


c„  ©  Cfc 


is  also  a  codeword.  For  this  reason,  linear  codes  must  have  an  all-zero  codeword  000  •  •  •  00. 
For  linear  codes,  the  minimum  distance  equals  the  minimum  weight. 


Decoding 

Let  us  consider  some  codeword 
From  Eq.  (15.8)  and  the  fact  that 


properties  that  could  be  used  for  the  purpose  of  decoding, 
the  modulo-2  sum  of  any  sequence  with  itself  is  zero,  we  get 


(15.9) 


where  I,„  is  the  identity  matrix  of  order  m  x  m  (m  -  n  k).  Thus, 

cHt  =  0 


(15.10a) 


where 


(15.10b) 


and  its  transpose 


H  =  \P'  I  m\ 


(15.10c) 


. .  .wk  matrix  Every  codeword  must  satisfy  Eq.  ( 1 5. 1 0a).  This  is  our  clue 
“  toS-  Consider  .he  received  word  r.  Because  of  possible  errors  caused  by  channel  noise, 
r  in  general  differs  from  the  transmitted  codeword  c, 


r  =c  ®e 
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where  the  error  word  (or  error  vector)  e ,  is  also  a  row  vector  of  n  elements.  For  example,  if 
the  data  word  100  in  Example  15.1  is  transmitted  as  a  codeword  100101  (see  Table  15.2),  and 
the  channel  noise  causes  a  detection  error  in  the  third  digit,  then 


r  =  101101 


c  =  100101 


and 


£>  =  001000 


Thus,  an  element  1  in  e  indicates  an  error  in  the  corresponding  position,  and  0  indicates  no 
error.  The  Hamming  distance  between  r  and  c  is  simply  the  number  of  Is  in  e. 

Suppose  the  transmitted  codeword  is  c,  and  the  channel  noise  causes  an  error  ex,  making 
the  received  word  r  =  c,  +e,  .  If  there  were  no  errors,  that  is,  if  ex  were  000000,  then  we  would 


have  rH  7  =  0.  But  because  of  possible  channel  errors,  rH 1  is  in  general  a  nonzero  row  vector 
s,  called  the  syndrome: 


(15.11a) 


(15.11b) 


Receiving r ,  we  can  compute  the  syndrome  s  [Eq.  ( 15. 1  la)]  and  presumably  we  can  compute 
ei  from  Eq.  (15.1  lb).  Unfortunately,  knowledge  of  s  does  not  allow  us  to  solve  ex  uniquely. 
This  is  because  r  can  also  be  expressed  in  terms  of  codewords  other  than  c/.  Thus, 


r  =cj®  ej  j  ^  i 


Hence, 


s  =  (Cj®ej)Hr  =ejHr 


Because  there  are  2k  possible  codewords. 


is  satisfied  by  2k  error  vectors.  In  other  words,  the  syndrome  s  by  itself  cannot  define  a  unique 


error  vector.  For  example,  if  a  data  word  d  =  100  is  transmitted  by  a  codeword  100101  in 
Example  15.1,  and  if  a  detection  error  is  caused  in  the  third  digit,  then  the  received  word  is 
101101.  In  this  case  we  have  c  =  100101  and  e  =  001000.  But  the  same  word  could  have 
been  received  if  c  =  101011  and  e  =  000110,  or  if  c  =  010011  and  e  =  111110,  and  so 
on.  Thus,  there  are  eight  possible  error  vectors  (2*  error  vectors)  that  all  satisfy  Eq.  (15.1 1  b). 
Which  vector  shall  we  choose?  For  this,  we  must  define  our  decision  criterion.  One  reasonable 
criterion  is  the  maximum  likelihood  rule  according  to  which,  if  we  receive  r,  then  we  decide 
in  favor  ot  that  c  for  which  r  is  most  likely  to  be  received.  In  other  words,  we  decide  ci 
transmitted"  if 


P{r\Ci)  >  P{r\ck)  all  A:  ^  i 

For  a  binary  symmetric  channel  (BSC),  this  rule  gives  a  very  simple  answer.  Suppose  the 
Hamming  distance  between  r  and  cx  is  J;  that  is,  the  channel  noise  causes  errors  in  d  digits* 
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Then  if  Pe  is  the  digit  error  probability  of  a  BSC, 

P(r\Ci)  =  /*(i  -  per~d  =  (1  -  Pe)n 


If  Pe  <  0.5  holds  for  a  reasonable  channel,  then  P(r\cj)  is  a  monotonically  decreasing  function 
of  d  because  Pt >/(l  —  Pe)  <  1.  Hence,  to  maximize  P(r|c,),  we  must  choose  that  a  which  is 
closest  to  r;  that  is,  we  must  choose  the  error  vector  e  with  the  smallest  number  of  Is.  A  vector 
with  e  the  smallest  number  of  Is  is  called  the  minimum  weight  vector.  This  minimum  weight 
error  vector  emin  will  be  used  to  correct  the  error  in  r  via 


Example  15.2  A  linear  (6,  3)  code  is  generated  according  to  the  generating  matrix  in  Example  15.1.  The 


receiver  receives  r  =  100011.  Determine  the  corresponding  data  word  if  the  channel  is  a  BSC 
and  the  maximum  likelihood  decision  is  used. 


We  have 


1  0  1 
0  1  1 


=  [1  0  0  0  1  1]  •  |  Q  [J 


0  1  0 
0  0  1 


=  [1  1  0| 


Because  for  modulo-2  operation,  subtraction  is  the  same  as  addition,  the  correct  transmitted 
codeword  c  is  given  by 


c  =  r®e 


where  e  satisfies 


s  =  [l  1  0|  =  eHr 


I  0  I 
0  1  1 
I  1  0 

I  0  0 
0  1  0 
0  0  1 


=  [e\  e2  ^4  ^5  e(>]  * 


w,»  that  e  =  001000  satisfies  this  equation.  But  so  does  *  =  000110.  or  010101.  or 
011011  or  111110,  or  110000,  or  101101,  or  100011.  The  suitable  choice,  the  minimum 

weight  if  min,  is  001000.  Hence, 


c  =  1000110001000  =  101011 
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TABLE  15.3 

Decoding  Table  for 
Code  in  Table  1 5.2 


e 

5 

000000 

000 

100000 

101 

010000 

Oil 

001000 

110 

000100 

100 

000010 

010 

000001 

001 

100010 

111 

The  decoding  procedure  just  described  is  quite  disorganized.  To  make  it  systematic,  we 
would  consider  all  possible  syndromes  and  for  each  syndrome  associate  a  minimum  weight 
error  vector.  For  instance,  the  single-error  correcting  code  in  Example  1 5. 1  has  a  syndrome  with 
three  digits.  Hence,  there  are  eight  possible  syndromes.  We  prepare  a  table  of  minimum  weight 
error  vectors  corresponding  to  each  syndrome  (see  Table  15.3).  This  table  can  be  prepared  by 
considering  all  possible  minimum  weight  error  vectors  and  using  Eq.  ( 15.1  lb)  to  compute  s 
for  each  of  them.  The  first  minimum  weight  error  vector  000000  is  a  trivial  case  that  has  the 
syndrome  000.  Next,  we  consider  all  possible  unit  weight  error  vectors.  There  are  six  such 
vectors:  100000.  010000.  001000,  000100,  000010,  000001.  Syndromes  for  these  can  readily 
be  calculated  from  Eq.  (15.1  lb)  and  tabulated  (Table  15.3).  This  still  leaves  one  syndrome, 
111,  that  is  not  matched  with  an  error  vector.  Since  all  unit  weight  error  vectors  are  exhausted, 
we  must  look  for  error  vectors  of  weight  2. 

We  find  that  for  the  first  seven  syndromes  (Table  15.3),  there  is  a  unique  minimum  weight 
vector  e.  But  for  s  =  111,  the  error  vector  e  has  a  minimum  weight  of  2,  and  it  is  not  unique. 
For  example,  e  =  100010  or  010100  or  001001  all  have  5  =  111,  and  all  three  e  are  minimum 
weight  (weight  2).  In  such  a  case,  we  can  pick  any  one  e  as  a  correctable  error  pattern.  In 
Table  15.3,  we  have  picked  e  =  100010  as  the  double-error  correctable  pattern.  This  means  the 
present  code  can  correct  all  six  single-error  patterns  and  one  double-error  pattern  (100010).  For 
instance,  if  c  =  101011  is  transmitted  and  the  channel  noise  causes  the  double  error  100010, 
the  received  vector  r  =  001001,  and 

s  =  rHT  =  [111] 

From  Table  15.3  we  see  that  corresponding  to  s  =  111  is  e  =  100010,  and  we  immediately 
decide  c  =  r®c  =  101011.  Note,  however,  that  this  code  will  not  correct  double-error 
patterns  except  for  100010.  Thus,  this  code  corrects  not  only  all  single  errors  but  one  double¬ 
error  pattern  as  well.  This  extra  bonus  of  one  double-error  correction  occurs  because  n  and 
k  oversatisfy  the  Hamming  bound  [Eq.  (15.3b)].  In  case  n  and  k  were  to  satisfy  the  bound 
exactly,  we  would  have  only  single-error  correction  ability.  This  is  the  case  for  the  (7, 4)  code, 
which  can  correct  all  single-error  patterns  only. 

Thus  for  systematic  decoding,  we  prepare  a  table  of  all  correctable  error  patterns  and  the 
corresponding  syndromes.  For  decoding,  we  need  only  calculate  s  =rHT  and,  from  the  decod¬ 
ing  table,  find  the  corresponding  e.  The  decision  is  c  =  r  ©  e.  Because  s  has  ni  =  n  —  k  digits 
there  is  a  total  of  2n~k  syndromes,  each  consisting  of  n  -  k  digits.  There  is  the  same  number 
of  correctable  error  vectors,  each  of  n  digits.  Hence,  for  the  purpose  of  decoding,  we  need  a 
storage  of  (2 n  —  k) 2n  k  =  (2 n  —  k)2m  bits.  This  storage  requirement  grows  exponentially 
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with  m ,  and  the  number  of  parity  check  digits  can  be  enormous,  even  for  moderately 
complex  codes. 

Constructing  Hamming  Codes 

It  is  still  not  clear  how  to  design  or  choose  coefficients  of  the  generator  or  parity  check  matrix. 
Unfortunately,  there  is  no  general  systematic  way  to  design  codes,  except  for  the  few  special 
cases  such  as  cyclic  codes  and  the  class  of  single-error  correcting  codes  known  as  Hamming 
codes.  Let  us  consider  a  single-error  correcting  (7,  4)  code.  This  code  satisfies  the  Hamming 
bound  exactly,  and  we  shall  see  that  a  proper  code  can  be  constructed.  In  this  case  m  =  3,  and 
there  are  seven  nonzero  syndromes,  and  because  n  =  7,  there  are  exactly  seven  single-error 
patterns.  Hence,  we  can  correct  all  single-error  patterns  and  no  more.  Consider  the  single-error 
pattern  e  =  1000000.  Because 


s=eHT 


eHT  will  be  simply  the  first  row  of  HT.  Similarly,  for  e  =  0100000.  s  =  eH  1  will  be  the 
second  row  of  HT,  and  so  on.  Now  for  unique  decidability,  we  require  that  all  seven  syndromes 
corresponding  to  the  seven  single-error  patterns  be  distinct.  Conversely,  il  all  seven  syndromes 
are  distinct,  we  can  decode  all  the  single-error  patterns.  This  means  that  the  only  requirement 
on  Ht  is  that  all  seven  of  its  rows  be  distinct  and  nonzero.  Note  that  H1  is  an  (n  x  n  -  k) 
matrix  (i.e.,  7  x  3  in  this  case).  Because  there  exist  seven  nonzero  patterns  of  three  digits,  it  is 
possible  to  find  seven  nonzero  rows  of  three  digits  each.  There  are  many  ways  in  which  these 
rows  can  be  ordered.  But  we  emphasize  that  the  three  bottom  rows  must  form  the  identity 

matrix  lm  [see  Eq.  (15.10b)]. 

One  possible  form  of  H 1  is 


Ht 


"i  i  r 
1  1  0 
1  0  1 
0  1  1 
1  0  0 
0  1  o 
0  0  1 


p 

l  m 


The  corresponding  generator  matrix  G  is 


G  =  [Ik  P\  = 


'10  0  0  1 

0  10  0  1 

0  0  10  1 

0  0  0  10 


1 

1 

0 

1 


1 

0 

1 

1 


.  u  J  _  mi  1  the  corresponding  code  word  c  =  1011001,  and  so  forth. 

Thus  when  •  code  has  ^-dimensional  syndrome  vectors  (m  =  n  -  k).  Hence, 

.  A  gC",r  ,  distinct  nonzero  syndrome  vectors  that  can  correct  2"  -  1  single-error  patterns. 
Because  in  an  («,*)  code  there  are  exactly  «  single-error  patterns,  all  these  single  errors  can 

be  corrected  if 


2m  -  1  >  n 
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This  is  precisely  the  condition  in  Eq.  (15.4)  for  t  —  I.  Thus,  for  any  ( n.k )  satisfying  this 
condition,  it  is  possible  to  construct  a  single-error  correcting  code  by  the  procedure  discussed. 
To  summarize,  a  (2m  -  1,  2"'  —  1  —  m,  m)  Hamming  code  has  the  following  attributes: 


Number  of  parity  bits 
Code  length 
Number  of  message  bits 
Minimum  distance 


Error  correcting  capability  t  =  1 

For  more  discussion  on  block  coding,  the  readers  should  consult  the  books  by  Peterson 
and  Weldon3  and  by  Lin  and  Costello.2 


15.4  CYCLIC  CODES 


Cyclic  codes  are  a  subclass  of  linear  block  codes.  As  seen  before,  a  procedure  for  selecting  a 
generator  matrix  is  relatively  easy  for  single-error  correcting  codes.  This  procedure,  however, 
cannot  carry  us  very  far  in  constructing  higher  order  error  correcting  codes.  Cyclic  codes 
satisfy  a  nice  mathematical  structure  that  permits  the  design  of  higher  order  correcting  codes. 
Second,  tor  cyclic  codes,  encoding  and  syndrome  calculations  can  be  easily  implemented  by 
using  simple  shift  registers. 

In  cyclic  codes,  the  codewords  are  simple  lateral  cyclic  shifts  of  one  another.  For 
example,  if  c  =  (cj,  ci,  . . . ,  c„_ i,  c„ )  is  a  codeword,  then  so  are  (C2,  C3,  . . . ,  c„,  ci), 
(03,  C4,  . . . ,  c„,  ci,  C2),  and  so  on.  We  shall  use  the  following  notation.  If 


c  =  (Cl,  C2,  ....  C„) 


(15.12a) 


is  a  code  vector  of  a  code  C,  then  ctn  denotes  c  shifted  cyclically  1  places  to  the  left,  that  is, 


(15.12b) 


Cyclic  codes  can  be  described  in  a  polynomial  form.  This  property  is  extremely  useful 
in  the  analysis  and  implementation  of  these  codes.  The  code  vector  c  in  Eq.  (15.12a)  can  be 
expressed  as  the  ( n  —  l)-degree  polynomial 


c(x)  =cixn  1  +  c2jc"-2  +  •  •  •  +  cn 


(15.13a) 


The  coefficients  of  the  polynomial  are  either  0  or  1,  and  they  obey  the  following  properties: 


0+0=0  0x0=0 

0  +  1  =  1  +  0  =  1  0  x  1  =  1  x  0  =  0 

1+1=0  1x1=1 


1x1  =  1 


The  code  polynomial  cw(jc)  for  the  code  vector  c(/)  in  Eq.  (15.12b)  is 

c('V)  =  cI+1y-'  +  c,+2y-2  +  ■  •  •  +  cx*  +  cjc'-'  +  •  •  •  +  c(-  (15.13b) 
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One  of  the  interesting  properties  of  code  polynomials  is  that  when  x'c(x)  is  divided  by  .v"  +  1 , 
the  remainder  is  c(,\x).  We  can  verify  this  property  as  follows: 


xc(x)  =  ci/1  +  C2x"  1  H - 1-  c„x 


c  1 


x"  +  l[c|JCn  +  C2Xn  1  + 
C\x"  + 


+  c„x 


C’l 


C2X"  1  +  C3Jt"  2  +  •  •  •  +  C„x  +  Cl 
remainder 


The  remainder  is  clearly  c(1)(j c).  In  deriving  this  result,  we  have  used  the  fact  that  subtraction 
amounts  to  summation  when  modulo-2  operations  are  involved.  Continuing  in  this  fashion, 
we  can  show  that  the  remainder  of  x‘c(x)  divided  by  x"  +  1  is  c(,lU). 

We  now  introduce  the  concept  of  code  generator  polynomial  g(x).  Since  each  (n,  k) 
codeword  can  be  represented  by  a  code  polynomial 

c(jc)  =  ci*”_l  +  C2x"~2  H - 1 -cn 

g(x)  is  a  code  generator  polynomial  (of  degree  n  -  k ),  if  for  a  data  polynomial  d(x)  of 
degree  k  —  1 

d( x)  =  d\^~X  +  d2xk~2  H - +  dk 


we  can  generate  code  polynomial  via 


c(x)  =  d(x)  g(x) 


(15.14) 


Notice  that  there  are  2*  distinct  code  polynomials  (or  codewords).  For  cyclic  code,  a  codeword 

after  cyclic  shift  is  still  a  codeword. 

We  shall  now  prove  an  important  theorem  in  cyclic  codes: 

Cyclic  Linear  Block  Code  Theorem:  //girl  Is  a  polynomial of  degree  n  -  k  and  Is 
a  factor  of  **  +  1  (modulo-2),  then  g(x)  is  a  gsnoralor  polynomml  ,ha,  generates  an  (n.  k) 

linear  cyclic  block  code. 

Proof:  For  »  data  vector  (dt.dt . <4),  the  data  polynomial  is 

d(x)  =  dt/~'  +di/~2  +  ••■+«*  (15.151 


Consider  k  polynomials 

g(x),  xg(x) .  ^-xg(x) 

i  i,  i  1  n  -  1  respectively.  Hence,  a  linear  combination  of 
which  have  degrees  n  —  k,  n  —  x  -r  *»•••’ 
these  polynomials  equals 

W  +  d2Xk~2g(x)  4 - F  dkg(x)  =  d(x)  g( x) 


(15.16) 
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Regardless  of  the  data  values  {<//},  d(jc)g(jc)  still  has  degree  n  —  1  or  less  while  being  a 
multiple  of  g(x).  Hence,  a  codeword  is  formed  by  using  Eq.  (15.16).  There  are  a  total  of  2k 
such  distinct  polynomials  (codewords)  of  the  data  polynomial  d(: c),  corresponding  to  2k  data 
vectors.  Thus,  we  have  a  linear  ( n ,  k)  code  generated  by  Eq.  ( 15. 14).  To  prove  that  this  code  is 
cyclic,  let 


c(x)  =  Cix"  1  +  C2Xn  2  H - h  cn 

be  a  code  polynomial  in  this  code  [Eq.  (15.16)].  Then, 
xc(x)  =  cix"  +  C2x"~l  H - h  cnx 

=  C\ ( Xn  +  1)  +  (C2X/I  1  +  C7,Xn  2  +  •  •  •  +  Cnx  +  C\ ) 

=  C\(xn  4-  1)  +c(1)(jc) 

Because  xc(x)  is  xd(x)g(x),  and  g(x)  is  a  factor  of  xtl  +  l,c(l)(.r)  must  also  be  a  multiple 
of  g(x)  and  can  also  be  expressed  as  d(x)g(x)  for  some  data  vector  d.  Therefore,  c(1)(jc) 
is  also  a  code  polynomial.  Continuing  this  way,  we  see  that  c(2)(jc),c(3)(jc),  ...  are  all  code 
polynomials  generated  by  Eq.  (15.16).  Thus,  the  linear  (n,k)  code  generated  by  d(jt)g(;c)  is 
indeed  cyclic.  ■ 


Example  1  5.3  Find  a  generator  polynomial  g(*)  for  a  (7, 4)  cyclic  code,  and  find  code  vectors  for  the  following 
data  vectors:  1010,  1111,  0001,  and  1000. 

In  this  case  n  =  1  and  n  -  k  =  3,  and 

x1  +\  =  (*  +  1  )(*3  +  .r  +  1  )(jc3  +  jc2  +  1 ) 

For  a  (7,  4)  the  generator  polynomial  must  be  of  the  order  n-k  =  3.  In  this  case, 
there  are  two  possible  choices  for  g(jr) :  *3  +  jc  +  1  or  x3  +  x2  +  1 .  Let  us  pick  the  latter, 
that  is. 


g(jr)  =  *3  +  x2  +  1 

as  a  possible  generator  polynomial.  For  d  =  [1  0  1  0], 

d  (x)  =  x3  +  x 


and  the  code  polynomial  is 


c(x)  =  d(x)g(x) 

=  (x3  +x)(x*  +x2  +  1) 
=  x6+x5+x4+x 


Hence, 


c  =  1110010 
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TABLE  15.4 

d 

c 

1010 

1110010 

1111 

1001011 

0001 

0001101 

1000 

1101000 

Similarly,  codewords  for  other  data  words  can  be  found  (Table  15.4).  Note  the  structure 
of  the  codewords.  The  first  k  digits  are  not  necessarily  the  data  bits.  Hence,  this  is  not  a 
systematic  code. 


In  a  systematic  code,  the  first  k  digits  are  data  bits,  and  the  last  m  =  n-  k  digits  are  the 
parity  check  bits.  Systematic  codes  are  a  special  case  of  general  codes.  Our  discussion  thus  far 
applies  to  general  cyclic  codes,  of  which  systematic  cyclic  codes  are  a  special  case.  We  shall 
now  develop  a  method  of  generating  systematic  cyclic  codes. 


Systematic  Cyclic  Codes 

We  shall  show  that  for  a  systematic  code,  the  codeword  polynomial  c(x)  corresponding  to  the 
data  polynomial  d{x)  is  given  by 

C(x)=xn-kd(x)  +  p(x)  (15.17a) 

where  p(x)  is  the  remainder  from  dividing  x"  kd(x)  by  gU), 

„  x"-kd(x)  ,ISI7h> 

p(x)  =  Rem  ,  ~  (15.17b) 


To  prove  this  we  observe  that 


jc ”~kd(x) 
g(x) 


,  .  ,  PW 

=  q(x)  H — — 
«(■*) 


(15.18a) 


where  q(x)  is  of  degree 
because  /  M  +f(x)  =  0 


k  _  i  or  less.  We  add  p(x)/g(x)  to  both  sides  of  Eq.  (15.18a),  and 
under  modulo-2  operation,  we  have 


+  =  (x) 
g(x) 


(15.18b) 


or 


q(x)g(x)=x"-kd(x)  +  p(x) 


(15.18c) 


d(x) 


...  „  thp  order  of  k  -  1  or  less,  q(x)g(x)  is  a  code  polynomial.  As  xT 

Because  q(x)  is  on  the  by  „  -  k  digits,  the  first  k  digits  of  this  codeword  are 

1  las, »  -  *  digte  corresponding  ,o  M  mns,  be  pnri.y  check  digils.  This 
S  teconre  dear  by  considering  a  specific  exanrple. 
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Example  15.4  Construct  a  systematic  (7, 4)  cyclic  code  using  a  generator  polynomial  (see  Example  15.3). 


We  use 


g(x)  =  x3  +  x2  +  1 


Consider  a  data  vector  d  =  1010. 


d  (x)  =  x3  +  jc 


and 


Hence, 


xn~kd(x)=x(>  +x4 


x3  +  x2  +  1 


X  ±  X2  +  lx6  +  X4 


X°  +  a:3  +  .r 


x5  +  X4  +  X3 
x5  +  X4  +  X2 


x3  +  x2 
x3  +  x2  +  1 


Hence,  from  Eq.  (15.17a), 


c(x)  =  x3d(x)  +  p(x) 
=  x3(x3  +  x)  +  1 
=  x6  +  x4  +  1 


and 


q(x) 


P(x) 


c  =  1010001 

We  could  also  have  found  the  codeword  c  directly  by  using  Eq.  (15.18c).  Thus,  c(x)  = 
y(x)g(x)  =  (.r  +x2  +  1)(jt3  + 1)  =  x6  +x4  + 1 .  We  construct  the  entire  code  table  in 

this  manner  (Table  15.5).  This  is  quite  a  tedious  procedure.  There  is,  however,  a  shortcut, 
by  means  ot  the  code  generating  matrix  G.  We  can  use  the  earlier  procedure  to  compute  the 
codewords  corresponding  to  the  data  words  1000.  0100.  0010.  0001.  These  are  1000110. 
0100011,  0010111.  0001101.  Now  recognize  that  these  four  codewords  are  the  four  rows 
ot  G.  This  is  because  c  =  d  G,  and  when  d  =  1000.  d  •  G  is  the  first  row  of  G,  and 
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TABLE  15.5 


d 

c 

1111 

1111111 

1110 

1110010 

1101 

1101000 

1100 

1100101 

1011 

1011100 

1010 

1010001 

1001 

1001011 

1000 

1000110 

0111 

0111001 

0110 

0110100 

0101 

0101110 

0100 

0100011 

0011 

0011010 

0010 

0010111 

0001 

0001101 

0000 

0000000 

so  on.  Hence, 

"10  0  0  1  1  0“ 

0  10  0  0  1  1 

G  ~  0  0  1  0  1  1  1 

0  0  0  1  1  0  1_ 

Now,  we  can  use  c  =  dG  to  construct  the  rest  of  the  code  table.  This  is  an  efficient 
method  because  it  allows  us  to  construct  the  entire  code  table  from  the  knowledge  of  only 
k  codewords. 

Table  15.5  shows  the  complete  code.  Note  that  dmi„,  the  minimum  distance  between 
two  codewords,  is  3.  Hence,  this  is  a  single-error  correcting  code,  and  14  of  these  code¬ 
words  can  be  obtained  by  successive  cyclic  shifts  of  the  two  codewords  1110010  and 
1101000.  The  remaining  two  codewords,  1111111  and  0000000,  remain  unchanged  under 

cyclic  shift. 


Generator  Polynomial  and  Generator  Matrix  of  Cyclic  Codes 

Cyclic  codes  can  also  be  described  by  a  generator  matrix  G  (Probs.  15.3-6  and  15.3-7).  It  can 
be  shown  that  Hamming  codes  are  cyclic  codes.  Once  the  generator  polynom,d  g(x)  has  been 
given,  it  is  simple  to  find  the  systematic  code  generator  matrix  G  =  [I  P]  by  determining  the 

parity  submatrix  1°: 

y-1 

1st  row  of  P:  Rem— — - 

g(x) 

x"~2 

2nd  row  of  P:  Rem  — 


xn~k 

g(x) 


(15.19) 


Ath  row  of  P:  Rem 
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Consider  a  Hamming  (7,  4,  3)  code  with  generator  polynomial 


gCr)  =  x3  +  *  +  1. 
x 6 

Rem—  =  x2  +  1 


£(-*) 


(15.20) 


Rem- 


£(*) 


=  xz  +  x  +  1 


Rem- 


gW 


=  XT  +  * 


Rem- 


g(x) 


=  x+  1 


Therefore,  the  cyclic  code  generator  matrix  is 


G  = 


1  0  0  0  10  1' 

0  10  0  111 

0  0  10  1  10 

0  0  0  10  1  1 


Correspondingly,  one  form  of  its  parity  check  matrix  is 


(15.21) 


H  = 


1  1  1  0  1  0  O' 
0  1110  10 
1  10  10  0  1 


(15.22) 


Cyclic  Code  Generation 

One  of  the  advantages  of  cyclic  codes  is  that  their  encoding  and  decoding  can  be  implemented 
by  means  of  such  simple  elements  as  shift  registers  and  modulo-2  adders.  A  systematically 
generated  code  is  described  in  Eqs.  (15.17).  It  involves  a  division  oix"~kd(x)  by  g(x)  that  can 
be  implemented  by  a  dividing  circuit  consisting  of  a  shift  register  with  feedback  connections 

according  to  the  generator  polynomial*  g(x)  =  jr"~*  +  g\x"~k~l  H - 1-  gn-k-\x  +  1.  The 

ga*n  8k  316  either  0  or  1 .  An  encoding  circuit  with  n  —  k  shift  registers  is  shown  in  Fig.  1 5.2.  An 
understanding  of  this  dividing  circuit  requires  some  background  in  linear  sequential  networks. 
An  explanation  of  its  functioning  can  be  found  in  Peterson  and  Weldon.3  The  k  data  digits 


Figure  15.2 

Encoder  for 
systematic  cyclic 
code. 


It  can  be  shown  that  for  cyclic  codes,  the  generator  polynomial  must  be  of  this  form. 
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are  shifted  in  one  at  a  time  at  the  input  with  the  switch  s  held  at  position  p\ .  The  symbol  D 
represents  a  one-digit  delay.  As  the  data  digits  move  through  the  encoder,  they  are  also  shifted 
out  onto  the  output  line,  because  the  first  k  digits  of  the  codeword  are  the  data  digits  themselves. 
As  soon  as  the  last  (or  kth)  data  digit  clears  the  last  (or  (n  -  k  )th|  register,  all  the  registers 
contain  the  parity  check  digits.  The  switch  s  is  now  thrown  to  position  P2<  and  the  n  —  k  parity 
check  digits  are  shifted  out  one  at  a  time  onto  the  line. 


Decoding 

Every  valid  code  polynomial  c(jc)  is  a  multiple  of  g(jt).  In  other  words,  c(x)  is  divisible  by 
g(x).  When  an  error  occurs  during  the  transmission,  the  received  word  polynomial  r(x)  will 
not  be  a  multiple  of  g(x)  if  the  number  of  errors  in  r  is  correctable.  Thus, 


r(*)  ,  ,  .  s(x) 

— —  =  m\(x)  +  —— 
g(x)  g(x) 


(15.23) 


and 


r(x) 

5(*)  =  Rem  — 
g(x) 


where  the  syndrome  polynomial  .v(jt)  has  a  degree  n  k  1  or  less. 
If  ^(jc)  is  the  error  polynomial,  then 

r(x)  =  c(x)  +  e(x) 


(15.24) 


Remembering  that  c(jr)  is  a  multiple  of  g(x)i 


r>  r(*) 


=  Rem 


=  Rem 


c(x)  +  e(x) 


g(x) 


e(x) 

g(x) 


(15.25) 


A  •  before  a  received  word  r  could  result  from  any  one  of  the  2*  codewords  and  a  suitable 
e^or  For  example  for  the  code  in  Table  15.5,  if  r  =  0110010,  this  could  mean  c  =  1110010 
.  -  1000000  ore  =  1101000  and  e  =  1011010,  or  14  more  such  combinations.  As  seen 

earlier” Ihemosx  likely  error  pattern  is  the  one  with  the  minimum  weigh,  (or  minimum  number 
earner,  t  llftftl0  anc|  e  =  1000000  is  the  correct  decision, 

of  Is).  Hence,  ere  a  jecoding  table,  that  is,  to  list  the  syndromes  for  all  correctable 

It  is  convemen  p  P  svnc|ronie  from  Eq.  ( 1 5.24),  and  from  the  table  we  find  the 

errors.  For  any  r.  we :  cc >  F 1  ^  Then  we  determine  c  =  r  ©  e.  Note  that  computation  of 

‘"7ian(15  24)|mvolves  exactly  the  same  operation  as  that  required  to  compute  pl.,1  during 
encothng  (Eq.  ( 15.18a)].  Hence,  the  circuit  in  Fig.  15.2  can  also  be  used  to  compute  sUI. 

~  j  —  I  ■  thle  for  the  single-error  correcting  (7, 4)  code  in  lable  15.5.  Determine 

Example  1 5.5  for  ^  recejved  vectors  r:  (a)  HOIIOI;  (b)  0101000; 

(c)  0001100. 
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TABLE  15.6 


e 

s 

1000000 

110 

0100000 

Oil 

0010000 

111 

0001000 

101 

0000100 

100 

0000010 

010 

0000001 

001 

The  first  step  is  to  construct  the  decoding  table.  Because  n  —  k  —  \  =  2,  the  syndrome 
polynomial  is  of  the  second  order,  and  there  are  seven  possible  nonzero  syndromes.  There 
are  also  seven  possible  correctable  single-error  patterns  because  n  =  7.  We  can  use 

s=e  Ht 

to  compute  the  syndrome  for  each  of  the  seven  correctable  error  patterns.  Note  that 
(Example  15.4) 


H  = 


10  1110  0 
1110  0  10 
0  1110  0  1 


We  can  now  compute  the  syndromes  based  on  H.  For  example,  for  e  =  1000000, 


s  =  [1000000]  Ht 
=  110 


In  a  similar  way,  we  compute  the  syndromes  for  the  remaining  error  patterns  (see 
Table  15.6). 

When  the  received  word  r  is  1101101,  we  can  compute  s(x),  either  according  to 
Eq.  (15.24)  or  by  simply  applying  the  matrix  product 

s  =  r  Ht 

=  [1101101]  Ht 
=  101 

Hence,  From  Table  15.6,  this  gives  e  =  0001000,  and 

c  =  r  ©  e  =  1101101  ©  0001000  =  1100101 

Because  this  code  is  systematic. 


d  =  1100 

In  a  similar  way,  we  determine  for  r  =  0101000.  s  =  110  and  e  =  1000000;  hence 
c  =  r  ©  e  =  1101000,  and  d  =  1101.  Forr  =  0001100.  s  =  001  and  e  =  0000001; 
hence  c  =  r  ©  e  =  0001101,  and  d  =  0001. 
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Bose-Chaudhuri-Hocquenghen  (BCH)  Codes  and 
Reed-Solomon  Codes 

The  BCH  codes  are  perhaps  the  best  studied  class  of  random  error  correcting  cyclic  codes. 
Moreover,  their  decoding  procedure  can  be  implemented  simply.  The  Hamming  code  is  a 
special  case  of  BCH  codes.  These  codes  are  described  as  follows:  for  any  posit  i  ve  i  ntegers  m  and 
t(t<  2m~ 1 ),  there  exists  a  f -error  correcting  (n,  k )  code  with  n  =  2'"  -  I  and  n  —  k<  mt.  The 
minimumdistancer/min  between  codewords  is  bounded  by  the  inequality  2/+I  <  </min  £  2t+2. 

As  a  special  case  of  nonbinary  BCH  codes,  Reed-Solomon  codes  are  by  far  the  most 
successful  forward  error  correction  (FEC)  codes  in  practice  today.  Reed-Solomon  codes  have 
found  broad  applications  in  digital  storage  (DVD,  CD-ROM),  high-speed  modems,  broadband 
wireless  systems,  and  HDTV,  among  numerous  others.  The  detailed  treatment  of  BCH  codes 
and  Reed-Solomon  codes  requires  extensive  use  of  modern  algebra  and  is  beyond  the  scope 
of  this  introductory  chapter.  For  in-depth  discussion  of  BCH  codes  and  Reed-Solomon  codes, 
the  reader  is  referred  to  the  classic  text  by  Lin  and  Costello.2 

Cyclic  Redundancy  Check  (CRC)  Codes  for  Error  Detection 

One  of  the  most  widely  used  cyclic  codes  is  the  cyclic  redundancy  check  codes  for  detection 
of  data  transmission  errors.  CRC  codes  are  cyclic,  designed  to  detect  erroneous  data  packets 
at  the  receivers  (often  after  error  correction).  To  verify  the  integrity  of  the  payload  data  block 
(packet),  each  data  packet  is  encoded  by  CRC  codes  of  length  n  <  2m  -  1 .  The  most  common 
CRC  codes  have  m  =12,  16,  or  32  with  code  generator  polynomial  of  the  form 


g(x)  =  (I  +x)gfW 

gc(x)  =  generator  polynomial  of  a  cyclic  Hamming  code 


To  select  a  code  generator  matrix,  the  design  criterion  is  to  control  the  probability  of  undetected 
error  events.  In  other  words,  the  CRC  codes  must  be  able  to  detect  the  most  likely  error  patterns 
such  that  the  probability  of  undetected  errors 


(15.26) 


checked  by  the  CRC-32  sequence. 
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TABLE  15.7 

Commonly  Used  CRC  Codes  and  Corresponding  Generator 
Matrices 


Code 

Number  of  Bits  in  FCS 

Generator  Polynomial  #(jc) 

CRC-8 

8 

a8+a7+a6+a4+a2+1 

CRC- 12 

12 

a12  +  a1  1  +  r5  +  x2  +  x  +  1 

CRC- 16 

16 

A16  +A15  +A2  +  1 

CRC-CCITT 

16 

Al6+A12  +A5  +  1 

CRC-32 

32 

A32  +  A26  +A23  +A22  +A16  +A12  +A11 
+A10  +  A8  +  A7  +  A5  +  A4  +  A2  +  A  +  1 

the  same  for  both  systems.  Because  only  k  digits  are  required  to  be  transmitted  in  the  uncoded 
system  (versus  n  over  the  coded  one),  the  bit  rate  Rt,  is  lower  for  the  uncoded  system  than  the 
coded  one  by  a  factor  of  k/n.  This  means  that  the  bandwidth  ratio  of  the  uncoded  system  over 
the  coded  system  is  k/n.  Clearly,  the  coded  system  sacrifices  bandwidth  for  better  reliability. 
On  the  other  hand,  the  coded  system  sends  n  code  bits  for  k  information  bits.  To  be  fair,  the 
total  energy  used  by  the  n  code  bits  must  equal  to  the  total  energy  used  by  the  uncoded  system 
for  the  k  information  bits.  Thus,  in  the  coded  system,  each  code  bit  has  Eh  that  is  k/n  times  that 
of  the  uncoded  system  bit.  We  need  to  illustrate  how  error  correction  can  reduce  the  originally 
higher  bit  error  rate  (BER)  despite  this  loss  of  code  bit  energy. 

Let  Pb u  and  Pbc  represent  the  raw  data  bit  error  probabilities  in  the  uncoded  and  coded 
cases,  respectively.  For  the  uncoded  case,  the  raw  bit  error  rate  is  the  final  bit  error  rate  Peu- 

For  a  f -error  correcting  ( n ,  k)  code,  the  raw  BER  can  be  reduced  because  the  decoder  can 
correct  up  to  t  bit  errors  in  every  n  bits.  We  consider  the  ideal  case  that  the  decoder  will  not 
attempt  to  correct  the  codeword  when  there  are  more  than  t  errors  in  n  bits.  This  action  of  the 
ideal  error  correction  decoder  can  reduce  the  average  BER.  Let  P(j, n)  denote  the  probability 
of  J  errors  in  n  digits.  Then  the  average  number  of  bit  errors  in  each  codeword  after  error 
correction  is 


ne  =  E{j  bit  errors  in  n  bits  } 

n 

=  £/■/>(/,# i)  (15-27a) 

j=i+ 1 


Therefore  the  average  BER  after  error  correction  should  be 

Pec  =  *  (15.27b) 

n 

Because  there  are  (^)  ways  in  whichy  errors  can  occur  in  n  digits  (Example  8.6),  we  have 


P(j,n) 


(Pbcyo  -  Pbc)n-j 
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Based  on  Eq.  (15.27a) 


ne=  f/Wd  -Pte)n~j 

At  w 

(15.28a) 

Pec=  J-(n)(P^  ~  P^n~j 

j=l+ 1  "  '  J ' 

=  e 

j=t+ 1  XJ  7 

(15.28b) 

For  Pfc  <£  1,  the  first  term  in  the  summation  of  Eq.  (15.28b)  dominates  all  the  other  terms, 
and  we  are  justified  in  ignoring  all  but  the  first  term.  Hence, 

Pec  =  ("  t  1^(/>bc),+  l(l  -  Phc)"-(,+  l) 

(15.29a) 

~  f"  t  1  Wbc)'+I  for  Pbc  <K  1 

(15.29b) 

For  further  comparison,  we  must  assume  some  specific  transmission  scheme.  Let  us  con¬ 
sider  a  coherent  PSK  scheme.  In  this  case,  for  an  additive  white  Gaussian  noise  (AWGN) 
channel, 


P I'n  — 


and  because  Eb  for  the  coded  case  is  k/n  times  that  for  the  uncoded  case. 


(15.30a) 


Hence, 


(15.30b) 


-w+l 


(15.31a) 

(15.31b) 


To  comoare  coded  and  uncoded  systems,  we  could  plot  P„  and  />«  as  function,  of  the  raw 

the  uncoded  system).  Because  Eqs.  ( 1 5.3 1  >  involve  parameters and  *  a  proper 

rtnnnires  families  of  plots.  For  the  case  of  a  (7,  4)  single-error  correcting  code 
companson  mqutres  fanning  Jj.  ^  ^  ^  (|JJ|)(ire  plotted  in  r„.  ,5.3  as  a  function 

1  :  ,\f  ohserve  that  the  coded  scheme  is  superior  to  the  uncoded  scheme  at  higher  Eh/M 
of  Eh/M-  Ob.  significant.  For  large  n  and  k.  however,  the  coded 

bu,  the  improvement  (afmut  I  dB)  *s  =  ^  ^  ^  channe|s  ,  ed 

dodes  can  yield  substantia,  gains,  as  shown  in . ... 

example. 


930  ERROR  CORRECTING  CODES 


It  should  be  noted  that  the  coded  system  performance  of  Fig.  15.3  is  in  fact  a  slightly 
optimistic  approximation.  The  reason  is  that  in  analyzing  its  bit  error  rate,  we  assumed  that 
the  decoder  will  not  take  any  action  when  the  number  of  errors  in  each  codeword  exceeds  t. 
In  practice,  the  decoder  never  knows  how  many  errors  are  in  a  codeword.  Thus,  the  decoder 
will  always  attempt  to  correct  the  codeword,  assuming  that  there  are  no  more  than  t  bit  errors. 
This  means  that  when  there  are  more  than  t  bit  errors,  the  decoding  process  may  even  increase 
the  number  of  errors.  This  counterproductive  decoding  effect  is  more  likely  when  Pe  is  high 
at  low  Eb/N.  This  effect  will  be  shown  later  in  Sec.  15.13  as  a  MATLAB  exercise. 


Example  1  5.6  Compare  the  performance  of  an  AWGN  BSC  using  a  single-error  correcting  (15,  11)  code 
with  that  of  the  same  system  using  uncoded  transmission,  given  that  Eb/J\f  =  9.0946  for  the 
uncoded  scheme  and  coherent  PSK  is  used  to  transmit  the  data. 

From  Eq.  (15.31b), 


P eu  =  0(^/18.1892)  =  1.0  x  10“5 


Pec  =  14 


Q  (  \l  y^(18.1892) 


n2 


=  14(1.3  x  10~4)2  =  2.03  x  10"7 


and  from  Eq.  (15.31a), 
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Note  that  the  word  error  probability  of  the  coded  system  is  reduced  by  a  factor  of  50. 
On  the  other  hand,  if  we  wish  to  achieve  the  error  probability  of  the  coded  transmission 
(2.03  x  10-7)  by  means  of  the  uncoded  system,  we  must  increase  the  transmitted  power. 
If  E'b  is  the  new  value  of  £/,  to  achieve  Pcu  =  2.03  x  I0-7, 


=  2.03  x  l()-7 


This  gives  E'h/ AT  =  13.5022.  This  is  an  increase  over  the  old  value  of  9.0946  by  a  factor 
of  1.4846,  or  1.716  dB. 


Burst  Error  Detecting  and  Correcting  Codes 

Thus  far  we  have  considered  detecting  or  correcting  errors  that  occur  independently,  or  ran¬ 
domly,  in  digit  positions.  On  some  channels,  disturbances  can  wipe  out  an  entire  block  of 
digits.  For  instance,  a  stroke  of  lightning  or  a  human-made  electrical  disturbance  can  affect 
several  adjacent  transmitted  digits.  On  magnetic  storage  systems,  magnetic  material  defects 
usually  affect  a  block  of  digits.  Burst  errors  are  those  that  wipe  out  some  or  all  of  a  sequential 
set  of  digits.  In  general,  random  error  correcting  codes  are  not  efficient  for  correcting  burst 
errors.  Hence,  special  burst  error  correcting  codes  are  used  for  this  purpose. 

A  burst  of  length  b  is  defined  as  a  sequence  of  digits  in  which  the  first  digit  and  the  bt  h  digit 
are  definitely  in  error,  with  the  b  -  2  digits  in  between  either  in  error  or  correct.  For  example, 
an  error  vector  e  =  0010010100  has  a  burst  length  of  6. 

It  can  be  shown  that  for  detecting  all  burst  errors  of  length  b  or  less  with  a  linear  block  code 
of  length  n,  b  parity  check  bits  are  necessary  and  sufficient.3  We  shall  prove  the  sufficiency 
part  of  this  theorem  by  constructing  a  code  of  length  n  with  b  parity  check  digits  that  will 

detect  a  burst  of  length  b.  ....... 

To  construct  such  a  code,  let  us  group  k  data  digits  into  segments  of  b  digits  in  length 

(Fig.  15.4).  To  this  we  add  a  last  segment  of  b  parity  check  digits,  which  are  determined  as 

_  m.  i :  *  * ranmAnf  (inoliiHinn  fhp  nttrtlv  nYi&C'V 


follows.  The  modulo-2  sum  of  the  ith  digits  from  each  segment  (including  the  parity  check 


segment)  must  be  zero.  For  example,  the  first  digits  in  the  five  data  segments  are  1.  0.  1.  1. 
and  1  Hence  to  obtain  a  modulo-2  sum  zero,  we  must  have  0  as  the  first  parity  check  digit.  We 
continue  in  this  way  with  the  second  digit,  the  third  digit,  and  so  on,  to  the  5th  digit.  Because 
parity  check  digits  are  a  linear  combination  of  data  digits,  this  is  a  linear  block  code.  Moreover, 


Figure  15.4 

Burst  error 
detection. 


Parity-check  bits 


b  digits 


1  0  1  1  !  0  1  0  1 


1  0  I  j  1  0  0  0 


0  0  0 


k  data  bits 
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check  digits,  is  independent  of  k  (or  /z),  which  makes  it  a  very  useful  code  for  such  systems  as 
packet  switching,  where  the  data  digits  may  vary  from  packet  to  packet.  It  can  be  shown  that 
a  linear  code  with  b  parity  bits  detects  not  only  all  bursts  of  length  b  or  less,  but  also  a  high 
percentage  of  longer  bursts.3 

If  we  are  interested  in  correcting  rather  than  detecting  burst  errors,  we  require  twice 
as  many  parity  check  digits.  According  to  the  Hamming  sphere  reasoning:  to  correct  all 
burst  errors  of  length  b  or  less,  a  linear  block  code  must  have  at  least  2b  parity-check 


1 5.6  CONVOLUTIONAL  CODES 


Convolutional  (or  recurrent)  codes,  introduced  in  1955,4  differ  from  block  codes  as  follows. 
In  a  block  code,  the  block  of  n  code  digits  generated  by  the  encoder  in  any  particular  time  unit 
depends  only  on  the  block  of  k  input  data  digits  within  that  time  unit.  In  a  convolutional  code, 
on  the  other  hand,  the  block  of  n  code  digits  generated  by  the  encoder  in  a  particular  time  unit 
depends  not  only  on  the  block  of  k  message  digits  within  that  time  unit  but  also  on  the  data 
digits  within  a  previous  span  of  N  —  1  time  units  (N  >  1).  For  convolutional  codes,  k  and 
n  are  usually  small.  Convolutional  codes  can  be  devised  for  correcting  random  errors,  burst 
errors,  or  both.  Encoding  is  easily  implemented  by  shift  registers.  As  a  class,  convolutional 
codes  are  easier  to  encode. 

15.6.1  Convolutional  Encoder 

A  convolutional  encoder  with  constraint  length  N  consists  of  an  N- stage  shift  register  and  t 
modulo-2  adders.  Figure  15.5  shows  such  an  encoder  for  the  case  of  V  =  3  and  i  =  2.  The 
message  digits  are  applied  at  the  input  of  the  shift  register.  The  coded  digit  stream  is  obtained 
at  the  commutator  output.  The  commutator  samples  the  i  modulo-2  adders  in  sequence,  once 
during  each  input-bit  interval.  We  shall  explain  this  operation  with  reference  to  the  input 
digits  11010. 

Initially,  all  the  contents  of  the  register  are  0.  At  time  k  =  1,  the  first  data  digit  1  enters 
the  register.  The  content  d *  shows  1  and  all  the  other  contents  <4_i  =  0  and  d^-2  =  0  are 
still  unchanged.  The  two  modulo-2  adders  show  encoder  output  v*  j  =  1  and  v*,2  ==  1  f°r 
data  input.  The  commutator  samples  this  output.  Hence,  the  encoder  output  is  11.  At  A:  =  2, 


Figure  15.5 
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Input  bit  stream 
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the  second  message  bit  1  enters  the  register.  It  enters  the  register  stage  <4*  and  the  previous  I 
in  d\  is  now  shifted  to  <4-i,  where  as  <4-2  is  still  0.  The  modulo-2  adders  now  show  v*  j  =  0 
and  v*,2  =  1.  Hence,  the  encoder  output  is  01.  In  the  same  way,  when  the  new  digit  0  enters 
the  register,  we  have  <4  =  0,  <4-i  =  1,  and  <4-2  =  and  the  encoder  output  is  01. 

Observe  that  each  data  digit  influences  N  groups  of  t  digits  in  the  output  (in  this  case 
three  groups  of  two  digits).  The  process  continues  until  the  last  data  digit  enters  the  stage 
<4.*  We  cannot  stop  here,  however.  We  add  N  —  1  number  of  Os  to  the  input  stream  (dummy 
or  augmented  data)  to  make  sure  that  the  last  data  digit  (0  in  this  case)  proceeds  all  the  way 
through  the  shift  register,  to  influence  the  N  groups  of  v  digits.  Hence,  when  the  input  digits  are 
11010,  we  actually  apply  (from  left  to  right)  1101000,  which  contains  N  —  1  augmented  zeros 
to  the  input  of  the  shift  register.  It  can  be  seen  that  when  the  last  digit  of  the  augmented  message 
stream  enters  dh,  the  last  digit  of  the  message  stream  has  passed  through  all  the  N  stages  of 
the  register.  The  reader  can  verify  that  the  encoder  output  is  given  by  11010100101100.  Thus, 
there  are  in  all  n  =  (N  +k  —  \)i  digits  in  the  coded  output  for  every  k  data  digits.  In  practice, 
k  N,  and,  hence,  there  are  approximately  kt  coded  output  digits  for  every  k  data  digits, 
giving  an  rate  r]  ~  1  /t .+ 

It  can  be  seen  that  unlike  the  block  encoder,  the  convolutional  encoder  operates  on  a 
continuous  basis,  and  each  data  digit  influences  N  groups  of  t  digits  in  the  output. 


Code  Tree 

The  process  of  coding  and  decoding  is  considerably  facilitated  by  what  is  known  as  the  code 
tree,  which  shows  the  coded  output  for  any  possible  sequence  of  data  digits.  The  code  tree  for 
the  encoder  in  Fig.  15.5  with  k  =  5  is  shown  in  Fig.  15.6.  When  the  first  digit  is  0,  the  encoder 
output  is  00.  and  when  it  is  1,  the  output  is  11.  This  is  shown  by  the  two  tree  branches  that 
start  at  the  initial  node.  The  upper  branch  represents  0.  and  the  lower  branch  represents  I .  This 
convention  will  be  followed  throughout.  At  the  terminal  node  of  each  of  the  two  branches, 
we  follow  a  similar  procedure,  corresponding  to  the  second  data  digit.  Hence,  two  branches 
initiate  from  each  node,  the  upper  one  for  0  and  the  lower  one  for  1.  This  continues  until  the 
kth  data  digit.  From  there  on,  all  the  input  digits  are  0  (augmented  digit),  and  we  have  only  one 
branch  until  the  end.  Hence,  in  all  there  are  32  (or  2* )  outputs  corresponding  to  2*  possible  data 
vectors.  The  coded  output  for  input  11010  can  be  easily  read  from  this  tree  (the  path  shown 

dashed  in  Fig.  15.6).  .  _ . 

Figure  15  6  shows  that  the  code  tree  becomes  repetitive  after  the  third  branch.  This  can 

be  seen  by  noting  that  the  two  blocks  enclosed  in  the  dashed  lines  are  identical.  It  means  that 
the  output  from  The  fourth  input  digit  is  the  same  whether  the  first  digit  was  1  or  0.  This  is 
not  surprising  since  when  the  fourth  input  digit  enters  the  shift  register,  the  first  input  digit 
is  shifted  out  of  the  register  and  ceases  to  influence  the  output  digits.  In  other  words,  the  data 
„_t„r  1  v,  >•->  xixa  •  •  •  and  the  data  vector  Or, *2*3*4  ’ ' '  generate  the  same  output  after  the  third 


*  For  a  systematic  code,  one  of  the  < 
t  In  general,  instead  of  shifting  one 


of  the  output  digits  must  be  the  data  digit  itself. 

ng  one  digit  at  a  time,  b  digits  may  be  shifted  at  a  time.  In  this  case  n  ~  b/i. 
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Figure  15.6 

Code  tree  for  the 
encoder  in 
Fig.  15.5. 
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State  Transition  Diagram  Representation 

The  encoder  behavior  can  be  seen  from  the  perspective  of  a  finite  state  machine  with  its 
state  transition  diagram.  When  a  data  bit  enters  the  shift  register  (in  d k),  the  output  bits  are 
determined  not  only  by  the  data  bit  in  d^,  but  by  the  two  previous  data  bits  already  in  stages 
dk-i  and  dk-\.  There  are  four  possible  combinations  of  the  two  previous  bits  (in  dk-2  an^ 
*4-1 ) :  00’  01, 10,  and  11.  We  shall  label  these  four  states  a,  b ,  c,  and  d ,  respectively,  as  shown 
in  Fig.  15.7a.  Thus,  when  the  previous  two  bits  are  01  ( dk-2  =  0,  dk-\  =  1),  the  state  is  b , 
and  so  on.  The  number  of  states  is  equal  to  2N~X . 

A  data  bit  0  or  1  generates  four  different  outputs,  depending  on  the  encoder  state.  If  the 
data  bit  is  0,  the  encoder  output  is  00,  10, 11,  or  01,  depending  on  whether  the  encoder  state  is 
a ,  b,  c,  or  d.  Similarly  if  the  data  bit  is  1,  the  encoder  output  is  11,  01,  00,  or  10,  depending  on 
whether  the  encoder  state  is  a ,  b ,  c,  or  d .  This  entire  behavior  can  be  concisely  expressed  by 
the  state  transition  diagram  (Fig.  15.7b),  a  four-state  directed  graph  that  uniquely  represents 
the  input-output  relation  of  this  encoder.  We  label  each  transition  path  with  a  label  of  input  bit 
over  output  bits: 


{<4}/{v*,i  vu} 

This  way,  we  know  exactly  the  input  information  bit  dk  for  each  state  transition  and  its 

corresponding  encoder  output  bits,  {v^j  v*,2}. 

For  instance,  when  the  encoder  is  in  state  a ,  and  we  input  1,  the  encoder  output  is  11-  Thus 
the  transition  path  is  labeled  with  1/11.  The  encoder  now  goes  to  state  b  for  the  next  data  bit 
because  at  this  point  the  previous  two  bits  become  dk_2  =  0  and  dk-\  =  1.  Similarly,  when 
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Figure  15.7 

(a)  State  and 

(b)  state  transi¬ 
tion  diagram  of 
the  encoder  in 
Fig.  15.5. 


Figure  15.8 

Trellis  diagram 
for  the  encoder 
in  Fig.  1 5.5. 


Branch  labels:  input  bits  /  output  bits 


the  encoder  is  in  state  a  and  the  input  is  0.  the  output  is  (Mi  (solid  line),  and  the  encoder  remains 
in  state  a  Note  that  the  encoder  cannot  go  directly  from  state  a  to  states  c  or  d .  From  any  given 
state  the  encoder  can  go  to  only  two  states  directly  by  inputting  a  single  data  bit.  This  is  an 
extremely  important  observation,  which  will  be  used  later.  The  encoder  goes  from  state  a  to 
state  b  (when  the  input  is  1),  or  to  state  a  (when  the  input  is  0).  and  so  on.  The  encoder  cannot 
£0  from  a  to  c  in  one  step.  It  must  go  from  a  to  b  to  c,  or  from  a  to  b  to  d  to  c,  and  so  on.  We 
can  also  verify  these  facts  from  the  code  tree.  Figure  15.7b  contains  the  complete  mformat.on 

of  the  code  tree. 


Another  JLfd  way  of  representing  the  code  tree  .s  the  trellis  d.agram  (Fig.  1 5.8).  The  diagram 
starts  from  scratch  (all  Os  in  the  shift  register,  i.e.,  state  a)  and  makes  transitions  corresponding 
T  h  innut  data  digit  These  transition  branches  are  labeled  just  as  we  labeled  the  state 
to  each  inp  g  fi  ■  ,  di git  is  0,  the  encoder  output  is  <M>.  and  the  trellis 

rr  SS  lSl'y  seen  from  Fig.  15.7b.  We  continue  this  way  with  the 
branch  is  la  /  ^^*,0  input  digits,  the  encoder  is  in  one  ofthe  four  states  a,  b,c,  or 

second  input  d  g  .  At  in  state  a  (previous  two  data  digits  00),  it  goes  to  state 

as  shown  F^Ut  ^  ^  ^  fl  .f  (he  next  input  bit  is  0.  In  so  doing,  the  encoder 
b  it  the  next  np  that  the  structure  of  the  trellis  diagram  is  completely 

J -  can  be  -eadily  drawn  by  using  .he  «*  diagran,  in  Fig.  15.7b. 
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Figure  15.9 

A  recursive 
systematic 
convolutional 
(RSC)  encoder. 


dk  vkA 


It  should  be  noted  that  the  convolutional  encoder  can  have  feedback  branches.  In  fact, 
feedback  in  the  convolutional  encoder  generates  the  so-called  recursive  code.  As  shown  in 
Fig.  15.9,  the  data  bit  can  have  a  direct  path  to  the  output  bit.  The  bits  from  the  top  branch  will 
be  the  information  bits  from  the  input  directly.  This  code  is  therefore  systematic.  This  encoder 
leads  to  a  recursive  systematic  convolutional  (RSC)  code.  It  can  be  shown  (see  Prob.  15.5-3) 
that  the  RSC  encoder  can  also  be  represented  by  a  similar  state  transition  diagram  and  a  trellis 
diagram.  Consequently,  recursive  convolutional  code  can  be  decoded  by  using  the  methods 
described  next  for  nonrecursive  convolutional  codes. 


15.6.2  Decoding  Convolutional  Codes 

We  shall  consider  two  important  techniques:  ( 1 )  maximum  likelihood  decoding  (Viterbi  algo¬ 
rithm)  and  (2)  sequential  decoding.  Although  both  are  known  as  hard-decision  decoders,  the 
Viterbi  algorithm  (VA)  is  much  more  flexible  and  can  be  easily  adapted  to  allow  soft  input  and 
to  generate  soft  output  decisions,  to  be  described  later  in  this  chapter. 

Maximum  Likelihood  Decoding:  The  Viterbi  Algorithm 

Among  various  decoding  methods  for  convolutional  codes,  Viterbi’s  maximum  likelihood  algo¬ 
rithm^  is  one  of  the  best  techniques  for  digital  communications  when  computational  complexity 
dominates  in  importance.  It  permits  major  equipment  simplification  while  obtaining  the  full  per¬ 
formance  benefits  of  maximum  likelihood  decoding.  The  decoder  structure  is  relatively  simple 
for  short  constraint  length  TV ,  making  decoding  feasible  at  relatively  high  rates  of  up  to  1 0  Gbit/s. 

In  AWGN  channels,  the  maximum  likelihood  receiver  requires  selecting  a  codeword  clos¬ 
est  to  the  received  word.  For  a  long  sequence  of  received  data  representing  k  message  bits 
and  2k  codewords,  direct  implementation  of  maximum  likelihood  decision  (MLD)  involves 
storage  of  2k  words  and  their  comparison  to  the  received  sequence.  This  computational  need 
places  a  severe  burden  on  MLD  receivers  for  large  values  of  k  for  convolutionally  encoded 
data  frames,  typically  in  the  order  of  hundreds  or  thousands  of  bits! 

Viterbi  made  a  major  simplification  for  MLD.  We  shall  use  the  convolutional  code  example 
of  Figs.  15.5  and  15.7  to  illustrate  the  fundamental  operations  of  the  VA.  First,  we  stress  that 
each  path  that  traverses  through  the  trellis  represents  a  valid  codeword.  The  objective  of  MLD 
is  to  find  the  best  path  through  the  trellis  that  is  closest  to  the  received  data  bit  sequence.  To 
understand  this,  consider  again  the  trellis  diagram  in  Fig.  15.8.  Our  problem  is  as  follows: 
given  a  received  sequence  of  bits,  we  need  to  find  a  path  in  the  trellis  diagram  with  the  output 
digit  sequence  that  agrees  best  with  the  received  sequence.  The  minimum  (Hamming)  distance 
path  represents  the  most  likely  sequence  up  to  stage  /. 
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As  shown  in  Fig.  15.8,  each  codeword  is  a  trellis  path  that  should  start  from  state  a  (00). 
Because  every  path  at  stage  i  must  grow  out  of  the  paths  at  stage  /  —  1,  the  optimum  path  to 
each  state  at  stage  i  must  contain  one  of  the  best  paths  to  each  of  the  four  states  at  stage  /  —  1 . 
In  short,  the  optimum  path  to  each  state  at  stage  i  is  a  descendant  of  the  predecessors  at  stage 
/  —  1.  All  optimum  paths  at  any  stage  i  +  /'o  are  descendants  of  the  optimum  path  at  stage  /. 
Hence,  only  the  best  path  to  each  state  need  be  stored  at  a  given  stage.  There  is  no  reason  to 
store  anything  but  the  optimum  path  to  each  state  at  every  stage  because  nonoptimum  paths 
would  only  increase  the  metric  of  path  distance  to  the  received  data  sequence. 

In  the  special  example  of  Fig.  15.7,  its  trellis  diagram  (Fig.  15.8)  shows  that  each  of  the 
four  states  (a,fe,c,  and  d)  has  only  two  predecessors;  that  is,  each  state  can  be  reached  only 
through  two  previous  states.  More  importantly,  since  only  the  four  best  surviving  paths  (one 
for  each  state)  exist  at  stage  /  —  1,  there  are  only  two  possible  paths  for  each  state  at  stage  /'. 
Hence,  by  comparing  the  total  Hamming  distances  (from  the  received  sequence)  of  the  two 
paths,  we  can  find  the  optimum  path  with  the  minimum  Hamming  distance  for  every  state  at 
stage  i  that  corresponds  to  a  codeword  which  is  closest  to  the  received  sequence  up  to  stage  /. 
The  optimum  path  to  each  state  is  known  as  the  survivor  or  the  surviving  path. 


le  15.7  We  now  study  a  decoding  example  of  the  Viterbi  algorithm  for  maximum  likelihood  decoding 
of  the  convolutional  code  generated  by  the  encoder  of  Fig.  1 5.5.  Let  the  first  1 2  received  digits 
be  01  10  11  00  00  00,  as  shown  in  Fig.  15.10a— e.  Showing  the  received  digits  along  with  the 
branch  output  bits  makes  it  easier  to  computer  the  branch  Hamming  distance  in  each  stage. 


We  start  from  the  initial  state  of  a  (00).  Every  stage  of  the  decoding  process  is  to  find  the 
optimum  path  to  the  four  states  given  the  2  received  bits  during  the  stage.  There  are  two 
possible  states  leading  to  each  state  in  any  given  state.  The  survivor  with  the  minimum 
Hamming  distance  is  retained  (solid  line),  whereas  the  other  path  with  larger  distance  is 
discarded  (dashed  line).  The  Hamming  distance  of  each  surviving  path  is  labeled  at  the 
end  of  a  stage  to  each  of  the  four  states. 

•  After  two  stages,  there  is  exactly  one  optimum  (surviving)  path  to  each  state  (Fig.  15.1 0a). 
The  Hamming  distances  of  the  surviving  paths  are  labeled  as  2,  2,  1 ,  and  3,  respectively. 

•  Each  state  at  stage  3  has  two  possible  paths  (Fig.  15.10b).  We  keep  the  optimum  path 
with  the  minimum  distance  (solid  line).  The  distances  of  the  two  possible  paths  (from 
top  to  bottom)  arriving  at  each  state  are  given  in  the  minimization  label.  For  example, 

.  x *  *  i  J  1  • _ r _ \  Unc  Unmminn  0  O  —  A. 


for  state  a  the  first  path  (dashed  line  from  a)  has  Hamming  distance  of  2  +  2  =  4, 
whereas  the  second  path  (solid  line  from  c)  has  the  distance  of  1+  0  =  I . 

•  Repeat  the  same  step  for  stages  4,  5,  and  6,  as  illustrated  in  Fig.  I5.10c^. 


for  state  a 


be 


Codeword:  11  10  11  000000 

Information  bits:  1  0  0  0  0  0 


(15.32a) 

(15.32b) 
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Figure  15.10 

Viterbi  decoding 
example  in 
Fig.  15.5: 

(a)  stage  1  and 
2;  (b)  stage  3; 

(c)  stage  4; 

(d)  stage  5; 

(ej  stage  6. 


Received  bits:  01_ 10_ 1J_ 10_ 00 


00 


Received  bits:  01_ 10_ 1  1_ 10_ 00_ 00 


Received  bits:  01 _ 10 _ jj _ 10 _ 00 _ 00  - 
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Received  bits:  01 _ 10 _ U _ [Q _ 00 _ 00 


Received  hits:  01 _ Ifl _ LJ _ Ifi _ QQ - & -  Optimal  path 


In  the  preceding  example,  we  have  illustrated  how  to  progress  from  one  stage  to  the  next 
bv  determining  the  optimum  path  (survivor)  leading  to  each  of  the  state.  When  these  survivors 
do  merge  the  merged  branches  represent  the  most  reliable  MLD  outputs.  For  the  later  stages 
that  do  not  exhibit  a  merged  path,  we  are  ready  to  make  a  maximum  likelihood  decision  based 
on  the  received  data  bits  up  to  that  stage.  This  process,  known  as  truncation,  is  designed  to  a 
force  a  decision  on  one  path  among  all  the  survivors  without  leading  to  a  long  decking  delay. 
One  way  to  make  a  truncated  decision  is  to  take  the  minimum  distance  path  as  in  Eq.  (15.32). 
Another  alternative  is  to  rely  on  extra  codeword  information.  In  Fig.  15.10e,  ,1  the  encoder 
always  forces  the  last  two  data  digits  to  be  (Ml.  then  we  can  consider  only  the  survivor  ending  at 

'^With  the  Viterbi  algorithm,  storage  and  computational  complexity  are  proportional  to  2*"' 
and  are  very  attractive  for  constraint  length  N  <  10.  To  achieve  very  low  error  probabilities. 
longer6 constraint  lengths  are  required,  and  sequential  decoding  (to  be  discussed  next)  may 

become  attractive. 


Sequential  f)e^^,nftechni  proposed  by  Wozencraft.  the  complexity  of  the  decoder 
!„ncS“early  raihmhan  exponentially.  Toexplain  this  technique,  le,  us  consider  anencode, 


Figure  1 5.1 1 

Convolutional 

encoder. 


Figure  15.12 

Code  tree  for  the 
encoder  in 
Fig.  15.11. 
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with  N  =  4  and  i  =  3  (Fig.  15.11).  The  code  tree  for  this  encoder  is  shown  in  Fig.  15.12. 
Each  data  digit  generates  three  (t  =  3)  output  digits  but  affects  four  groups  of  three  digits 
(12  digits)  in  all. 

In  this  decoding  scheme,  we  observe  only  three  (or  i )  digits  at  a  time  to  make  a  tentative 
decision,  with  readiness  to  change  our  decision  if  it  creates  difficulties  later.  A  sequential 
detector  acts  much  like  a  driver  who  occasionally  makes  a  wrong  choice  at  a  fork  in  the  road, 
but  quickly  discovers  the  error  (because  of  road  signs),  goes  back,  and  takes  the  other  path. 

Applying  this  insight  to  our  decoding  problem,  the  analogous  procedure  would  be  as 
follows.  We  look  at  the  first  three  received  digits.  There  are  only  two  paths  of  three  digits  from 
the  initial  node  n\.  We  choose  that  path  whose  sequence  is  at  the  shortest  Hamming  distance 
from  the  first  three  received  digits.  We  thus  progress  to  the  most  likely  node.  From  this  node 
there  are  two  paths  of  three  digits.  We  look  at  the  second  group  of  the  three  received  digits 
and  choose  that  path  whose  sequence  is  closest  to  these  received  digits.  We  progress  this  way 
until  the  fourth  node.  If  we  were  unlucky  enough  to  have  a  large  number  of  errors  in  a  certain 
received  group  of  t  digits,  we  will  take  a  wrong  turn,  and  from  there  on  we  will  find  it  more 
difficult  to  match  the  received  digits  with  those  along  the  paths  available  from  the  wrong  node. 
This  is  the  clue  to  the  realization  that  an  error  has  been  made.  Let  us  explain  this  by  an  example. 

Suppose  a  data  sequence  11010  is  encoded  by  the  encoder  in  Fig.  15.1 1.  Because  N  =  4. 
we  add  three  dummy  Os  to  this  sequence  so  that  the  augmented  data  sequence  is  11010000. 
The  coded  sequence  will  be  (see  the  code  tree  in  Fig.  15.12) 

111  101  001  111  001  on  011  000 

Let  the  received  sequence  be 

101  oil  001  111  001  on  on  000 


There  are  three  bit  errors:  one  in  the  first  group  and  two  in  the  second  group.  We  start  at  the 
initial  node  n, .  The  first  received  group  101  (one  error)  being  closer  to  1 1 1.  we  make  a  correct 
decision  to  go  to  node  n2 .  But  the  second  group  001  (two  errors)  is  closer  to  010  than  to  101 
and  will  lead  us  to  the  wrong  node  n'3  rather  than  to  ny  From  here  on  we  are  on  the  wrong  track, 
and  hence  the  received  digits  will  not  match  any  path  starting  from  n'y  The  third  received 
group  is  001  and  does  not  match  any  sequence  starting  at  n'3  (viz..  001  and  100).  But  it  is  closer 
to  011  Hence  we  go  to  node  n\.  Here  again  the  fourth  received  group  111  does  not  match 
any  group  starting  at  n\  (viz..  011  and  100).  But  it  is  closer  to  Oil.  This  takes  us  to  node  n'y 
It  can  be  seen  that  the  Hamming  distance  between  the  sequence  of  12  digits  along  the  path 
mmn'n'n  and  the  first  12  received  digits  is  4,  indicating  four  errors  in  12  digits  (if  our  path 
is  correct)  'such  a  high  number  of  errors  should  immediately  make  us  susp.cous.  If  P,  is  the 
digit  error  probability,  then  the  expected  number  of  errors  /t,  in  d  digits  is  Ped.  Because  P,.  is 
on  the  order  of  KT4  to  10'6,  four  errors  in  12  digits  is  unreasonable.  Hence,  we  go  back  to 

node  and  try  the  lower  branch,  leading  to  n".  This  path.  is  eve"  worse  than  *he 

previous  one-  it  gives  five  errors  in  12  digits.  Hence,  we  go  back  even  farther  to  node  n;  and 
mv  Z  oath  leading  to  m  and  farther.  We  find  the  path  n,n2n2n,ny  giving  three  errors.  If  we 
ol  hnek  still  farther  to  and  try  alternate  paths,  we  find  that  none  yields  less  than  five  errors. 
CT.h  c^paft  U  taken  as  S-ing  three  error,.  This  enables  us  to  decode 

1  hus,  the  c  p  we  start  at  node  „2i  discard  the  first  three  received  digits, 

and^peaUhe'procediire  to  decode  the  second  transmitted  digit.  We  repeat  this  until  all  the 

digits  have  been  .  concerns  the  criterion  lor  deciding  when  the  wrong  path  is 

cho^^ofrer^numtero,em,rs,,a,.ra„cti . .  the  number  of  decoded 
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Figure  15.13 

Setting  the 
threshold  in 
sequential 
decoding. 


digits  d  is  a  straight  line  (ne  =  Ped)  with  slope  Pe ,  as  shown  in  Fig.  15. 13.  The  actual  number 
of  errors  along  the  path  is  also  plotted.  If  the  errors  remain  within  a  limit  (the  discard  level), 
the  decoding  continues.  If  at  some  point  the  errors  exceed  the  discard  level,  we  go  back  to  the 
nearest  decision  node  and  try  an  alternate  path.  If  errors  still  increase  beyond  the  discard  level, 
we  then  go  back  one  more  node  along  the  path  and  try  an  alternate  path.  The  process  continues 
until  the  errors  are  within  the  set  limit.  By  making  the  discard  level  very  stringent  (close  to  the 
expected  error  curve),  we  reduce  the  average  number  of  computations.  On  the  other  hand,  if  the 
discard  level  is  made  too  stringent,  the  decoder  will  discard  all  possible  paths  in  some  extremely 
rare  cases  ot  an  unusually  large  number  of  errors  due  to  noise.  This  difficulty  is  usually  resolved 
by  starting  with  a  stringent  discard  level.  If  on  rare  occasions  the  decoder  rejects  all  paths,  the 
discard  level  can  be  relaxed  little  by  little  until  one  of  the  paths  is  acceptable. 

It  can  be  shown  that  the  error  probability  in  this  scheme  decreases  exponentially  as  N, 
whereas  the  system  complexity  grows  only  linearly  with  k .  The  code  rate  is  r;  ~  1  /£.  It  can  be 
shown  that  for  rj  <  r}() ,  the  average  number  of  incorrect  branches  searched  per  decoded  digit 
is  bounded,  whereas  for  r]  >  r}0  it  is  not;  hence  rj0  is  called  the  computational  cutoff  rate. 
There  are  several  disadvantages  to  sequential  decoding: 

1.  The  number  of  incorrect  path  branches,  and  consequently  the  computation  complexity,  is  a 
random  variable  depending  on  the  channel  noise. 

2.  To  make  storage  requirements  easier,  the  decoding  speed  has  to  be  maintained  at  10  to  20 
times  faster  than  the  incoming  data  rate.  This  limits  the  maximum  data  rate  capability. 

3.  The  average  number  of  branches  can  occasionally  become  very  large  and  may  result  in  a 
storage  overflow,  causing  relatively  long  sequences  to  be  erased. 

A  third  technique  for  decoding  convolutional  codes  is  feedback  decoding,  with  thresh¬ 
old  decoding6  as  a  subclass.  Threshold  decoders  are  easily  implemented.  Their  performance, 
however,  does  not  compare  favorably  with  the  previous  two  methods. 

15.7  TRELLIS  DIAGRAM  OF  BLOCK  CODES 

Whereas  a  trellis  diagram  is  connected  with  convolutional  code  in  a  direct  and  simple  way,  a 
syndrome  trellis  can  also  be  constructed  for  a  binary  linear  (h,  k)  block  code  according  to  its 
parity  check  matrix^  H .  The  construction  can  be  stated  as  follows: 
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•  There  are  min ( 2^ ,  2"  *)  possible  states  in  the  trellis. 

•  The  state  of  a  codeword  at  instant  /  is  determined  by  the  codeword  and  the  parity  check 
matrix  by  syndrome  from  the  first  codeword  bit  to  the  rth  codeword  bit: 

Zi  =  c\hi  ©  C2^2  ©  •  •  •  ®  Cihj.  ( 1 5.33) 

Note  that  this  syndrome  trellis,  unlike  the  state  transition  trellis  of  convolutional  code,  is 
typically  nonrepeating.  In  fact,  it  always  starts  from  the  “zero"  state  and  ends  in  “zero”  state. 
Indeed,  this  trellis  is  a  time-varying  trellis.  We  use  an  example  to  illustrate  the  construction 
of  a  syndrome  trellis. 


Example  15.8  Consider  a  Hamming  (7, 4,  3)  code  with  parity  check  matrix 


H  = 


1  1 
0  1 
1  1 


10  10  0 
110  10 
0  10  0  1 


(15.34) 


Sketch  the  trellis  diagram  for  this  block  code. 


For  this  code  there  are  3  error  syndrome  bits  defining  a  total  of  23  =  8  states.  Denote  the 
eight  states  as  (S„,  5,,  S2,  S3,  *4,  S5,  *>,  S7).  There  are  2*  =  24  =  16  total  codewords 
with  7  code  bits  that  are  in  the  null-space  of  the  parity  check  matrix  H.  By  enumerating 
all  16  codewords,  we  can  follow  Eq.  (15.33)  to  determine  all  the  paths  through  the  trellis. 

The  corresponding  time- varying  trellis  diagram  is  shown  in  Fig.  15.14.  Notice  that 
each  path  corresponds  to  a  codeword.  We  always  start  from  state  S0  initially  and  end  at 
the  state  So-  Unlike  the  case  of  convolutional  code,  it  is  not  necessary  to  label  the  trellis 
branches  in  this  case.  Whenever  there  is  a  state  transition  between  different  states,  the 
branch  automatically  corresponds  to  a  “1”  code  bit.  When  a  state  stays  the  same,  then  the 
transition  branch  corresponds  to  a  “0  code  bit. 


Figure  15.14 

Trellis  diagram  of 
a  Hamming 
(7,  4,  3)  code 
with  parity  check 
matrix  of 


Eq.  (15.34). 


Code  bits 


_  hnvP  i  trellis  diagram,  the  Viterbi  decoding  algorithm  can  be  implemented  for  the 

Once  we  have  •  &  c  omplexity.  Maximum  likelihood  detection  of  block  codes 

MLD  °|.‘he  be°tVthan  a  syndrome-based  decoder.  Keep  in  mind  that  the  example  we  show 
can  perform  better  t  O  ^  from  viterbi  decoding.  Clearly,  the  Viterbi  algonthm 

m^ces'more  sense  when  one  is  decoding  a  long  code. 
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15.8  CODE  COMBINING  AND  INTERLEAVING 

Simple  and  short  codes  can  be  combined  in  various  ways  to  generate  longer  or  more  powerful 
codes.  Certainly  there  are  many  possible  ways  of  combining  multiple  codes.  In  this  section, 
we  briefly  describe  several  of  the  most  common  methods  of  code  construction  through  code 
combining. 

Interleaving  Codes  for  Correcting  Burst  and  Random  Errors 

One  of  the  simplest  and  yet  most  effective  tools  for  code  combining  is  interleaving,  the  process 
of  reordering  or  shuffling  (multiple)  codewords  generated  by  the  encoder.  Thus,  a  burst  of  bit 
errors  will  be  affecting  multiple  codewords  instead  of  one.  The  purpose  of  interleaving  is  to 
disperse  a  large  burst  of  errors  over  multiple  codewords  such  that  each  codeword  needs  to 
correct  only  a  fraction  of  the  error  burst.  This  is  because,  in  general,  random  error  correcting 
codes  are  designed  to  tackle  sporadic  errors  in  each  codeword.  Unfortunately,  in  most  practical 
systems,  we  have  errors  of  both  kinds.  Among  methods  proposed  to  simultaneously  correct 
random  and  burst  errors,  interleaving  is  simple  and  effective. 

For  an  (n,k)  code,  if  we  interleave  X  codewords,  we  have  what  is  known  as  a  (Xn,Xk) 
interleaved  code.  Instead  of  transmitting  codewords  one  by  one,  we  group  X  codewords  and 
interleave  them.  Consider,  for  example,  the  case  of  X  =  3  and  a  two-error  correcting  (15, 
8)  code.  Each  codeword  has  15  digits.  We  group  codewords  to  be  transmitted  in  groups  of 
three.  Suppose  the  first  three  code  words  to  be  transmitted  are  x  =  (jcj ,  *2,  • .  • ,  *15), J  = 
(yi>  V2>  ••• »  y  15).  and  z  =  (zi,  Z2*  •  • • *  Z\5)>  respectively.  Then  instead  of  transmitting  xyz 
in  sequence  as  x\,  *2,  - .  .,*15,  yu  yi *  •  • . ,  Vi5»  Zi*  Z2»  •  • .  ,Zi5,  we  transmit  jci,  yu  Zl*  *2*  y* 
Z2>  *3*  Z3>  ...**15,  y\5i  z\5 •  This  can  be  explained  graphically  by  Fig.  15.15,  where  X 

codewords  (three  in  this  case)  are  arranged  in  rows.  In  normal  transmission,  we  transmit  one 
row  after  another.  In  the  interleaved  case,  we  transmit  columns  (of  X  elements)  in  sequence. 
When  all  the  15  (/t)  columns  are  transmitted,  we  repeat  the  procedure  for  the  next  X  codewords 
to  be  transmitted. 

To  explain  the  error  correcting  capabilities  of  this  interleaved  code,  we  observe  that  the 
decoder  will  first  remove  the  interleaving  and  regroup  the  received  digits  as  x\9  *2,  •  •  •  1  *15* 
3M*  •  •  •  >  y  15*  Zi*  Z2,  .  • . ,  z\s-  Suppose  the  digits  in  the  shaded  boxes  in  Fig.  15.15  were  in 

error.  Because  the  code  is  a  two-error  correcting  code,  up  to  two  errors  in  each  row  will  be 
corrected.  Hence,  all  the  errors  in  Fig.  15.15  are  correctable.  We  see  that  there  are  two  random, 
or  independent,  errors  and  one  burst  of  length  4  in  all  the  45  digits  transmitted.  In  general,  it 
the  original  ( n ,  k)  code  is  /-error  correcting,  the  interleaved  code  can  correct  any  combination 
of  /  bursts  of  length  X  or  less. 

Because  of  the  interleaver  described  in  Fig.  15.15  takes  a  block  bits  and  generates  output 
sequence  in  a  fixed  orderly  way,  interleavers  of  this  kind  are  known  as  block  interleavers. 
The  total  memory  length  of  the  interleaver  is  known  as  the  interleaving  depth.  Interleavers 


Figure  15.15 

A  block 
(nonrandom) 
interleaver  for 
correcting 
random  and 
burst  errors. 


input  x 
< - 


input  y 
< - 


input  z 
< - 
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Figure  15.16 

Product  code 
formed  by  two 
encoders 
separated  by  a 
block  interleaver. 
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with  larger  depths  can  better  handle  longer  bursts  of  errors,  at  the  cost  of  larger  memory 
and  longer  encoding  and  decoding  delays.  A  more  general  interleaver  can  pseudorandomly 
reorder  the  data  bits  inside  the  interleaver  and  output  the  bits  in  an  order  known  to  both  the 
transmitter  and  the  receiver.  Such  an  interleaver  is  known  as  a  random  interleaver.  Ran¬ 
dom  interleavers  are  generally  more  effective  in  combating  both  random  and  burst  errors. 
Because  they  do  not  generate  outputs  following  a  fixed  order,  there  is  a  much  smaller  prob¬ 
ability  of  receiving  a  burst  of  error  bits  in  a  codeword  because  of  certain  random  error 

patterns. 

Saved  co^etan  be  generalized  by  fntthcrencoding  .he  interleaved  ertewonh.  The  result- 

'"gl ?6  tow mform  a  product  code  from  two  systematic  block  codes  that  are 

Figure  15.  fifSt  i$  an  (M|>  £,)  code  and  the  second  is  an  (n 2,  kj)  code. 

known  “  2yna"ectangular  block  of  it,  x  k2  message  bits  is  encoded  by  two  encoders.  First. 
More  specific  y.  encoded  by  the  first  encoder  into  k2  codewords  of  the  (n,,  k\) 

bl“  ‘."sends  n,  Nock,  of  k2  bits  into  the  second  encoder. 

Th^second  (n2.  «  encoder  adds  n2  -  *2  parity  bits  for  each  of  the  »,  blocks,  generattng  „ 

codewords  of  the  block  codes  into  a  single  more 

from  each  of  the  two  codes. 


Concatenated  Codes  product  code  that  a  block  interleaver  connects  the  two 

Note  from  the  bloC^  ;  Uy,  as  shown  in  Fig.  15.17,  the  two  component  codes  need  not 
component  codes.  More  genciai  jr. 
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Figure  15.17 

Concatenated 
code  with  a 
nonbinary  outer 
code  and  a 
binary  inner 
code. 


Outer  code  Inner  code 


be  limited  to  binary  block  codes,  and  a  more  general  interleaver  FI  can  be  used.  The  resulting 
code  is  known  as  a  concatenated  code.  Indeed,  Forney8  proposed  concatenating  one  binary 
and  one  nonbinary  code  to  construct  a  much  more  powerful  code.  It  is  clear  that  product  codes 
are  a  special  class  of  concatenated  codes  with  binary  component  codes  and  a  block  interleaver. 

In  this  serial  concatenation,  encoder  2  is  known  as  the  inner  code  whereas  encoder  1  is 
known  as  the  outer  code.  The  most  successful  concatenation  as  proposed  by  Forney8  uses  a 
Reed-Solomon  outer  code  and  a  binary  convolutional  inner  code.  The  concatenated  code  can 
be  decoded  separately  by  first  decoding  the  inner  code  before  de-interleaving  and  decoding  the 
outer  code.  More  complex  ways  of  iterative  decoding  are  also  possible  to  potentially  achieve 
better  performance. 


15.9  SOFT  DECODING 

Thus  far,  we  have  focused  on  decoding  methods  that  generate  hard  decisions  based  on  either 
maximum  likelihood  or  syndrome-based  algebraic  decoding.  Hard-decision  decoding  refer  to 
the  fact  that  the  decoder  generates  only  the  most  likely  codeword  without  providing  the  relative 
confidence  of  this  decoded  codeword  with  respect  to  other  possibilities.  In  other  words,  the 
hard-decision  decoded  codeword  does  not  indicate  how  confident  the  decoder  is  about  this 
decision.  A  stand-alone  decoder  can  function  as  a  hard-decision  decoder  because  its  goal  is  to 
provide  the  best  candidate  as  the  decoded  codeword.  It  does  not  have  to  indicate  how  much 
confidence  can  be  placed  in  this  decision. 

In  practice,  however,  a  decoder  is  often  operating  in  conjunction  with  other  decoders  and 
other  receiver  units.  This  means  that  the  decoded  codeword  not  only  must  meet  the  constraint 
of  the  current  parity  check  matrix,  its  output  must  also  satisfy  other  constraints  such  as  those 
imposed  by  the  parities  of  different  component  codes  in  a  concatenated  error  correction  code. 
By  providing  more  than  just  one  hard  decision,  a  soft-decision  decoder  can  output  multiple 
possible  codewords,  each  with  an  associated  reliability  (likelihood)  metric.  This  kind  of  soft 
decoding  can  allow  other  units  in  the  receiver  to  jointly  select  the  best  codeword  by  utilizing 
the  soft  (reliability)  information  from  the  decoder  along  with  other  relevant  constraints  that 
the  codeword  must  satisfy. 

It  is  more  convenient  to  illustrate  the  soft  decoding  concept  by  means  of  a  BPSK  modu¬ 
lation  example.  Let  us  revisit  the  optimum  receiver  of  Sec.  1 1 .6.  We  will  focus  on  the  special 
case  of  binary  modulation  with  modulated  data  symbol  represented  by  b\  —  ±  1  under  additive 
white  Gaussian  noise  channel.  Let  Cjj  denote  the  /th  code  bit  of  the y  th  codeword  cj.  Because  the 
modulation  is  BPSK,  the  relationship  between  the  code  bit  cj ,  and  its  corresponding  modulated 
symbol  bjj  is  simply 


tyj  —  2  •  Cjj  —  1 
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written  as 


r,  =  y/lTbbjj  +  w,  i=l,2 . n  (15.35) 

Here  w,  is  an  AWGN  sample.  We  use  C  to  denote  the  collection  of  all  valid  codewords.  Based 
on  the  optimum  receiver  of  Sec.  11.6  |Eq.(  11.91)  and  Fig.  11.18],  the  maximum  likelihood 
decoder  (MLD)  of  the  received  signal  under  coding  corresponds  to 

c  =  arg  max  ^  rjbjj 

C y  €C 

=  arg  max  ^  n(2cjj  -  1 ) 

Cj€C 

=  arg  max  }  rjCjj  ( 1 5.36) 

CjeC  t—1 

Among  all  the  2*  codewords,  the  soft  MLD  not  only  can  determine  the  most  likely  codeword 
as  the  output,  it  should  also  preserve  the  metric 

Mj  =  nbj,i 

as  the  relative  likelihood  of  the  codeword  Cj  during  the  decoding  process.  Although  equiva¬ 
lent  to  the  distance  measure,  this  (correlation)  metric  should  be  maximized  for  MLD.  Unlike 
distance,  the  correlation  metric  can  be  both  positive  and  negative. 

Although  the  soft  MLD  appears  to  be  a  straightforward  algorithm  to  implement,  its  com¬ 
putational  complexity  is  affected  by  the  size  of  the  code.  Indeed,  when  the  code  is  long  with 
a  very  large  k  the  computational  complexity  grows  exponentially  because  2  metrics  must 
be  calculated.  For  many  practical  block  codes,  this  requirement  becomes  unmanageable  when 

the  code  length  exceeds  several  hundred  bits.  . 

To  simplify  this  optimum  decoder.  Chase  proposed  several  types  of  suboptimum  soft 
decoding  algorithms9  that  are  effective  at  significantly  reduced  computational  cost.  The  first 
steo  of  the  Chase  algorithms  is  to  derive  temporary  hard  bit  decisions  based  on  the  received 
samples  r,.  These  temporary  bits  do  not  necessarily  form  a  codeword.  In  other  words,  find 

y  =  [yi  y2  •••  yn\  (15.37a) 


where 


yi  =  sign  (n) 


i  =  1,  2 . n 


(15.37b) 


Each  individual  bit  decision  has  reliability  l-V|.  These  temporary  bits  (,,  |  are  sen, ,,,  an  algebraic 
Each  indiv  uiu  ,  error  syndromes.  The  result  is  an  initial  codeword  c«  = 

decoder  based  on,  ^  -s  exact)y  the  same  as  a  conventional  hard-decision  decoder. 

[c0,i  co.2  '  Co-T  .  .  |low  additional  modifications  to  the  hard  decoder  input  y  by 

However.  Chase  a Igonthm  aho  ^  changjng  „  code  bit  from  ,  „  and  from  0  ,o  1. 

mpping  the  „  |  n»* provide  multiple  candidate  codewords,  each  with  an  asso- 

cia.e^e,thd"a  aase  a,Pgor„hms  generate  most  My  dtp  patterns  he  used  to 


■■ 


i 
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modify  the  hard  decoder  input y.  Each  flip  pattern  e,*  consists  of  Is  in  bit  positions  to  be  flipped  1  5 

and  Os  in  the  remaining  bit  positions.  For  each  flip  pattern  */,  construct 

dj  =  hard  decision  (y,)  ©  e,  (15.38a) 

and  compute  the  corresponding  reliability  metric 

n 

M,  =  J2rJ-  -  D  (15.38b) 

7=1 

The  codeword  with  the  maximum  A//  is  the  decoded  output. 

There  are  three  types  of  Chase  algorithm.  First,  we  sort  the  bit  reliability  from  low  to  high: 

ki,l  <  kf2|  <  •••  <  hj  (15.39) 


Type  1  Test  all  flipping  patterns  of  weight  less  than  or  equal  to  (dmm  -  1). 

Type  2  Identify  the  [dmin/2J  least  reliable  bit  positions  {i\  z2  •  •  •  f’|jmin/2j  )•  Test  flipPinS 
patterns  of  weight  less  than  or  equal  to  L^min/2  -  1J.* 

Type  3  Test  flipping  patterns  of  weight  w  =  1,  3,  . . . ,  Jmin  -  1  by  placing  Is  in  the  w  least 
reliable  bit  positions. 


The  block  diagram  of  Chase  algorithms  is  shown  in  Fig.  1 5. 1 8.  The  three  Chase  algorithms 
differ  only  in  how  the  flipping  patterns  are  generated.  In  addition,  we  should  note  that  Chase 
decoders  can  exchange  reliability  and  likelihood  information  with  other  receiver  units  in  a  joint 
effort  to  improve  the  decoding  performance.  From  the  input  end,  the  set  of  flipping  patterns  can 
take  additional  suggestions  from  other  receiver  units.  From  the  output  end,  multiple  codeword 
candidates,  along  with  their  reliability  metrics,  can  be  sent  to  additional  decoding  units  for 
further  processing  and  eventual  elimination. 


Figure  15.18 

Block  diagram  of 
Chase 

soft-decoding 

algorithms. 
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15.10  SOFT-OUTPUT  VITERBI  ALGORITHM  (SOVA) 

Chase  algorithms  can  generate  multiple  candidate  codewords  and  the  associated  reliability 
metrics.  The  metric  information  can  be  exploited  by  other  receiver  processing  units  to  deter¬ 
mine  the  final  decoded  codeword.  If  the  decoder  can  produce  soft  reliability  information  on 
every  decoded  bit,  then  it  can  be  much  better  utilized  jointly  with  other  soft-output  decoders  and 
processors.  Unlike  Chase  algorithms,  soft-output  Viterbi  algorithms  (SOVA)10  and  the  maxi¬ 
mum  a  posterior ;  (MAP)  algorithms  are  two  most  general  soft  decoding  methods  to  produce 
bit  reliability  information.  We  first  describe  the  principles  of  SOVA  here. 

The  most  reliable  and  informative  soft  bit  information  is  the  log-likelihood  ratio  (LLR) 
of  a  particular  code  bit  c,  based  on  the  received  signal  vector 

r  =  (n,  T2 .  rn) 

In  other  words,  the  LLR11  as  defined  by 


A(c,)  =  log 


P[c,  =  1  |r  =  r| 
P[c,  =  0|r  =  r| 


(15.40) 


indicates  the  degree  of  certainty  by  the  decoder  on  the  decision  of  c,  =  1.  The  degree  of 
certainty  various  from  -oo  when  P  [c,  =  0|r]  =  1  to  +oo  when  P  [c,  =  0|r|  =  0 

Once  again,  we  consider  the  BPSK  case  in  which  (2c,  -  1)  =  ±1  is  the  transmuted 

data  and 

r,  =  (2c,  -  1)  +  w,  i=  1,2 . n  05.41) 


where  w  is  the  AWGN  sample.  Similar  to  the  Chase  algorithms,  the  path  metric  is  computed 
by  the  correlation  between  {r,}  and  the  BPSK  signal  {c,(.  In  other  words,  based  on  the  received 
data  samples  { r , } ,  we  can  estimate 


n2 

path  metric  between  stages  n\  and  n2  =  0  •  (2c,  -  1 ) 

>=".  +  ! 


(15.42) 


,.  .  . a,0„rithm  the  SOVA  decoder  operates  on  the  corresponding  trellis 

Like  the  traditional  Vrtert  ^va  consists  of  a  forward  step  and  a  backward  step. 

°n  ^  TV,^ri  »  i"  .hto„  v,„,,ona,  VUerbi  algorithm.  SOVA  M  rind,  ,he  - 
Dunng  the  forwa  P-  UnHke  conventional  VA,  which  stores  only  the  surviving  path 

likely  sequence  <survl  p  '  SOVA  stores  the  metric  of  every  surviving  path  leading 

metrics  at  the  states  in  the  current  stage, 

to  a  state  for  all  stages.  ...  * 

To  formulate  the  idea  formally,  denote 

Se(i)  =  state  i  at  stage  (time)  / 

,  .  ;n  stage  /  we  will  determine  the  forward  path  metric  leading  to  this 

For  each  survivor  at  state  5>  8  ^  ^  £  at  tjme  (  are  denoted  as  Mft(i).  The  maximum 

state.  These  forward  met  «  forward  VA  denoted  ^.corresponds  to  the  optimum 

total  path  metnc  at  the  finai  SQVA  then  applies  VA  backward  from  the  terminal 

forward  path.  During  the  bacKwaru  h- 
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(final)  state  at  stage  K  and  ends  at  the  initial  state  at  stage  0.  also  storing  the  backward  metrics 
ending  in  state  t  at  stage  /  as 

Since  the  likely  value  of  the  information  bit  dj  =  0.  1  that  leads  to  the  transition  between 
state  S(a (i  —  1)  and  state  Seb(i)  has  been  identified  by  VA  during  the  forward  step,  the  metric 
of  information  bit  can  be  fixed  as  total  path  metric 


Mi(di)  =  M 


max 


Our  next  task  is  to  determine  the  best  path  and  the  corresponding  maximum  path  metric 
A//(l  —  dj)  if  the  opposite  information  bit  value  of  1  —  dj  is  chosen  instead  at  stage  i 


M(l-dj) 


0)  +  B(at  (b  +  Mbh  (S«k(i))j 


(15.43) 


where  Btayib  is  the  path  distance  from  state  transition  ta  to  with  respect  to  the  received 

sample  r,.  The  maximization  is  over  all  state  transitions  denoted  by  (£a— >tb)  that  can  be 
caused  by  the  information  bit  value  of  1  -  di  at  stage  /. 

This  step  allows  us  to  find  the  best  alternative  path  through  the  trellis  if  the  alternative 
bit  value  1  —  dx  is  selected.  Now  that  we  have  both  and  Af,-(1  —  dj)  for  every  stage  /, 

likelihood  of  every  information  bit  is  proportional  to  the  metric  difference 


A/  =  Mi(  1)  -  A/,(0)  =  (2 di  -  \)[Mi(di)  -  Af|(l  -  d{j\ 

=  (2 di  -  1)  [A/max  -  Mg(l  ~  di)]  (15.44) 


Hence,  the  log-likelihood  ratio  A,  can  generated  by  SOVA  for  every  information  bit  di.  We 
now  can  use  the  survivor  path  to  determine  the  LLR  [Eq.  (15.40]  for  every  bit  in  this  most 
likely  sequence.  The  basic  concept  of  finding  the  best  alternative  surviving  path  caused  by  an 
information  bit  value  of  1  -  di  is  illustrated  in  Fig.  15.19. 


Figure  15.19 

Block  diagram  of 
Chase  son 
decoding 
algorithms. 


Initial 

state 


State  I 


State  2 


State  3 


Info  bit  dt 


on  the  best  path 


Best  (surviving) 
path  with  input  1  —  di 


Final 

state 


15.11  Turbo  Codes  951 


15. 11  TURBO  CODES 

As  we  briefly  mentioned  earlier  in  section  15.1.  turbo  codes  represent  one  of  the  major  break¬ 
throughs1"  in  coding  theory  over  the  past  several  decades.  The  mechanism  that  made  turbo 
codes  possible  is  its  simplified  decoder.  Turbo  codes  would  not  have  been  possible  without  a 
soft  decoding  algorithm.  In  fact,  a  short  paper  published  more  than  30  years  earlier  by  Bahl, 
Cocke,  Jelinek.  and  Ravivl?  played  a  major  role.  Their  maximum  a  posterior  (MAP)  soft 
decoding  algorithm  is  known  as  the  BCJR  algorithm.  Before  describing  the  essence  of  turbo 
codes,  we  introduce  the  fundamentals  of  BCJR  algorithm. 


BCJR  Algorithm  for  MAP  Detection 

Our  description  of  the  BCJR  MAP  algorithm  is  based  on  the  presentation  by  Bahl.  Cocke, 
Jelinek,  and  Raviv.13  We  first  assume  that  a  sequence  of  information  data  bits  denoted  by 

d\d2  ■■■  dN  05.45) 


The  information  bit  {</,}  are  encoded  into  codeword  bits  {v,  },  which  are  further  modulated  into 
(complex)  modulated  data  symbols  {*>,}•  In  the  general  case,  we  simply  note  that  there  is  a 

mapping  of 


{di\  — >  {bi) 


(15.46) 


In  the  special  case  of  BPSK,  bj  —  ±  1 . 

The  modulated  data  symbols  are  transmitted  in  an  i.i.d.  noise  channel,  and  the  received 
signal  samples  are 

r,  =  bj  +  w,  (15.47) 


in  which  w,  are  i.i.d.  noise  samples.  Following  MATLAB  notation,  we  denote  the  received 
data 


r*j  ;j^2  —  0h\  *  ^k\ + 1  ’  •  •  •  ’  ^2  ^ 

r  =  (n,  n,  ryv) 


Because  the  data  symbols  and  the  channel  noise  are  i.i.d.,  we  conclude  that  the  conditional 
probability  depends  only  on  the  current  modulated  symbol 

p(n\bi,  r,:,-i)  =  p(n\bi)  (15.48) 


The  (convolutional  or  block)  code  is  represented  by  a  trellis  diagram  in  which  5,  =  m 
denoteT theevent  .hat  the  .rellis  state  iS  m  a.  .ime  i.  The  , tans, .ton  probabtltty  be.ween  state 
and  a.  from  stage  f  I  to  stage  i  is  represented  by 

P[5,  =  wt|S,_i  ='»'] 

.  .  f  u  trellis  means  that  5,  is  a  Markov  process.*  Based  on  the  properties 
“v'^ses^d  the  knowledge  .ha,  fct*  and  f,;,  are  independent,  we  have  the 


v  iTa  Markov  process  if  its  conditional  probability  satisfies 

•A-atom preeess f  -/W*.*.-.’ 

„  ,  Make,  _  Has  .  «  —»  -  - 1"“'”  —  "  -  — . . 

is  available  in  its  immediate  past  *t-l  • 
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following  simplifications  of  the  conditional  probabilities: 


p  (n+i-.N |S,  =  "t,  5,_i  =  m,  r\:i)  =  p  (h+i:S  1 5/  =  m ) 
p  (n.  Si  =  m|5,_i  =  m,  =  p  (r„  5,  =  m|S,_i  =  m  ) 

The  MAP  detector  needs  to  determine  the  log-likelihood  ratio 


A(4)  =  log 


P[di  =  l\r] 
P[di=  ojr] 


=  log 


=l,r) 

p  (di  =  0.  r ) 


(15.49a) 

(15.49b) 


(15.50) 


We  are  now  ready  to  explain  the  operations  of  the  BCJR  algorithm.  First,  let  &i(u)  denote 
the  set  of  all  possible  state  transitions  from  S/_i  =  m  to  5/  =  m  when  di  =  u  (u  =  0,  1). 
There  are  only  two  such  sets  for  di  =  1  and  dj  =  0.  We  can  see  that 


p(d,  =  \,r)=  ^2  p(Si-\  =  m',  S,  =  m,r) 

(m\/n)€ft/(  1) 

=  p  ($-1  =  m  '  H:i»  Si  =  m,  n+l:N  )  (15.51) 

(m\/w)e£2,(l) 

=  p  (5f-'  =  m  '  Si  =  m,  r, ) 

•  P  ( h+UN  1 5,-i  =  m,  ri:/,  5,  =  m ) 


Applying  Eqs.  (15.49a)  and  (15.49b)  to  the  last  equality,  we  have 

p(di  =  \,r)=  ^  p(Si-i  =  Sj  =  m,  n)  ■  p(ri+i:N\Si  =m)  (15.52) 

(m',m)€Q/(l) 

=  H  P  (5/-1  =  m\  r1:,_i )  •  p  (Si  =  m,  r,  |5/-i  =  m  ) 

1) 

■p(n+  UN  I  Si  =  m) 

Applying  the  notations  used  by  Bahl  et  al13.,  we  define 

«i-l  (in')  =  p  (Si- 1  =  rri,  )  (15.53a) 

Pi(m)  =  p(h+i-.N  1 5/  =  m)  (15.53b) 

Yi(m,  m)  =  p  (Sj  =  m,  r,-  |5,_i  =  rri  )  (15.53c) 


Given  the  notations  in  Eq.  ( 1 5.53),  we  can  use  Eqs.  ( 1 5.50)  to  ( 1 5.52)  to  write  the  LLR  of  each 
information  bit  d,  as 


A(rf,)  =  log 


V 

y 


<Xi-\{m')Yi{m' ,  m)f}j(m) 


(15.54) 


«/- 1 (w')y, •(/«',  m)Pi(m) 
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This  provides  the  soft  decoding  information  for  the  tth  information  bit  cl,.  The  MAP  decoding 
can  generate  a  hard  decision  simply  by  taking  the  sign  of  the  LLR 


u  =  sign  [ A(<// )] 


To  implement  the  BCJR  algorithm,  we  apply  a  forward  recursion  to  obtain 
that  is, 


<*i(m)  -p{Si  =  f|:i  ) 

=  Yhp  =  m'  Si~l  =  m  '  :/-i.  n ) 

m! 

=  (Si  =  m,  r,  |5,_1  =  m,  ri.i-i )  p  (S,_i  =  m,  ri-j-i ) 

m' 

=  m)  -cti-xim')  (15.55) 

m' 

The  last  equality  comes  from  Eq.  (15.49b).  The  initial  state  of  the  encoder  should  be  So  =  0. 
In  other  words. 


ao(m)  =  P[S0  =  m]  =  <5[wt]  =  j()  m  ^  () 

from  which  the  forward  recursion  can  proceed.  The  backward  recursion  is  for  computing 
Pi-i(m')  from Pi(m): 

Pi- 1  ( m )  =  p  ( n-N  |S<- \=m  ) 

=  y^p  (Si  =  m,  n,  h+UN  |$-l  =  m  ) 

m 

=  J^p(ri+UN  1 5/- 1  =  m,  Si  =  m,  r„  )  p(Si=m,  r,|S,_i  =  m  ) 

m 

=  y2p(h+V.N  | Si  =  m,)  yi(m\  m) 
m 

=  Yh  * 1 5'56* 

m 

For  an  encoder  with  a  known  terminal  state  of  SN  =  0,  we  can  start  the  backward  recursion 
from 


pN(m)  =  8[m] 


3m  which  the  backward  recursion  can  be  initialized. 

Mn.irp  that  both  the  forward  and  backward  recursions  depends  on  the  function  Yi(m  .  m). 
farty^',  m)  is  already  in  a  simple  matrix  form.  The  entry  n(m',  m)  can  be  simplified  and 
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derived  from  the  basic  modulation  and  channel  information: 


Yi(m\  m)  =  p  (Si  =  m,  n  |S,_i  =  m  ) 

=  P  (n  |5,_1  =  m.  Si  =  m)  ■  P[Si  =  m  |S,_,  =  m  ] 
=  p  (n  \ci[m\  m\)  ■  P[dj  =  u  ] 


(15.57) 


where  c,[m',  m\  is  the  codeword  from  the  encoder  output  corresponding  to  the  state  transition 
from  in'  to  m,  where  as  d,  =  u  is  the  corresponding  input  bit.  To  determine  yi(mr,  in)  for  di  =  u 
according  to  Eq.  (15.57),  P\rj  |c,[m  ,  ni\\  is  determined  by  the  mapping  from  encoder  output 
£y[/77 ,  m\  to  the  modulated  symbol  bj  and  the  the  channel  noise  distribution  w /. 

In  the  special  case  ot  the  convolutional  code  in  Fig.  15.5,  for  every  data  symbol  dj,  the 
convolutional  encoder  generates  two  coded  bits  {v,j,  v,-2}.  The  mapping  from  the  coded  bits 
Ki,  vI>2}  to  modulated  symbol(s)  bt  depends  on  the  modulations.  In  BPSK,  then  each  coded 
bit  is  mapped  to  ±  1  and  bj  has  two  entries 


If  QPSK  modulation  is  applied,  then  we  can  use  a  Gray  mapping 


bi  = 


where 


°.  {v/.i,  v,-,2)  =  {0,  0} 

{v/,i,  vi2)  =  (0,  1) 
{v/,i,  v(>2)  =  {1,  1} 
-*A  {Vi,i,  v/>2}  =  {1,0} 


Hence,  in  a  baseband  AWGN  channel,  the  received  signal  sample  under  QPSK  is 


=  +  w, 


(15.58) 


in  which  w is  the  complex,  i.i.d.  channel  noise  with  probability  density  function 


As  a  result,  in  this  case 


P(n\ci[m',  m])  =p(n\di  =  u ) 


=  P  (n  I  bi  = 

=  Pw  (n  -  sJ~Ese>4’^ 


(15.59) 
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The  BCJR  MAP  algorithm  can  compute  the  LLR  of  each  information  bit  according  to 


A  (di)  =  log 


y 

{—‘(in' 


(m',m)€£2,(l) 


otj-\(m)p(n  \ci[m,  m\)P\dj  =  1] 


y 


a,_i(m')/7(r,  |c,[m\  m])  P\d,  =  0| fij(m) 


p<j.  _  n  V\  ,  ,  0„  a,-_i(m')c/[m\ 

P[a,  —  1J  ,  t—i(m',m)€Sli(l) 

=  log  77-7 - —  +  log 


;m=o] 

A  «■)(</,) 


(/n',m)e£2/(0) 


(15.60) 


A«><4) 


Equation  (15.60)  shows  that  the  LLR  of  a  given  information  symbol  d,  consists  of  two  parts: 

•  The  a  priori  information  A,-a)  from  the  prior  probability  of  the  data  symbol  </,,  which  may 
be  provided  a  priori  or  externally  by  another  decoder. 

•  The  local  information  A|n  that  is  specified  by  the  received  signals  and  the  code  trellis  (or 
state  transition)  constraints. 

With  this  decomposition  view  of  the  LLR.  we  are  now  ready  to  explain  the  concept  of  turbo 
codes,  or  more  appropriately,  turbo  decoding. 


Theconcept  of  turbo  codes  was  first  proposed  by  Berrou,  Glavieux,  and  Thitimajshima1-  in 
1993  at  the  annual  IEEE  International  Conference  on  Communications.  The  authors  claim  of 
near-Shannon-limit  error  correcting  performance  was  initially  niet  with  great  skepticism.  This 
reaction  was  natural  because  the  proposed  turbo  code  exhibited  BER  performance  within  1  dB 
ofThe  Shannon  limit  .ha,  had  t*en  considered  to  he  exttemely  challengtng.  tl  no,  tntposstble 
to  achieve  under  reasonable  computational  complexity.  Moreover,  the  construction  of  the 
ii.Hn.rho  codes  does  not  take  a  particularly  structured  form.  It  took  nearly  two  years  for 
the'e rfng  community  to  become  convinced  of  the  extraordinaty  BER  performance  of  turbo 
codesattd  to  toin  to  understand  their  principles.  Today,  turbo  codes  have  penneated  many 
aspects  of  digital  communications,  often  Inking  specially  evolved  forms.  In  thts  pan  of  the 
aspects  o  c  introduction  to  the  basic  principles  of  turbo  codes. 

^TbteSram  of  the  firs,  turbo  encoder  is  shown  in  Fig.  15.20a.  This  turbo  consists  of 
two  recursive  systematic  convolutional  RSC  codes.  Rcpresenttng  a  ««,.  delay  as  O.  the  I  x  2 
generator  matrix  of  the  rate  1/2  RSC  code  ts  ol  the  fonn 

G(D)  =  [l 


In 


particular,  the  example  turbo  code  of  Berrou  et  a.  “  was  specified  by 

r,  \+dLlql±d^\ 

G(D)  -  j^l  i  +D  +  ZT  J 

of  the  encoder  is  shown  in  Fig.  15.20b. 


The  simple  implementation  < 
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H 


Figure  15.20  d( 

Parallel  concate¬ 
nated  turbo 
code:  (a)  rate 
1  /3  turbo 
encoder;  (b) 
implementation 
of  recursive 
systematic 
convolutional 
(RSC)  encoder 

g ;  (D)=  1 +D+D4, 

92tDhl+  [P+lP+D4. 


Information  bits 


di 


Rate  1/2 
RSC  code 


Rate  1/2 
RSC  code 


(a) 


(b) 


In  this  example,  a  frame  of  information  bits  d{  is  sent  through  two  RSC  encoders.  Both 
convolutional  codes  have  rate  1/2  and  are  systematic.  Thus,  the  first  RSC  encoder  generates  a 
frame  of  coded  bits  p\  of  length  equal  to  the  information  frame.  Before  entering  the  second 
RSC  encoder,  the  information  bits  are  interleaved  by  a  random  block  interleaver  n.  As  a  result, 
even  with  the  same  encoder  structure  as  the  first  encoder,  the  second  encoder  will  generate 
a  different  coded  bit  frame  pf\  The  overall  turbo  code  consists  of  the  information  bits  and 
the  two  coded  (parity)  bit  streams.  The  code  rate  is  1/3,  as  the  turbo  code  has  two  coded 
frames  for  the  same  information  frame.  Then  [dh  pf\ p^)  are  modulated  and  transmitted 
over  communication  channels.  Additional  interleavers  and  RSC  encoders  can  be  added  to 
obtain  codes  that  have  lower  rates  and  are  more  powerful. 

(l)  c(^jstrucl  tur*:)0  c°des  that  have  higher  rates,  the  two  convolutional  encoder  outputs 
pi  and/?f.  can  be  selectively  but  systematically  discarded  (e.g.,  by  keeping  only  half  the  bits 

in  Pi  and  p.  ).  This  process,  commonly  referred  to  as  puncturing,  creates  two  RSC  codes 
that  are  more  efficient,  each  of  rate  2/3.  The  total  turbo  code  rate  is  therefore  1/2,  since  for 
every  information  bit,  there  are  two  coded  bits  (one  information  bit  and  one  parity  bit). 

^us'  essence  °f  turbo  code  is  simply  a  combination  of  two  component  RSC  codes, 
t  ough  each  component  code  has  very  few  states  and  can  be  routinely  decoded  via  decoding 
a  gorithms  such  as  VA,  SOVA,  and  BJCR,  the  random  interleaver  makes  the  overall  code 
muc  more  challenging  to  decode  exactly  because  it  consists  too  many  states  to  be  decoded 
y  means  of  traditional  MAP  or  VA  decoders.  Since  each  component  code  can  be  decoded 
y  usint  simple  decoders,  the  true  merit  of  turbo  codes  in  fact  lies  in  iterative  decoding,  the 
concept  of  allowing  the  two  component  decoders  to  exchange  information  iteratively. 


iterative  Decoding  for  Turbo  Codes 

important  to  note  that  naive  iteration  between  two  (hard)  decoders  cannot  guarante< 
convergence  to  the  result  of  the  highly  complex  but  exact  turbo  decoder.  Turbo  decoding  i 
made  possible  and  powerful  by  utilizing  the  previously  discussed  BCJR  decoding  algorithn 
(or  its  variations).  Each  component  code  can  be  decoded  by  using  a  BCJR  soft  decoding 

a  tom  m.  R  soft  decoding  makes  it  possible  for  iterative  turbo  decoding  to  exchange  sof 
information  between  the  two  soft  decoders. 

The  !dea  of  iterative  decoding  can  be  simply  described  as  follows.  Given  the  channe 
ou  put,  both  decoders  can  generate  the  soft  information  A (</,•)  according  to  Eq.  (15.60): 
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Figure  15.21 

Exchange  of 
extrinsic 
information 
between  two 
component  BCJR 
decoders  for 
iterative  turbo 
decoding. 


Note  that  A (a) (</,•)  and  A  f\dj)  are  the  a  priori  information  on  the  information  bit  d,  at  decoder 
1  and  decoder  2,  respectively.  Without  any  prior  knowledge,  the  decoders  should  just  treat  them 
as  0  because  dj  =  ±  1  are  equally  likely. 

Iterative  decoding  must  allow  the  two  low  complexity  decoders  to  exchange  information. 
To  accomplish  this,  decoder  1  can  apply  BCJR  algorithm  to  find  the  LLR  information  about  dk 
It  can  then  pass  this  learned  information  to  decoder  2  as  the  a  priori  LLR.  Note  that  this  learned 
information  must  be  unavailable  to  decoder  2  from  its  own  decoder  and  other  input  signals. 
To  provide  innovative  information,  decoder  1  should  remove  any  redundant  information  to 
generate  its  extrinsic  information  A «  2(4)  to  pass  to  decoder  2.  Similarly,  decoder  2  will 
find  out  its  extrinsic  information  A«  ,«)  (previously  unavailable  to  decoder  I)  and  pass 
it  back  to  decoder  1  as  a  priori  information  for  decoder  1  to  refresh/update  its  LLR  on  dk. 
This  closed-loop  iteration  will  repeat  multiple  iterations  until  satisfactory  convergence.  The 
*  i  kwl  digram  of  this  iterative  turbo  decoder  appears  in  Fig.  1 5.2 1 . 
wTnow  use  , he  example  given  by  Bril  el  al.'>  to  expltun  how  to  update  the  extrinsic 
we  now  between  two  soft  decoders.  Figure  1 5.2 1  illustrates  the  basic  signal 

information  for  exchang^  b  interconnected  BCJR  MAP  decoders.  Let  us 

r  r-ss- » -  *  **  ^  *•  *« — 

RSC  code,  the  output  code  bits  corresponding  to  the  information  bit  d,  are 

Ci[m  ,  m\  =  (di,  p\  1 ) 

To  determine  p(r1ilCi[m^  ml),  it  is  —sary  to  spectfy  the  modulation  forma,  and  the  channel 

model.  .  f  BpsK  modulation  under  channel  noise  that  is 

We  consider  the are  two  received  signal  samples  as  a  result  of 
additive,  white,  an  specifically,  from  encoder  1 .  the  channel  output 

the  coded  bits  c,[m',  m]  =  (4,  Pi  >■ 1V,U  1 
consists  of  two  signal  sequences 


rlV  =  jEbOdi  -  1)  +  w, 

rf,)  =  y/Eb(2p\^  —  I)  +  wi.l 

l*p 


(15.62a) 

(15.62b) 
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whereas  from  encoder  2,  the  channel  outputs  are 

r,,s  =  jEb(2dj  -  1)  +  w; 
iff  =  'IEb(2pf)  -  I)  +  W;,2 


(15.63a) 

(15.63b) 


Note  that  the  Gaussian  noises  w w,i,  and  w,,2  are  all  independent  with  identical  Gaussian 
distribution  of  zero  mean  and  variance  JV/2.  The  first  BCJR  decoder  is  given  signals  r(>J  and 
ri./>  to  decode-  whereas  the  second  BCJR  decoder  is  given  signals  r(J  and  r '  "  *  to  decode. 

Let’s  first  denote  Pi[m'  ,  m\  as  the  /th  parity  bit  at  a  decoder  corresponding  to  message  bit 
di.  It  naturally  corresponds  to  the  transition  from  state  m  to  state  m.  For  each  decoder,  the 
received  channel  output  signals  r,  =  [r(J,  ritP]  specifies  y,(w',  m)  via 


=  p(r,  |c/[m',  m])/>(rf,) 

=  P  (n,s,  n,p\dj,  pi[m\  m])  P(di) 

1 

1  ri,s  ~  >/Eb(2di  — 

nAf  P 

1 

'  rts+dp+2E»\r 

CXP 

Af 

X  P(di) 

exp 


M 

2  y/Tb 


%(2pj[m\  m]  -  1)|2  j 

^  [nA2di  -  1 )  +  rj'P(2pi [m,  m]  -  1 )] j 

(15.64) 


Notice  that  the  first  term  in  Eq.  (15.64)  is  independent  of  the  codeword  or  the  transition  from 
m'  to  m.  Thus,  the  LLR  at  this  decoder  becomes 


^(rn'.m)tai(uai-^P(r‘  M1*'-  =  HA(«) 


A(dj )  =  log  .  (w'-m)6^<1) 


(15.65) 


,  P[di  =  1] 

=  log - 1-  log 

6  />[</,=()]  S 


aimai-l(m')p  k-  ndi  =  o  mm) 

(m')exP  +  2npPi[m\  m]pi(m) 

exp  + 2r<> pi[m'' 

By  defining  the  gain  parameter  <  =  4 sfE~b/M,  we  can  simplify  the  LLR  into 

aw, )  =  log =  +  f . „ , + log 

E™„,  “i-i  <«')  'XP  (C  ■  «■])  ft,ml 


FW  =  0] 


iW 


(15.66) 


In  other  words,  for  every  information  bit  </„  the  LLR  of  both  decoders  can  be  decomposed 
into  three  parts  as  in 


A Mi)  =  Aj  a\di)  +  \f\di)  +  Aj  e\di) 


(c), 


iW/ 


7=1,2 
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where  A j“\di)  is  the  prior  information  provided  by  the  other  decoder,  Aj‘  \dj )  is  the  channel 

output  information  shared  by  both  decoders,  and  A J‘](d/ )  is  the  extrinsic  information  uniquely 
obtained  by  the  j th  decoder  that  is  used  by  the  other  decoder  as  prior  information.  This  means 
that  at  any  given  iteration,  decoder  1  needs  to  compute 

A i  Mr)  =  A« ,  (di)  +  f  •  rifS  +  A\t2(di)  ( 1 5.67a) 

in  which  A^Li  (di)  is  the  extrinsic  information  passed  from  decoder  2.  whereas  A^\2(dj)  is 
the  new  extrinsic  information  to  be  sent  to  decoder  2  to  refresh  or  update  its  LLR  via 


A 2(di)  =  A\el2(di)  +  C  •  n,s  +  A2l ,  (di)  ( 1 5.67b) 

At  both  decoders,  the  updating  of  the  extrinsic  information  requires  the  updating  of  a,(w) 
and  Pi(m)  before  the  computation  of  extrinsic  information 


A<*> 


=  log 


w)gfl|(  I ) 

Wl)€pj(0) 


a,-_i  (m')  exp  ({■  •  r^pi [m\  m])  Pi(tn) 
a,_i(m')  exp  (<  •  m])  fy(m) 


(15.68) 


To  refresh  a,(m)  and  /3,(m)  based  on  the  extrinsic  information  A(e),  we  need  to  recompute  at 
each  decoder 


y,(m\  m)  =  p(r,\dl)P(dt)  (,5  69) 

~  {(1  -rf()  +  </,exp[A(f)lJexp(0.5C  r,,J)exp(C  r,>p,[/n',  m])  (15.70) 

Once  decoder  1  has  finished  its  BCJR  decoding,  it  can  provide  its  soft  output  as  the 
prior  information  about  d,  to  decoder  2.  When  decoder  2  finishes  its  BCJR  decoding,  utilizing 
the  prior  information  from  decoder  1,  it  should  prov.de  its  new  soft  information .about d, 
back  to  decoder  1  To  ascertain  that  decoder  2  does  not  feed  back  the  “stale  information 
that  originally  came  from  decoder  1,  we  must  subtract  the  stale  information  before  feedback, 
thereby” providing  only  the  extrinsic  information  A«  2Mr>  back  to  decoder  I  as  “priors” 
for  decoder  I  in  the  next  iteration.  Similarly,  in  the  next  iteration  decoder  I  will  update  its 
soft  output  and  subtract  the  stale  information  that  originally  came  from  decoder  -  to  provide 
refreshed  extrinsic  information  A«  ,  Ml)  as  priors  for  decoder  2.  This  exchange  of  extrinsic 

decoding  performance  of  ,hc: turbo  code  pn,po;ed 

As  an  lttusira  ^  p.  ,5  22.  The  results  demonstrate  the  progressive  perfor- 

by  Berrou  et  a  .  is  p  iterations  during  iterative  soft  decoding.  After  1 8  iterations. 

zs  «  dB  »»y .... 
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,.  -  tlirhn  Codes  researchers  carried  out  a  flurry  of  activitity  aimed 

Following  the  discovery  ^  mQre  powerful,  error  correcting  codes  that  are  suitable 

at  finding  equally  power  ,  ^  another  c|ass  of  near-capacity  codes  known  as 

for  soft  iterative  deco  ing.  *  ^  originally  introduced  by  Gallager15  in  1963.  was 

low-density  parity  chetK  ^  l7  ,995  since  then.  LDPC  code  designs  and  efficient 

rediscovered  by  MacKay  and  Neal 
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Figure  15.22 

The  decoding 
performance  of  a 
rate  1  /2  Turbo 
code  is  shown  to 
be  very  close  to 
the  theoretical 
limit. 

(Reproduced 
with  copyright 
permission  from 
IEEE  from 
Ref.  14.) 
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means  of  LDPC  decoding  have  been  topics  of  intensive  research  in  the  coding  community. 
A  large  number  of  LDPC  codes  have  been  proposed  as  strong  competitors  to  turbo  codes, 
often  achieving  better  performance  with  comparable  code  lengths,  code  rates,  and  decoding 
complexity. 

LDPC  codes  are  linear  block  codes  with  sparse  parity  check  matrices.  In  essence,  the  parity 
check  matrix  H  consists  of  mostly  Os  and  very  few  Is,  forming  a  low-density  parity  check 
matrix.  LDPC  codes  are  typically  quite  long  (normally  longer  than  1000  bits)  and  noncyclic. 
Thus,  an  exact  implementation  of  the  BCJR  MAP  decoding  algorithm  is  quite  complex  and 
mostly  impractical.  Fortunately,  there  are  several  well-established  methods  for  decoding  LDPC 
codes  that  can  achieve  near-optimum  performance. 

The  design  of  LDPC  code  is  equivalent  to  the  design  of  a  sparse  parity  matrix  H .  Once  H 
has  been  defined,  the  LDPC  code  is  the  null-space  of  the  parity  matrix  H .  The  number  of  Is 
in  the  ith  row  of  H  is  known  as  the  row  weight  p,,  whereas  the  number  of  Is  in  the yth  column 
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is  known  as  the  column  weight  yj.  In  LDPC  codes,  both  row  and  column  weights  are  much 
smaller  than  the  code  length  n,  that  is. 


Pi  «  n  yj  «;  n 


For  regular  LDPC  codes,  all  rows  have  equal  weight  p,  =  p  and  all  columns  have  equal 
weight  yi  =  y.  For  irregular  LDPC  codes,  the  row  weights  and  column  weights  do  vary  and 
typically  exhibit  certain  weight  distributions.  Regular  codes  are  easier  to  generate,  whereas 
irregular  codes  with  large  code  length  may  have  better  performance. 

Bipartite  (Tanner)  Graph 

A  Tanner  graph  is  a  graphic  representation  that  can  conveniently  describe  a  linear  block  code. 
This  bipartite  graph  with  incidence  matrix  H  was  introduced  by  R.  M  lanner  in  I981.IS 
Consider  an  (n,  k)  linear  block  code.  There  are  n  coded  bits  and  n-k  parity  bits.  The  Tanner 
graph  of  this  linear  block  code  has  n  variable  nodes  corresponding  to  the  n  code  bits.  These 
n  variable  nodes  are  connected  to  their  respective  parity  nodes  (or  check  nodes)  according  to 
the  Is  in  the  parity  check  matrix  H  A  variable  node  (a  column)  and  a  check  node  (a  row)  are 
connected  if  the  corresponding  element  in  H  is  a  1.  Because  H  is  sparse,  there  are  only  a  lew 
connections  to  each  variable  node  or  check  node.  These  connections  are  known  as  edges.  Each 
row  represents  the  connection  of  a  check  node,  and  each  column  represents  the  connection  ol 
a  variable  node  For  LDPC  codes,  if  the  ith  row  of  H  has  row  weight  of  p,-,  then  the  check 
node  has  p,  edges.  If  column  j  has  column  weight  of  yh  then  the  variable  node  has  y,  edges. 
We  use  an  example  to  illustrate  the  relationship  between  H  and  the  Tanner  graph. 


Example  15.9  Consider  a  Hamming  (7, 4, 3)  code  with  parity  check  matrix 


'1110  10  0 

//=  0  1110  10 

110  10  0  1 


(15.71) 


Determine  its  Tanner  graph. 


cnee*  uuuc  i.  - 7 

Tanner  graph  is  shown  in  Fig.  15.23. 


Figure  15.23 

Tanner  graph  of 
the  (7,  4,  3) 


Variable  nodes 


Hamming  code. 
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Because  LDPC  codes  are  typically  of  length  greater  than  1000.  their  Tanner  graphs  are 
normally  too  large  to  illustrate  in  practice.  However,  the  basic  Tanner  graph  concept  is  very 
helpful  to  understanding  LDPC  codes  and  its  iterative  decoding. 

A  cycle  in  the  Tanner  graph  is  marked  by  a  closed  loop  of  connected  edges.  The  loop 
originates  from  and  ends  at  the  same  variable  (or  check)  node.  The  length  of  a  cycle  is  defined 
by  the  number  of  its  edges.  In  Example  1 5.9,  there  exist  several  cycles  of  length  4  and  length  6. 
Cycles  of  lengths  4  and  6  are  considered  to  be  short  cycles.  Short  cycles  are  known  to  be 
undesirable  in  some  iterative  decoding  algorithms  for  LDPC  codes.  When  a  Tanner  graph  is 
free  of  short  cycles,  iterative  decoding  of  LDPC  codes  based  on  the  so-called  sum-product 
algorithm  can  converge  and  generate  results  close  to  the  full-scale  MAP  decoder  that  is  too 
complex  to  implement  in  practice. 

To  prevent  a  cycle  of  length  4,  LDPC  code  design  usually  imposes  an  additional  constraint 
on  the  parity  matrix  H\  No  two  rows  or  columns  may  have  more  than  one  component 
in  common.  This  property,  known  as  the  “row-column  (RC)  constraint,”  is  sufficient  and 
necessary  to  avoid  cycles  of  length  4.  The  presence  of  cycles  is  often  unavoidable  in  most 
LDPC  code  designs  based  on  computer  searches.  A  significant  number  of  researchers  have 
been  studying  the  challenging  problem  of  either  reducing  the  number  of,  or  eliminating  short 

cycles  of  length  4,  6,  and  possibly  8.  Interested  readers  should  consult  the  book  by  Lin  and 
Costello.2 

We  now  describe  two  decoding  methods  for  LDPC  codes. 

Bit-Flipping  LDPC  Decoding 

The  large  code  length  of  LDPC  codes  makes  decoding  a  highly  challenging  problem.  Two  of 
the  most  common  decoding  algorithms  are  the  hard  decision  bit-flipping  (BF)  algorithm  and 
the  soft-decision  sum-product  algorithm  (SPA). 

The  bit-flipping  (BF)  algorithm  operates  on  a  sequence  of  hard-decision  bits  r  = 
011010  •  •  •  010.  Parity  checks  on  r  generate  the  syndrome  vector 

s=rHT 

Those  syndrome  bits  of  value  1  indicate  parity  failure.  The  BF  algorithm  tries  to  change  a  bit 
(by  flipping)  in  r  based  on  how  the  flip  would  affect  the  syndrome  bits. 

When  a  code  bit  participates  in  only  a  single  failed  parity  check,  flipping  this  bit  at  best 
will  correct  1  failed  parity  check  but  will  cause  y  -  1  new  parity  failures.  For  this  reason,  BF 

only  flips  bits  that  affect  a  large  number  of  failed  parity  checks.  A  simple  BF  algorithm  consists 
of  the  following  steps:2 


Step  I .  Calculate  the  parity  checks  s  —  rH 1 .  If  all  syndromes  are  zero,  stop  decoding. 

Step  2:  Determine  the  number  of  failed  parity  checks  for  every  bit: 

fi  i  =  1 ,  2,  . . . ,  n 

Step  3:  Identify  the  set  of  bits  Fmax  with  the  largest  f,  and  flip  the  bits  in  Fmax  to  generate  a 
new  codeword  r'. 

Step  4.  Let  r  —  r  and  repeat  steps  1  to  3  until  the  maximum  number  of  iterations  has  been 
reached. 
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Sum-Product  Algorithm  for  LDPC  Decoding 

The  sum-product  algorithm  (SPA)  is  the  most  commonly  used  LDPC  decoding  method.  It  is 
an  efficient  soft-input,  soft-output  decoding  algorithm  based  on  iterative  beliel  propagation. 
SPA  can  be  better  interpreted  via  the  Tanner  graph.  SPA  is  similar  to  a  see-saw  game.  In  one 
step,  every  variable  node  passes  information  via  its  edges  to  its  connected  check  nodes  in 
the  top-down  pass-flow.  In  the  next  step,  every  check  node  passes  back  information  to  all  the 
variable  nodes  it  is  connected  to  in  a  bottom-up  pass-flow. 

To  understand  SPA,  let  the  parity  matrix  be  H  of  size  J  xn  where  J  =n-k  for  an  ( n ,  k) 

LDPC  block  code.  Let  the  codeword  be  represented  by  variable  node  bits  (y/,  j  =  1 . n  | . 

For  the  jth  variable  node  v;,  let 


lij  —  [i :  hij  =1,  1  <  i 

denote  the  set  of  variable  nodes  connected  to  vy.  For  the  ith  check  node  z/,  let 


a i  =  l j '  ^  =1,  !<;<«} 


denote  the  set  of  variable  nodes  connected  to  z/* 

First,  define  the  probability  of  satisfying  check  node  z,  =  0  when  vj 

Rij(u)  =  P[zi  =  0\vj  =  u]  «  =  0,  1 


=  u  as 


(15.72) 


Let  us  denote  the  vector  of  variable  bits  as  v.  We  can  use  the  Bayes  theorem  on  conditional 
probability  (Sect.  8.1)  to  show  that 


Furthermore,  assume  rh„  ,he  variable  node  probabilhies  are  approximately  independent  We 


can  estimate 


(15.75) 


tz.O  /. 
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Figure  15.24 

Message  pass¬ 
ing  in  the 
sum-product 
algorithm. 


Note  that  the  probability  of  P  [z;  =  0|y,  =  u,  :  t  e  <r„  t  ^  j}]  is  either  0  or  1 ;  that  is,  the 
check  node  z,-  =  0  either  succeeds  or  fails.  The  relationship  Eq.  (15.76)  allows  Rij(u)  to  be 
updated  when  the  /th  check  node  receives  Qu(v(). 

Once  Rij(u)  have  been  updated,  they  can  be  passed  to  the  variable  nodes  in  the  bottom-up 
direction  to  update  Qu(x).  Again  using  Bayes-  theorem  (Sec.  8.1),  we  have 


QuW 


p  [ve  =  x]  P  [  {Zm  =  o  :  m  e  /t,,  m  ^  / } | =  a] 

P\  {Zm  =  0  :  me  nt,  m  ^  /}] 


Once  again  assuming  that  each  parity  check  is  independent,  we  then  write 


(15.77) 


P[{zm  =  Q:  mefi(,m^i}\ve=x]  =  ]""[  RmJ(x)  (15.78) 


Now  define  the  prior  variable  bit  probability  as pt  (*)  =  P(yt  =  jc).  Let  or, £  be  the  normalization 
factor  such  that  Qn ( 1 )  +  <2/,*(0)  =  1.  We  can  update  Qa(x)  at  the  variable  nodes  based  on 
Eq.  (15.76): 


Qi.eto  =  «/,€  P((x)  ]~[  Rmj(x)  (15.79) 

This  message  will  then  be  passed  back  in  the  top-down  direction  to  the  check  nodes. 
Figure  15.24  illustrates  the  basic  operation  of  message  passing  in  the  bottom-up  and  the 
top-down  directions  in  the  sum-product  algorithm.  The  SPA  can  be  summarized  as  follows. 

Initialization:  Let  m  =  0  and  let  mmax  be  the  maximum  number  of  iterations.  For  every 
h[£  =  1  in  //,  use  prior  probabilities  to  set 

Qu  (1)  =  Pl (1),  and  Qfj*  (0)  =  pe  (0) 


Step  1 :  Let  the  check  node  i  update  its  information 

«<;)(1)=  £  p  [Z(.  —  o|y,-  =  1,  {vf }]  •  n  <2gW  (15.80a) 

0>  =  £  '■[5  =  0|vj  =  0.M].  n  e!?(vt)  (15.80b) 


Variable  nodes 
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Step  2:  At  every  variable  node  (indexed  by  i),  update 


<2<7+I)(0)  =  <*g+1)  •  Pi( 0)  (I  R(”j(Q) 

(15.81a) 

e-”+1)(D=a-”+1)-Pf(l)  n  O'* 

(15.8  lb) 

where  the  normalization  factor  a\ 1  is  selected  such  that 


g|”+1)(0) + e5+,)(n  = 1 


Step  3:  At  the  variable  nodes,  also  estimate  the  a  posteriori  probabilities 

p(m+\)  [v<  _  o|r]  =  0f*m+l)  •  P((0)  H  <>( 0) 

me  It? 

p(m+\)  [v<  _  i|r]  _  a*m+,)  -pi(l)  f|  /?£>(  1) 

meflf 


(15.82a) 

(15.82b) 


where  the  normalization  factor  a"'+ 1 1  is  selected  such  that 

p(m+\)  j-^  _  0|r]  +  /><m+  ■>  [v<>  =  1  |r]  =  1 

Step  4:  Make  hard  decisions  of  each  code  bit 


vi  =  sign 


P(m+l)  [ve  =  1  |r] 
p(m+\)  =  o|r] 


If  the  decode  codeword  satisfies  all  parity  checks,  stop  decoding.  Otherwise,  go  back 
to  step  1  for  another  iteration. _ 


Notice  that  external  input  signals  {r,}  are  involved  only  during  the  estimation  of  a  priori 
probabilities  pt(\)  and^(O).  SPA  uses  the  a  priori  probabilities  as  follows: 

w(1)  =  /7(r|v£  =  1)+Mf|Vr=_0)  and  Pt(0)~  P(r\vt  =  l)+p(r\ve=0) 

For  a  more  concrete  example,  consider  the  example  of  an  AWGN  channel  with  BPSK 
modulation.  For  ve,  the  received  signal  sample  is 

n  =  y/EbO-vt  -  1)  + 

where  w«  is  Gaussian  with  zero  mean  and  variance  U /2.  Because  (r, }  are  independent,  when 
we  receive  T(  =  r/, we  can  simply  use 

,  1 

_ _ . _ —  and  pt(  0)  /  /jr  \ 

P,<  'l+exp(-4  #r<)  l+exp(4^r,) 

This  completes  the  introduction  of  sum-produc,  algorithm  for  the  decoding  of  LDPC  codes. 
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15.13  MATLAB  EXERCISES 


In  this  section,  we  provide  MATLAB  programs  to  illustrate  simple  examples  of  block  encoders 
and  decoders.  We  focus  on  the  simpler  case  of  hard-decision  decoding  based  on  syndromes. 


COMPUTER  EXERCISE  15.1 

In  the  first  experiment,  we  provide  a  program  to  decode  the  (6,  3)  linear  block  code  of  Example  15.1. 


%  Mat lab  Program  <Exl5_l.m> 

%  to  illustrate  encoding  and  decoding  of  (6,3)  block  code 
%  in  Example  15.1 


G=[l  0  0  1  0  1 
0  10  0  11 
0  0  1110]; 

H=  [10  1 
Oil 
110 
10  0 
0  10 
0  0  1]  '  ; 

E=[0  0  0  0  0  0 
1  0  0  0  0  0 
0  1  0  0  0  0 
0  0  1  0  0  0 
0  0  0  1  0  0 
0  0  0  0  1  0 
0  0  0  0  0  1 
1  0  0  0  1  0]; 

K=size (E, 1) ; 

Syndrome =mod ( mt ime  s ( E , H ' ) ,2) ; 

r=  [1  1  1  0  l  l] 

display ( [ ' Syndrome  ' , ' Error  Pattern' ] ) 

display (num2str( [Syndrome  E] ) ) 

x=mod (r*H' , 2) ; 

for  kk=l : K, 

if  Syndrome (kk, : ) ==x, 
idxe=kk; 

end 


%Code  Generator 


%Parity  Check  Matrix 


%List  of  correctable  errors 


%Find  Syndrome  List 
%Received  codeword 


% Compute  syndrome 
%Find  the  syndrome  index 


end 

syndrome = Syndrome  (idxe,  : ) 
error=E (idxe, : ) 
cword=xor (r , error) 


%display  the  syndrome 
%Error  correction 


The  execution  ot  this  MATLAB  program  will  generate  the  following  results,  which  include  the 
erroneous  codeword,  the  syndrome,  the  error  pattern,  and  the  corrected  codeword. 


Exl5_l 

Syndrome  Error  Pattern 


0  0  0  0 
10  11 
0  110 
110  0 


0  0  0  0 
0  0  0  0 
10  0  0 
0  10  0 


0 

0 

0 

0 
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967 


10  0 
0  10 
0  0  1 
111 


0  0  0  1  0  0 
0  0  0  0  1  0 
0  0  0  0  0  1 
1  0  0  0  1  0 


syndrome  = 


Oil 


error  = 

0  1  0  0  0  0 

cword  = 

10  10  11 


In  our  next  exercise,  we  provide  a  program  to  decode  the  (7,  4)  Hamming  code  of  Example  15.3. 


%  Matlab  Program  <Exl5_3.m> 

%  to  illustrate  encoding  and  decoding  of  Hamming  (7,4)  code 


% 

G=[l  0  0  0  1  0  1 
0  10  0  111 
0  0  10  110 

0  0  0  1  0  1  1]; 

H= [G ( : , 5 : 7) ' »  eye(3,3)] ; 

E=[l  0  0  0  0  0  0 

0  1  0  0  0  0  0 

0  0  1  0  0  0  0 

0  0  0  1  0  0  0 

0  0  0  0  1  0  0 

0  0  0  0  0  1  0 

0  0  0  0  0  0  1]; 

K=size (E, 1) ; 

Syndrome=mod (mtimes (E,H  )  ,2)  , 

r=[l  0  1  0  1  1  1] 

display  (  [ '  Syndrome  '/Error  Pattern  ]) 

display (num2str( [Syndrome  E] ) ) 

x=mod (r*H' ,2) ; 

for  kk=l:K, 

if  Syndrome (kk, :) ==x, 

idxe=kk; 

end 


%  Code  Generating  Matrix 


%Parity  Check  Matrix 
%List  of  correctable  errors 


%Find  Syndrome  List 
%Received  codeword 


%Compute  syndrome 


%Find  the  syndrome  index 


end 

syndrome =Syndrome (idxe,  : ) 

error=E ( idxe, : ) 
cword=xor (r , error) 


%display  the  syndrome 
%Error  correction 


Enecuting  MATLAB  program  E*15  J> .  m  will  genera, e  for  a  eraoneons  codeword  r  its  syndrome, 
the  error  proem.  a"d  B*  comctcJ  codeword: 
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Exl5_3 
r  = 


10  10  1 


1  1 


Syndrome  Error  Pattern 


10  11 
1110 
110  0 
0  110 
10  0  0 
0  10  0 
0  0  10 


0  0 
1  0 
0  1 
0  0 
0  0 
0  0 
0  0 


0  0 
0  0 
0  0 
1  0 
0  1 
0  0 
0  0 


0  0 
0  0 
0  0 
0  0 
0  0 
1  0 
0  1 


syndrome  = 

10  0 


error  = 


0  0  0 


0  1 


0  0 


cword  = 


10  10 


0 


1  1 


COMPUTER  EXERCISE  15.2 

In  a  more  realistic  example,  we  will  use  the  Hamming  (7,4)  code  to  encode  a  long  binary  message  bit 
sequence.  The  coded  bits  will  be  transmitted  in  polar  signaling  over  an  additive  white  Gaussian  noise 
( AWGN)  channel.  The  channel  outputs  will  be  detected  using  a  hard-decision  function  sgn.  The  channel 
noise  will  lead  to  hard-decision  errors.  The  detector  outputs  will  be  decoded  using  the  Hamming  (7,4) 
decoder  that  is  capable  of  correcting  1  bit  error  in  each  codeword  of  length  7. 

This  result  is  compared  against  the  uncoded  polar  transmission.  To  be  fair,  the  average  E/^/A  ratio 
tor  every  information  bit  is  made  equal  for  both  cases.  MATLAB  program  Sim7 4  Hamming  .m  is  given, 
the  resulting  BER  comparison  is  shown  in  Fig.  15.25. 

%  Matlab  Program  <Sim74Hamming . m> 

-s  Simulation  of  the  Hamming  (7,4)  code  performance 
*  under  polar  signaling  in  AWGN  channel  and  performance 
■s  comparison  with  uncoded  polar  signaling 
elf; clear  sigcw  BER_uncode  BER_coded 
G=  [1000101  %  Code  Generator 

0  10  0  111 

0  0  10  110 

0  0  0  1  0  1  1]; 

H=[l  1  1  0  1  0  0 

0  1110  10 


%  Parity  Check  Matrix 
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Figure  1 5.25 

Comparison  of 
bit  error  rates  of 
uncoded  polar 
signaling  trans¬ 
mission  and 
polar  signaling 
transmission  of 
Hamming  (7,  4) 
encoded 
[(dashed)  and 
uncoded  (solid) 
message  bits. 


110  10  0  1]; 

E=[l  0  0  0  0  0  0 

0  1  0  0  0  0  0 

0  0  1  0  0  0  0 

oooiooo 

0  0  0  0  1  0  0 

0  0  0  0  0  1  0 

0  0  0  0  0  0  1 

0  0  0  0  0  0  0]; 

K2=size (E, 1) ; 

Syndrome =mod (mtimes ( E , H  )  ,2)  , 

L1=25000;K=4*L1; 


%  Error  patterns 


%  Syndrome  list 

%Decide  how  many  codewords 


s ig_b= round  <rand (1,K)); 

sig_2=reshape(sig_b,4,LD  ; 
xig_l=mod(G'*sig_2,2) ; 
xig_2=2*reshape (xig_l, 1< 7*L  ‘  ' 

AWnoisel=randn(l,  7*L1) ; 

AWnoise2=randn(l,4*Ll); 

%  Change  SNR  and  compute  BER 


SNRdb=ii; 

SNR=10* (SNRdb*0 . 1) 1 

xig_n=sqrt(SNR*4/7)*xig_ 


2+AWnoisel; 


%Generate  message  bits 
%4  per  column  for  FEC 
%Encode  column  by  column 
%P/S  conversion 
%Generate  AWGN  for  coded  Tx 
%Generate  AWGN  for  uncoded  Tx 


%Add  AWGN  and  adjust  SNR 
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rig_l= (1+sign (xig_n) )/2; 
r=reshape(rig_l,7,Ll) ' ; 
x=mod(r*H' ,2) ; 
for  kl=l : LI , 

for  k2=l : K2 , 

if  Syndrome (k2 , : ) ==x (kl , : ) , 

idxe=k2 ; 

end 

end 

error=E ( idxe ,  : )  ; 
cword=xor (r (kl, :) , error) ; 
sigcw ( : , kl ) =cword (1:4) ; 

end 

cw=reshape (sigcw ,  1 ,  K) ; 

BER  coded (ii) =sum(abs (cw-sig_b) ) /K; 


%Hard  decisions 

%S/P  to  form  7  bit  codewords 

%  generate  error  syndromes 


%find  the  Syndrome  index 

%look  up  the  error  pattern 
%error  correction 
%keep  the  message  bits 

%  Coded  BER  on  info  bits 


%  Uncoded  Simulation  Without  Hamming  code 

xig_3=2*sig_b-l;  %  Polar  signaling 

xig  m=sqrt  (SNR)  *xig_3+AWnoise2 ;  %  Add  AWGN  and  adjust  SNR 

rig_l= (1+sign (xig_m) ) /2 ;  %  Hard  decision 

BER_uncode (ii) =sum (abs (rig_l-sig_b) ) /K;  %  Compute  BER 

end 

EboverN= [1 : 14] -3 ;  %  Need  to  note  that  SNR  =  2  Eb/N 


Naturally,  when  the  Eb/M  is  low,  there  tends  to  be  more  than  1  error  bit  per  codeword.  Thus,  when 
there  is  more  than  1  bit  error,  the  decoder  will  still  consider  the  codeword  to  be  corrupted  by  only  1  bit 
error.  Its  attempt  to  correct  1-bit  error  may  in  fact  add  an  error  bit.  When  the  Eb/N  is  high,  it  is  more 
likely  that  a  codeword  has  at  most  1  bit  error.  This  explains  why  the  coded  BER  is  worse  at  lower  Eb/ A 
and  better  at  higher  Eb/Af.  On  other  other  hand.  Fig.  15.3  gives  an  optimistic  approximation  by  assuming 
a  cognitive  decoder  that  will  take  no  action  when  the  number  of  bit  errors  in  each  codeword  exceeds  1. 
Its  performance  is  marginally  better  at  low  Eb/N  ratio. 
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PROBLEMS 


12  satisfies 


15.1-1  Golay’s  (23, 12)  codes  are  three-error  correcting  codes.  Verify  that  n  -  23  and  k 


the  Hamming  bound  exactly  for  /  =  3. 


15.1-2  (a)  Determine  the  Hamming  bound  for  a  ternary  code  (whose  three  code  symbols  are  0,  1 . 2). 


(b)  A  ternary  ( 1 1 , 6)  code  exists  that  can  correct  up  to  two  errors.  Verify  that  this  code  satisfies 


the  Hamming  bound  exactly. 


15.1-3  Confirm  the  possibility  of  a  (18,  7)  binary  code  that  can  correct  up  to  three  errors.  Can  this 


code  correct  up  to  four  errors  ? 


15.2-2  Given  a  generator  matrix 


construct  a  (3,  1)  code.  How  many  errors  can 
vectors  d  =  «  and  d  =  L  Comment. 


G  =  [1  1  1] 

rors  can  this  code  correct?  Find  the  codeword  for  data 


15.2-3  Repeat  Prob.  15.2-2  tor 


G  =  [1  1 


I] 


This  gives  a  (5,  1 )  code. 
15.2-4  A  generator  matrix 
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generates  a  (4,2)  code. 

(a)  Is  this  a  systematic  code? 

(b)  What  is  the  parity  check  matrix  ot  this  code 

(c)  Find  the  codewords  for  all  possible  input  bits. 

,d,  Determine  the  minimum  distance  of  the  code  and  the  number  of  bi,  errors  this  code  can 
correct. 


15.2-5  Consider  the  following  (k  + 1,  k)  systematic  linear  block  code  with  the  parity  check  digit  c*+, 
given  by 


c^_|_  i  =  d\  4-  di  4-  *  *  *  +  dk 


(15.83) 


(a)  Construct  the  appropriate  generator  matrix  for  this  code. 

(b)  Construct  the  code  generated  by  this  matrix  tor  k  =  3. 

(c)  Determine  the  error  detecting  or  correcting  capabilities  of  this  code. 

(d)  Show  that 


cHt  =0 


and 


10  if  no  error  occurs 

1  if  single  error  occurs 


15.2-6  Consider  a  generator  matrix  G  tor  a  nonsystematic  (6,  3)  code. 


G  = 


0  1110  1' 
1110  10 
1  10  0  0  1 


Construct  the  code  for  this  G,  and  show  that  dmin,  the  minimum  distance  between  codewords, 
is  3.  Consequently,  this  code  can  correct  at  least  one  error. 


15.2-7  Repeat  Prob.  15.2-6  if 


G  = 


1  0  0  0  1  1 
0  10  10  1 
0  0  1110 


15.2-8  Find  a  generator  matrix  G  for  a  (15,  11)  single-error  correcting 
codeword  for  the  data  vector  10111010101. 


linear  block  code.  Find  the 


15.2-9  For  a  (6,  3)  systematic  linear  block  code,  the  three  parity-check  digits  C4,  c$,  and  <  6 


C4  =  d\  +  d2  +  dj 

C5  =d\+d2 

Cf,  =  d\  +d3 

(a)  Construct  the  appropriate  generator  matrix  for  this  code. 

(b)  Construct  the  code  generated  by  this  matrix. 
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(c)  Determine  the  error  correcting  capabilities  of  this  axle. 

(d)  Prepare  a  suitable  decoding  table. 

(e)  Decode  the  following  received  words:  101100,000110,  101010. 

15.2- 10  (a)  Construct  a  code  table  for  the  (6,  3)  code  generated  by  the  matrix  G  in  Prob.  1 5.2-6. 

(b)  Prepare  a  suitable  decoding  table. 

15.2- 11  Construct  a  single-error  correcting  (7,  4)  linear  block  code  (Hamming  code)  and  the 

corresponding  decoding  table. 


15  2-12  For  the  (6,  3)  code  in  Example  15.1,  the  decoding  table  is  Table  15.3.  Show  that  if  we  use  this 
decoding  table,  and  a  two-error  pattern  010100  or  001001  occurs,  it  will  not  be  corrected.  If 
it  is  desired  to  correct  a  single  two-error  pattern  010100  (along  with  six  single-error  patterns), 
construct  the  appropriate  decoding  table  and  verify  that  it  does  indeed  correct  one  two-error 
pattern  010100  and  that  it  cannot  correct  any  other  two-error  patterns. 

15.2- 13  (a)  Given  k  =  8,  find  the  minimum  value  of  n  for  a  code  that  can  correct  at  least  one  error. 

(b)  Choose  a  generator  matrix  G  for  this  code. 

(c)  How  many  double  errors  can  this  code  correct? 

(d)  Construct  a  decoding  table  (syndromes  and  corresponding  correctable  error  patterns). 

15.2- 14  Consider  a  (6,  2)  code  generated  by  the  matrix 


0 


0 


0 


(a)  Construct  the  code  table  for  this  code  and  determine  the  minimum  distance  between 


codewords. 


patterns. 


i  of  r)  —  jr3  -I-  x  4-  1  to  construct  a  systematic  (7,  4)  cyclic 

15.3-1  (a)  Use  the  generator  polynomial  g(x )  -  *  +  x  + 

code. 

(b)  What  are  the  error  correcting  capabilities  of  th.s  code . 


Use  the  generator  polynomial  g(x)  -  *  +  x  +  10  c< 

le. 

What  are  the  error  correcting  capabilities  of  this  code? 


(c)  Construct  the  decoding  table. 

,d)  If  the  received  word  is  1101100,  determine  the  transmuted  data  word. 


Determine  the 

11000101011110 


15.3-3  Factorize  the  polynomial 


x3  +  x2  +  x  +  1 
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Hint'.  A  third-order  polynomial  must  have  one  factor  of  first  order.  The  only  first-order  poly¬ 
nomials  that  are  prime  (not  factorizable)  are  x  and  x  +  I.  Since  x  is  not  a  factor  of  the  given 
polynomial,  try  x  +  1.  Divide*3  +  x2+x+lbyx-l-l. 

15.3-4  The  concept  explained  in  Prob.  15.3-3  can  be  extended  to  factorize  any  higher  order  polynomial. 
Using  this  technique,  factorize 


x5+x4+x2  +  1 

Hint:  There  must  be  at  least  one  first-order  factor.  Try  dividing  by  the  two  first-order  prime 
polynomials  jc  and  x+ 1 .  The  given  fifth-order  polynomial  can  now  be  expressed  as  <t>\ (x)04(x), 
where  <p\ (jc)  is  a  first-order  polynomial  and  </>4(x)  is  a  fourth-order  polynomial  that  may  or  may 
not  contain  a  first-order  factor.  Try  dividing  04M  by  .r  and  ,c+ 1 .  It  it  does  not  work,  it  must  have 
two  second-order  polynomials  both  of  which  are  prime.  The  possible  second-order  polynomials 
are  x2,x2  +  l,x2  +  x,  and  x2  +  x  +  1  •  Determine  which  of  these  are  prime  (not  divisible  by  x 
or  x  +  1).  Now  try  dividing  <t>4 U)  by  these  prime  polynomials  of  the  second  order.  It  neither 
divides,  04(x)  must  a  prime  polynomial  of  the  fourth  order  and  the  factors  are  4> i  (x)  and 
04(x). 

15.3- 5  Use  the  concept  explained  in  Prob.  15.3-4  to  factorize  a  seventh-order  polynomial  x7  +  1. 

Hint:  Determine  prime  factors  of  first-,  second-,  and  third-order  polynomials.  The  possible 
third-order  polynomials  are  ,  x3  4-  1 ,  x*  -f  xy  x*  +  x  -I-  1 ,  x*  +  x2 ,  -v*  4-  x~  +  1  *  x  +  x  +  x* 
and  x3  +  x2  +  x  +  1 .  See  hint  in  Prob.  15.3-4. 

15.3- 6  Equation  (15. 16)  suggests  a  method  of  constructing  a  generator  matrix  G'  for  a  cyclic  code. 


V  'g(x)‘ 

"^1 

82 

*“  8n—k+\ 

0 

0 

0 

G'  = 

xk~2g(x) 

= 

0 

8 1 

82 

8n-k+\ 

0  • 

0 

_  g(x)  . 

_0 

0 

0 

8\ 

82  * 

8n — k-\~  1  _ 

where  g(x)  =  g^~k  +  g2*n“*_1  +  —  +  8n-k+l  is  the  generator  polynomial.  This  is,  in 
general,  a  nonsystematic  cyclic  code. 

(a)  For  a  single-error  correcting  (7,  4)  cyclic  code  with  a  generator  polynomial  g(x)  —  x  + 
x2  -f  1,  find  G '  and  construct  the  code. 

(b)  Verify  that  this  code  is  identical  to  that  derived  in  Example  15.3  (Table  15.4). 

15.3-7  The  generator  matrix  G  for  a  systematic  cyclic  code  (see  Prob.  15.3-6)  can  be  obtained  by 
realizing  that  adding  any  row  of  a  generator  matrix  to  any  other  row  yields  anot  a 

generator  matrix,  since  the  codeword  is  formed  by  linear  combinations  of  data  digits,  s0^ 
generator  matrix  for  a  systematic  code  must  have  an  identity  matrix  4  in  ^rst  *  c0  ^  ^ 
Such  a  matrix  is  formed  step  by  step  as  follows.  Observe  that  each  row  in  G  in  Prob.  • 
a  left  shift  of  the  row  below  it,  with  the  last  row  being  g(x).  Start  with  the  kth  (last)  row  g 
Because  g(jc)  is  of  the  order  n  —  ky  this  row  has  the  element  1  in  the  kth  column,  as  requir  ^ 
For  the  ( k  -  1  )th  row,  use  the  last  row  with  one  left  shift.  We  require  a  0  in  the  kth  co  umn 
the  ( k  -  l)th  row  to  form  4-  If  there  is  a  0  in  the  kth  column  of  this  ( k  —  l)th  row,  we  acc^  ^ 
it  as  a  valid  (k  —  l)th  row.  If  not,  then  we  add  the  fcth  row  to  the  ( k  -  1  )th  row  to  o  ta^n  ^ 
its  Ath  column.  The  resulting  row  is  the  final  ( k  —  l)th  row.  This  row  with  a  single  e  t  s  ^ 
serves  as  the  (k  —  2)th  row.  But  if  this  newly  formed  (k  -  2)th  row  does  not  have  a  »  ^ 
^th  column,  we  add  the  fcth  (last)  row  to  it  to  get  the  desired  0.  We  continue  this  wa^un^^e 
k  rows  have  been  formed.  This  gives  the  generator  matrix  for  a  systematic  (n,  k)  eye  ic  c 
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(a)  For  a  single-error  correcting  (7,  4)  systematic  cyclic  code  with  a  generator  polynomial 

^  +  jf2  +  find  G  and  construct  the  code. 

(b)  Verify  that  this  code  is  identical  to  that  in  Table  15.5  (Example  15.4). 

,  •  ,  _  ,3  4.  r  +  I  to  find  the  generator  matrix  O'  for  a 

15.3- 8  (a)  Use  the  generator  polynomial  g(jc)  -  + 

nonsystematic  (7, 4)  cyclic  code  . 

(b)  Find  the  code  generated  by  this  matrix  G' . 

(c)  Determine  the  error  correcting  capabilities  of  this  code. 

•  i  ,  x  _  r3  ,  r  +  1  (See  Prob.  15.3-8)  to  find  the  generator  matrix 

15.3- 9  Use  the  generator  polynomial  g(x) 

G  for  a  systematic  (7, 4)  cyclic  code. 

15.3- 10  Discuss  the  error  correcting  capabHiti^^o^^^*Oerleaved  (Xn.Xk)  cyclic  code  with  X 

and  using  a  three-error  correcting  (31, 16)  BCH 

15.3- 11  The  generator  polynomial 


«(•*) 


=  x10+/+JC5+Jt4+x2+X+l 


15.4-1 


generates  a  cyclic  BCH  ( 15, 5)  code. 

■ — 

(c)  Show  how  many  errors  this  code  can  correct. 

.  pci/  over  an  AWGN  channel  with  Eh/N  =  9.  This  data 
Uncoded  data  is  transmitted  by  us'"f?  ^  , 2)  Golay  code  (Prob.  1 5. 1  - 1 )  and  transmitted 

is  no.  coded  using  a  ^  “^,™^nd  1  ft.’—  — 

over  the  same  channel  at  an()  Pec  for  the  coded  and  the  uncoded 

(a)  Determine  the  corrected  error  probability  />eu 

systems.  ..  ;litv  p  r  computed  in  part  (a),  using  the  uncoded 

0,1  ,he  req”i"d  °'  ^  . 

154.2  Thesimplecodefordeteciingbursle^o^’i^;  ftdlgils  in  lengths 

nde  with  a  Slight  modification.  The  A  data  digit  so  that  each  segment  now  has 

L  .  p.  1  5  4  To  each  group  we  add  one  p  >  narlty  check  digit  is  chosen  to  ensure 

m  *.«*  -f  ZTl  *.  las.  —  -»  + 1  ^  ch“k  “ 

digit,  as  our  new  data  and  J*  ^  15  4wi,|be  transmitted  thus: 

was  done  in  Fig.  l^- 


10111 


01010 


noil  ioooi  nooo  oiiu 


.  tthis(30  20)  code  is  capable  of  single  error  correction  as  well  as  the  detection  of  a 
Show  that  this  (W-  ~u' 

Single  burst  of  length  bits  are  01  «•  »'  •>  111  11 


dtgits  11010100 
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ERROR  CORRECTING  CODES 


Figure 

P.15.5-2 


Figure 

P.15.5-3 


(b)  Use  Viterbi's  algorithm  to  decode  the  following  received  sequences: 


(i) 

100 

110 

111 

101 

001 

101 

001 

010 

(ii) 

010 

110 

111 

101 

101 

101 

001 

010 

(iii) 

111 

110 

111 

111 

001 

101 

001 

101 

15.5-3  A  systematic  recursive  convolution  encoder  (Fig.  P15.5-3)  generates  a  rate  1/2  code.  Unlike 
earlier  examples,  this  encoder  is  recursive  with  feedback  branches.  It  turns  out  that  we  can 
still  use  a  simple  trellis  and  state  transition  diagram  to  represent  this  encoder.  The  maximum 
likelihood  Viterbi  decoder  also  applies.  Denote  the  state  value  as  (<4_i,  <4-2)* 

(a)  Illustrate  the  state  transition  diagram  of  this  encoder. 

(b)  Find  the  corresponding  trellis  diagram. 

(c)  For  an  input  data  sequence  of  0100110100.  determine  the  corresponding  codeword. 


15.5- 4  For  the  systematic  recursive  convolution  code  of  Fig.  PI 5.5-3.  apply  the  Viterbi  algorithm  to 

decode  the  received  sequence  10  01  01  00  10  11. 

15.5- 5  The  Wi-Fi  standard  (IEEE  802.1  la)  uses  a  rate  1/2  convolutional  code  with  encoder  as  show 

in  Fig.  PI 5.5-5.  Find  the  sketch  the  trellis  diagram  for  this  convolutional  code. 
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Figure 

P.15.5-5 


Input  data 


15.6- 1  A  block  code  has  parity  check  matrix 

„_r 1  °  1  0  1 1 

"  “  0  1  0  1  I  j 

(a)  Find  the  code-generating  matrix  of  this  code. 

(b)  Find  the  minimum  distance. 

(c)  Find  the  trellis  diagram. 

15.6- 2  For  the  block  code  in  Prob.  15.2-9, 

(a)  Find  the  code-generating  matrix. 

(b)  Find  the  minimum  distance. 

(c)  Find  the  trellis  diagram. 

15.10-1  (a)  Illustrate  the  implementation  ot  a  rate 

=  i+o2+d3 

(b)  Find  and  sketch  the  trellis  diagram  of  this  RSC  encoder. 

,5.11.1  Find  and  illustrate  the  bipartite  graph  of  a  code  with  parity  check  matrix 

1  0  1  0  1  "I 
0  10  1  1  J 


1/3  turbo  encoder  in  which  the  RSC  encoder  has 
and  82(D)  =  1  +  D  +  D* 


-[ 


1M1J  pi,*)  ill-siraie  .h.  bipartite  graph  -f.be  (6. 3,  biocK  code  teiih  gerrerai.r  nr.iria 


G  = 


0  1  1  *  0  1 

|  1  1  0  1  o 

1  1  0  0  0  1 


1 


APPENDIX  A 


ORTHOGONALITY  OF  SOME  SIGNAL  SETS 


A.l  Orthogonality  of  the  Trigonometric  and  Exponential  Signal  Set 

Consider  an  integral  /  defined  by 


where  fr  stands  for  integration  over  any  contiguous  interval  of  7o  =  2;r/a>o  seconds.  By 
using  a  trigonometric  identity  (Appendix  E),  Eq.  (A. la)  can  be  expressed  as 


(A.  lb) 


Since  cos  coot  executes  one  complete  cycle  during  any  interval  of  7o  seconds,  cos  ( n  +  m)mt 
executes  ( n+m )  complete  cycles  during  any  interval  of  duration  To .  Therefore,  the  first  integral 
in  Eq.  (A.  lb),  which  represents  the  area  under  ( n  +  m)  complete  cycles  of  a  sinusoid,  equals 
zero.  The  same  argument  shows  that  the  second  integral  in  Eq.  (A.  1  b)  is  also  zero,  except  when 
n  =  m.  Hence,  /  in  Eq.  (A. lb)  is  zero  for  all  n  ^  t n.  When  n  =  m,  the  first  integral  in  Eq. 
(A. lb)  is  still  zero,  but  the  second  integral  yields 


Thus, 


(A. 2a) 


We  can  use  similar  arguments  to  show  that 


(A.2b) 


and 


sin  moot  cos  mcoot  dt  =  0  all  n  and  m 


(A.2c) 
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A.2  Orthogonality  of  the  Exponential  Signal  Set 

The  set  of-2pc]In  Appendices  running  head,  we  have  followed  the  style  of  sample  book.  I  rust 
this  is  okay,  exponentials  ejn('K]I  (n  =  0,  ±  1 ,  ±2,  . . .)  is  orthogonal  over  any  interval  of  duration 
7o,  that  is, 


f  gimmi $  =  /  e,(m_n)"0'  dt 
JTo  JTo 


0  m  ^  n 

T{)  m  =  n 


(A.3) 


Let  the  integral  on  the  left-hand  side  ot  Eq.  (A.3)  be  /,  where 

I—  f  eima* >'(e'nt00')*  dt 

JTo 


=  /  S 

JTo 


gji m-n)wot 


(A.4) 


The  case  of  m  =  n  is  trivial:  the  integrand  is  unity,  and  /  =  T0.  When  m  £  n,  however. 


/  = 


j{m-n)a)Qt 


ei(m 


\t\+To 
1 1\ 


The  last 


j(m  —  n)a)o 

_ 1  gHm—n)(DQii  [J(m-n)woT0  _]]  =  () 

j(m  —  ri)(o o 

result  follows  from  .he  fac,  that  mTo  =  2rr.  and  =  I  for  all  imegral  values  oft. 


APPENDIX  B 


CAUCHY-SCHWARZ  INEQUALITY 

Prove  the  following  Cauchy-Schwarz  inequality  for  a  pair  of  real  finite  energy  signals  /(f) 
and  g(t): 


cb 

2 

Cb  i 

rb 

/  f(t)g(t)dt 

J  a 

< 

/  f\t)dt 
Ja 

- 1 

:n 

-  ^ 

- 1 

with  equality  only  if  g(t)  =  cf(t),  where  c  is  an  arbitrary  constant. 

The  Cauchy-Schwarz  inequality  for  finite  energy,  complex-valued  functions  X  ( co )  and 
Y (co)  is  given  by 


/oo  roo 

\X(a))\2d(o  /  \Y(0))\2d(0 

-OO  J  —  OO 


with  equality  only  if  Y(co)  =  cX*(co ),  where  c  is  an  arbitrary  constant. 

We  can  prove  Eq.  (B.l)  as  follows:  for  any  real  value  of  X,  we  know  that 
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with  equality  only  if  |y<a>)|  =  c|X(«)|,  where  c  is  an  arbitrary  constant.  Now  recall  that 


f 


X(co)Y((o)d(o\ 


<-f 


\X(co)\\Y((o)\dco 


-f 


\X(u))Y(u>)\dco 


(B.6) 


with  equality  if  and  only  if  Y(w)  =  cX*( to),  where  c  is  an  arbitrary  constant.  Equation  (B.2) 
immediately  follows  from  Eqs.  (B.5)  and  (B.6). 


APPENDIX  C 


GRAM-SCHMIDT  ORTHOGONALIZATION 
OF  A  VECTOR  SET 

We  have  defined  the  dimensionality  of  a  vector  space  as  the  maximum  number  of  independent 
vectors  in  the  space.  Thus  in  an  /V -dimensional  space,  there  can  be  no  more  than  N  vectors 
that  are  independent.  Alternatively,  it  is  always  possible  to  find  a  set  of  N  vectors  that  are 
independent.  Once  such  a  set  has  been  chosen,  any  vector  in  this  space  can  be  expressed  in 
terms  of  (as  a  linear  combination  of)  the  vectors  in  this  set.  This  set  forms  what  we  commonly 
refer  to  as  a  basis  set,  which  forms  the  coordinate  system.  This  set  of  N  independent  vectors 
is  by  no  means  unique.  The  reader  is  familiar  with  this  propery  in  the  physical  space  of 
three  dimensions,  where  one  can  find  an  infinite  number  of  independent  sets  of  three  vectors. 
This  is  clear  from  the  fact  that  we  have  an  infinite  number  of  possible  coordinate  systems. 
An  orthogonal  set,  however,  is  of  special  interest  because  it  is  easier  to  deal  with  than  a 
nonorthogonal  set.  If  we  are  given  a  set  of  N  independent  vectors,  it  is  possible  to  obtain 
from  this  set  another  set  of  N  independent  vectors  that  is  orthogonal.  This  is  done  by  the 
Gram-Schmidt  process  of  orthogonalization. 

In  the  following  derivation,  we  use  the  result  [derived  in  Eq.  (2.27)]  that  the  projection 
(or  component)  of  a  vector  \i  upon  another  vector \\  (  see  Fig.  C.  1)  is  C12X1 ,  where 


c  12  = 


llxill2 


(C.l) 


The  error  in  this  approximation  is  the  vector  X2  —  C12X1 ,  that  is. 


<  X|,  X2  > 

error  vector  =  X2 - ~ — xi 

llxill2 


(C.2) 


The  error  vector,  shown  dashed  in  Fig.  C.  1  is  orthogonal  to  vector  xi . 

To  get  physical  insight  into  this  procedure,  we  shall  consider  a  simple  case  ot  two 
dimensional  space.  Let  xi  and  X2  be  two  independent  vectors  in  a  two-dimensional  space 
(Fig.  C.  1).  We  wish  to  generate  a  new  set  of  two  orthogonal  vectors  y  1  and  y2  from  xi  and  x:- 
For  convenience,  we  shall  choose 


yi  =  xi 


(C.3) 


We  now  find  another  vector  y2  that  is  orthogonal  to  y  1  (and  xi .  Figure  C.  1  shows  that  the  error 
vector  in  approximation  of  X2  by  yi  (dashed  lines)  is  orthogonal  to  y  1 ,  and  can  be  taken  as  >2* 

982 


Appendix  C:  Gram-Schmidt  Orthogonalization  of  a  Vector  Set  983 


Figure  C.l 

Gram-Schmidt 
process  for  a 
two-dimensional 
case. 


Hence, 


<  Xi,  X2  > 

yi  =  *2 - - - ^ - xl 


=  X2  - 


llxill2 
<  yi-  x2  > 

llyill2 


yi 


(C.4) 


Equations  (C.3)  and  (C.4)  yield  the  desired  orthogonal  set.  Note  that  this  set  is  not  unique^ 
There  is  an  infinite  number  of  possible  orthogonal  vector  sets  (y , .  y2)  that  can  be  generated 
from  (x, .  x2).  In  our  derivation,  we  could  as  well  have  started  with  y  =  x2  instead  of  y ,  =  x, . 
This  starting  point  would  have  yielded  an  entirely  different  set. 

The  reader  can  extend  these  results  .0  a  rhree-dinrensrona,  case. 

an  independent  error  in  this  approximation 

musl'beorthogona^  10  both  „  and  ,2  and,  hence,  can  be  laken  as  rhe  third  orthogonal  vecror 
y3.  Hence, 


y3  =  x3  -  sum  of  projections  of  x3  on  yi  and  y2 


=  x3  - 


<y,,x3>  <  y2.  x3  > 


llyil 


-yi  - 


Ily2ll2 


y2 


(C.5) 


i»  <,n  extended  to  an  N -dimensional  space.  In  general,  given  N  independent 
These  results  can  be  ette  staitar  lines,  we  can  oblain  an  orthogonal  sel 

vectors  xi ,  X2,  •  ‘  •  *  x^’  11  r 

yi,y2. ....  y  n.  where 


<r  ftt 


%i.  —  Vi 


and 


<  yk 

y i  =  *j~L 

k= I 


X;  > 


llyill2 


-y  k 


j  =  2,3 . N 


(C.l) 


Note  that  this  is  one  of  The orrhonormal  set y,.  h . 

U-gthsoftherespecdvetecto. 


y  k  = 


n 

llytll 


•  ml  cnace  because  one-to-one  correspondence  exists  between 
We  can  apply  Impendent  signals  *,  (,).  *,«> . «(,).we  can  fomn  a 

signals  and  vectors.  If  we  have /v 
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set  of  N  orthogonal  signals  yi(0,  >’2(0.  •  •  • .  }'n( 0  as 
yi(0=40 


yj( 0  =  xj(t)  -  ^  ckjyk(t )  j  =  2,3 - ,N 


(C.8) 


k= 1 


where 


f)kU)Xj(t)  dt 


(C.9) 


Note  that  this  is  one  of  the  infinitely  many  possible  orthogonal  sets  that  can  be  formed  from 
the  set  Jt|  (/),  *2(0,  •  •  • ,  *aK0-  The  set  can  be  normalized  by  dividing  each  signal  yj(t)  by  its 
energy. 


Example  C.  1  The  exponential  signals 


gi(0  =  e  p,u(t) 
£2(0  =  e~2p,u(t) 


gN(t)  =  e  Np,u(t) 

form  an  independent  set  of  signals  in  A' -dimensional  space,  where  N  may  be  any  integer.  This 
set,  however,  is  not  orthogonal.  We  can  use  the  Gram-Schmidt  process  to  obtain  an  orthogonal 
set  for  this  space.  If  yi  (0,  y2(0,  •  •  • ,  w(0  is  the  desired  orthogonal  basis  set,  we  choose 

y\(t)=g\{t)  =  e~p'u(t) 

From  Eqs.  (C.8)  and  (C.9)  we  have 

y2(0  =*2(0  -ci2yi(0 


where 


/!^yi  (0*2(0  dt 


j™e~p,e-2p'  dt 


2 

3 


Hence, 


X2(0  =  (e  2pl  -  \e  pl)u(t) 


(C.10) 


Similarly,  we  can  proceed  to  find  the  remaining  functions  y^t),  w(0,  and  so  on.  The 

reader  can  verify  that  all  this  represents  a  mutually  orthogonal  set. 


APPENDIX  D 


BASIC  MATRIX  PROPERTIES  AND  OPERATIONS 


D.l  Notations 

An  n  x  1  column  vector* 


consists  of  n  entries  and  is  formed  by 


X2 

x„ 


The  transpose  of  *  is  a  row  vector  represented  by 

*r  =  [  *1  xi  •  •  •  x„  ] 


The  conjugate  transpose  of*  is 


also  a  row  vector  written  as 


xH  =  (x*)T  —  [  *1  v2 


>4  ] 


xH  is  also 
An  m 


known  as  the  Hermitian  of*, 
x  n  matrix  consists  of  n  column  vectors 


A  =  [  a\  a~  a"  1 

au  ai,2  ••• 

02,1  °2,2  •••  fl2.n 


dm,  1  a»n,2 


dm,n  _ 


We  also  define  its  Transpose  and  Hermi.ian.  nespecTively.  as 


’  a\,i 

d\,2 

d2,\  ‘ 

d2,2  * 

•  *  dm,  1 

•  •  dm,2 

a"  = 

[4. 

ah 

4.  • 

•  4.1  ‘ 

•  •  4.2 

__  a\,n 

d2,n  * 

•  •  dm,n  _ 

.  4* 

4.n  • 

*  *  & m,n  - 

(D.la) 

(D.lb) 

(D.lc) 

(D.2a) 

(D.2b) 

(D.2c) 
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•  If  At  =  A ,  then  we  say  that  A  is  a  symmetric  matrix. 

•  If  Ah  =  A.  then  we  say  that  A  is  a  Hermitian  matrix. 

•  If  A  consists  of  only  real  entries,  then  it  is  both  Hermitian  and  symmetric. 

D.2  Matrix  Product  and  Properties 

For  an  m  x  n  matrix  A  and  an  n  x  i  matrix  B  with 


*1.1 

*1.2  - 

b\x 

b 2.1 

*2.2 

•  •  *2.r 

bn,  1 

bn,2  • 

•  •  bnx 

the  matrix  product  C  =  A  •  B  has  dimension  m  x  l  and  equals 


— 

.  — 

C  1,2  * 

C2,2  * 

••  a\,l 
■  ■  02,1 

where 

n 

ciJ  =  ^Oijkbkj 

(D.4) 

cm,  1 

cmy2 

••  am,t 

k= 1 

In  general  AB  ^  BA.  In  fact,  the  products  may  not  even  be  well  defined.  To  be  able  to  multiply 
A  and  B.  the  number  of  columns  of  A  must  equal  the  number  of  rows  of  B. 

In  particular,  the  product  of  a  row  vector  and  a  column  vector  is 


A  =  Erf* 

Jt=i 


=  <  x,y  > 


(D.5a) 

(D.5b) 


Therefore,  xHx  =  ||jc  ||2. 

Two  vectors  x  and  y  are  orthogonal  if  yHx  =  xHy  =  0. 

There  are  several  commonly  used  properties  of  matrix  products: 


A(B  +  C) 

=  AB+AC 

A(BC ) 

=  (AB)C 

(AB)* 

=  A*B* 

(AB)T 

=  BtAt 

(AB)h 

=  bhah 

(D.6a) 

(D.6b) 

(D.6c) 

(D.6d) 

(D.6e) 
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mmetric. 


(D.3) 


=  ^  aixbkj  (D-4) 

*=l 


D.3  Identity  and  Diagonal  Matrices 

Ann  xn  square  matrix  is  diagonal  if  all  its  off-diagonal  entries  are  zero,  that  is. 


D  =  diag (d\,  di . d„) 

‘  d\  0  0  •  •  •  0 

0  di  0  •  •  •  0 

0  0  dn- 1  0 

0  0  •••  0  d„ 

An  identity  matrix  /„  has  unit  diagonal  entries 

f  1 
0 


0 


0  •••  o  “ 

1  ...  0 


(D.7a) 


(D.7b) 


(D.8) 


For  an  nxn  square  matrix  A,  if  there  exists  an  xn  square  matrix  B  such  that 

BA  =  AB  =  ln 


ined.  To  be  able  to  multiply 
>ws  of  B. 
is 


(D.5a) 

(D.5b) 


(D.6a) 

(D.6b) 

(D.6c) 

(D.6d) 

(D-6e) 


then 


B  =  A 


-l 


(D.9) 


is  the  inverse  matrix  of  A.  For  example,  given  a  diagonal  matrix 

D  =  diag(</i,  d2,  •  •  d„) 

D'l=diag(^  s . s) 

n  4  Determinant  of  Square  Matrices 

determinant  of  .  x  .  “  -'O'”11  recurslW'y  by 

det  (A)  =  ^  ai ij(~  I )  jMiJ 

i=l 

,  •  (  -Ux(n-D  matrix  known  as  the  minor  of  A  by  eliminating  its  ith  row 

ZjthcolT™.  Specifically,  for  a  2  x  2  matrix. 


(D.IO) 


det 


[:  i]— 


be 
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Based  on  the  definition  of  determinant,  for  a  scalar  or, 

det(aA)  =  (/'det  (A) 
det  t  )  =  det  (A ) 

For  an  identity  matrix 

det  (/ )  =  1 

Also,  for  two  square  matrices  A  and  B, 

det  (AB)  =  det  (A)det(B) 

Therefore, 

det  ^AA-1^  =  det  (A)  det  (a-1^  =  1 

For  an  m  x  n  matrix  A  and  an  n  x  m  matrix  B,  we  have 

det(/m  +  AB)  =  det  (/„  +  BA) 

D.5  Trace 

The  trace  of  square  matrix  A  is  the  sum  of  its  diagonal  entries 

n 

Tr(A)  =  ajj 

i=l 

For  an  m  x  n  matrix  A  and  an  n  x  m  matrix  B ,  we  have 

Tv(AB)  =Tv(BA) 

D.6  Eigendecomposition 

If  the  n  x  n  square  matrix  A  is  Hermitian,  then  the  equation 

Au  =  Xu 

specifies  an  eigenvalue  X  and  the  associated  eigenvector  u. 

WhenA  is  Hermitian,  its  eigenvalues  are  real-valued.  Furthermore,  A  can  be 

into 

A  =  UAUh 

in  which  the  matrix 

U  =  [ill  «2  •••  K„] 

consists  of  orthogonal  eigenvectors  such  that 

UUH  =In 


Matrices  satisfying  this  property  are  called  unitary. 
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Furthermore,  the  diagonal  matrix 


(D.lla) 

A  =  diag(X|  X2  •  •  •  K  ) 

(D.19) 

(D.llb) 

consists  of  the  corresponding  eigenvalues  of  A. 

Because 

(D.llc) 

UHU  =  UUH  =1„ 

we  can  also  write 

(D.20) 

(D.  lid) 

UhAU  =  A 

The  eigenvalues  of  A  are  very  useful  characteristics.  In  particular. 

(D.21) 

(D.lle) 

n 

det  (A)  =  Y\  h 

i=  1 

(D.22a) 

(D.12) 

n 

Trace  (A)  = 

1=1 

(D.22b) 

(D.13) 

D  7  Special  Hermitian  Square  Matrices 

Let  an  n  x  n  matrix  A  be  Hermitian.  A  is  positive  definite  if  for  any  n  x 

have 

xhAx  >  0 

1  vector  x  ^  0,  we 

(D.23) 

(D.14) 

A  is  semipositive  definite  if  for  any  n  x  1  vectors,  we  have 

xhAx  >  0 

(D.24) 

(D.15) 

4  is  negative  definite  if  for  any  .  x  1  vector*  #  0.  we  have 

xhAx  <  0 

4  is  positive  definite  if  and  only  if  all  its  eigenvalues  are  positive. 

(D.25) 

be  decomposed 
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MISCELLANEOUS 


E.l  L’Hopital’s  Rule 

If  lim  fix) /gix)  results  in  the  indeterministic  form  0/0  or  oc/oo,  then 


fix)  fix) 

lim  - =  lim 


gix) 


gix) 


E.2  Taylor  and  Maclaurin  Series 


fix)  =fia)  +  ~—~~~f ia)  +  <A  / ia)  + 


1! 


2! 


fix)  =/(0)  +  f/(0)  +  X-fi  0)  + 


E.3  Power  Series 


,  ,  X2  X3  X” 

^=1+I+2!  +  3!+-+^!  + 

X3  X5  X 7 

sinx  =  x - 1 - |-  •  •  • 

3!  5!  7! 

X2  X4  X^  X® 

cosx  =1 - 1 - 1 - 

2!  4!  6!  8! 

x3  2x5  1 7x7 

tan  x  =  x  +  —  H - 1 - 1 - 

3  15  315 


n 


x  < 


Qix)  = 


e-*2/2  /  1 

xV2n  \  x2 


1-3  1-3-5 


x4 


+ 


1  -x 


2! 

1  +  nx  \x\  «;  1 
=  1  +  x  +  x2  +  x3  H - 


\x\  <  1 


+ 


C) 


xk  + 
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E.4  Sums 


*  Jt+1  _  1 

Yr  = - —  r±  1 

r  _  1 

m= 0 


N  JV+1  JM 

m=M 

"  V  /?  /  b^{a  —  b) 

m=0 


r*  1 

a  ^  b 


E.5  Complex  Numbers 


_±>/2  _ 


=  ±/ 


= 


1  n  even 
—  1  n  odd 


=  cos  6  ±  j  sin  0 

a+jb  =  re>e  r  =  Ja2  +  b\  e  =  Uui~'  (^) 

(re>e)k  =  rVw 
(r,^')(r2^)  = 


E.6  Trigonometric  Identities 


=  cos  x  ±  j  sin  x 


COS  Jt 


2 ) 

sin  jc 


cos(jt±^)  =  Tsin 

sin  (x  ±  ^)  =  icosx 
2  sin*  cos*  =  sinlt 
sin" .?  +  cos  •*  =  ^ 
cos2  x  —  sin2  x  =  cos  lx 


cos2*  =  -O  +  coslr) 
sin2  jc  =  -0  -  cos  20 
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COS3  X  =  -  (3  cos  X  +  cos  3x) 
4 

sin3  jc  =  -  (3  sin  x  —  sin  3*) 
4 


sin  (x  ±  y) 
cos  (jc  ±  y) 

tan(jc  ±y) 


sin  jc  cos  y  ±  cos  jc  sin  y 
cos  jc  cos  v  =F  sin  jc  sin  v 
tan  .c  ±  tan  y 

1  ip  tan  x  tan  y 


sinjcsiny  =  -[cos(*  —  y)  —  cos  (a:  +  y)] 
cos x cosy  =  ^[cos  (*  “  y)  +  cos  (*  +  >’)] 
sin  jc  cos  y  =  -  [sin  (jc  —  y)  +  sin  (x  +  y)] 


a  cos  jc  +  b  sin  a*  =  C  cos  (jc  +  9) 


in 


which  C  =  Va2  +  b2 


and 


6  =  tan  1 


E.7  Indefinite  Integrals 


/ 


J  udv  =  uv  —  J  v 


du 


f(x)g(x)dx=f(x)g(x)~  \  f{x)g(x)dx 


1 


-P' 


sin  axdx  =  —  cos  ax 
a 


I* 

/.  2  ,  x  sin2ojc 

sin  axdx  = - 

2 


4  a 


sin  axdx  =  -x (sin  ax  —  ax  cos  ax) 
a 1 

xcosaxdx  =  — r- (cos ax  -j-  ax  sin  ax) 


f 


f 


cos  axdx  =  -  sin  ax 
a 

x  sin  2  ax 


cos  ax  dx  =  -  + 


4a 


f  t  si 

f 

J  x2  sin  ax  dx  =  —  (lax  sin  ax  +  2  cos  ax  —  a2 x2  cos  ax) 

J  x2  cos  axdx  =  — j (2ar  cos  a*  —  2  sin  ax  +  a2 x2  sin  ax) 

/  sin  (a  -  b) x  sin  (a  +  b) x  7  ,  , , 

sm  ax  sin  bx  dx  =  — — - — - ^ - — —  a2  ^  b 

/ 


2  {a  —  b)  2(a  +  b) 

sin  ax  cos  bx  dx  =  —  ^ 


cos  (a  -  b)x  cos  ( a  +  b)x 


2  (a  —  b )  2  (a  +  b) 


a2^b2 


sin  ( a  -  b)x  sin  ( a  +  b)x 

cos  ax  cos  bx  dx  =  — - : —  + 


2 (a  -  b)  2 (a  +  b) 
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a2*b2 


I 

1 


eaxdx=-eax 

a 

PCLX 

o<*X  Ar  - 


xe™  dx  =  -^-(ax  -  1) 
aL 

/pCLX 

x2eax  dx  =  —  ( a2x 2  —  2  ax  +  2) 


/ 

/ 


e0*  sin  bxdx  = 
eax  cos  bxdx  = 


a2  +  b2 
e™ 


(a  sinbx  -  bcosbx) 


a2  +  b 2 


-  (a  cos  bx  +  b  sinbx) 


I 

J 


1  .  1  _1 

- - -  dx  =  -  tan  - 

x 2  +  a2  ci  a 

■?  -  -Z  dx  =  -  In  (x2  +  a‘ ) 
x 2  +  a2  2 
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Adaptive  delta  modulation  (ADM),  351 
Adaptive  differential  pulse  code  modulation 
(ADPCM),  345-346 
Additive  white  Gaussian  noise  (AWGN), 


10, 635 

Aliasing  error  (spectral  folding),  79,  159, 
310-312 

All-pass  vs.  distortionless  system,  126-127 
Alternate  mark  inversion  (AMI),  378, 
390-392 

Amplitude-modulated  systems,  565-574 
large-noise  case,  571-572 

optimum  preemphasis-deemphasis  in, 

573-574 

small-noise  case,  570-57 1 
Amplitude  modulation  (AM),  11,  114-115, 
178,  179-180,  190-197,  876-877 
bandwidth-efficient.  197-206 
demodulation  of,  195—197 
double-sideband,  180-190 
generation  of,  195 
pulse  (PAM),  318 

single-sideband  (SSB),  198-199 

vestigial  sideband  (VSB). 206-210  , 

Amplitude  shift  keying  (ASK),  424, 680-683 
and  AM  modulation,  connection 
between,  427 
binary  (BASK),  620 
detection,  428 
Amplitude  spectrum,  56,  59 
Analog  signals,  4, 26-27 

Analog-to-digital  (A/D)  conversion, 

maximum  information  rate,  313-314 
nonideal  practical  sampling  analysis, 
314-318 

signal  reconstruction,  304-309 

aliasing  error  (spectral  folding),  310-312 


antialiasing  filter,  312 
filters,  realizability  of,  309-310 
ideal,  305-306 
practical,  306-309 

Angle  modulation,  179-180,  252,  254,  574 
bandwidth  of,  259-272 
features  of,  279-28 1 
frequency,  577-580 
immunity  of,  284-285 
and  mean  square  bandwidth,  581-584 
narrowband,  260-261, 580-581 
optimum  preemphasis-deemphasis  in, 

589-591 
phase,  575-577 
power  of,  256-259 
threshold  in,  585-589 
Antheil,  George,  723 
Antialiasing  filter,  312 

vs.  matched  filter,  771-774 
Antipodal  signals,  45, 673 
Aperiodic  signals,  27 

representation  by  Fourier  integral, 

91-99 

amplitudes  and  phases,  balance  of,  99 
conjugate  symmetry  property,  95 
Fourier  transform,  existence  of,  96-97 
Fourier  transform,  linearity  of,  97 
Fourier  transform,  physical  appreciation 
of,  97-99 

A  posteriori  probability,  640,  851 
Arbitrarily  small  error  probability,  907 
Armstrong,  Edwin  H.,  270-272 
indirect  methods  of,  275-277 
Asymmetric  digital  subscriber  line  (ADSL), 
808-809 

Asynchronous  channels,  338—339 
Autocorrelation  function,  46-47,  145-148 
of  power  signals,  150-154 

995 


Automatic  gain  control  (AGC),  139 
Automatic  repeat  request  (ARQ),  910 

Balanced  circuits,  1 85 
Balanced  discriminator,  283 
Balanced  modulators,  185 
double,  185 
single,  185 

Band-limited  white  Gaussian  noise.  863 
Bandpass  limiter,  273-274 
Bandpass  matched  filter,  as  coherent 
receiver,  620-62 1 
Bandpass  signals,  117-119 
Bandwidth,  9,  102-106 
of  angle  modulated  waves,  259-272 
essential,  141-144,  159,  386 
of  product  of  signals,  1 20 
Bandwidth-efficient  amplitude  modulations, 
197-206 

Baseband  communications,  178-179 
Baseband  signal,  2,  11,  178 
Basis  functions,  50.  629-630 
Basis  vectors,  48,  625 

Bayes’  decision  rule.  453,  478,  64 1 , 677,  85 1 

Bayes  receiver,  677-678 

Bayes’  theorem,  460-461 

BCJR  MAP  decoding  algorithm,  951-955 

Bernoulli  trials,  454-458 

Best  filter.  608 

Bezout  identity,  787-788 

Binary 

amplitude  shift  keying  (BASK),  620 
with  eight-zero  substitution  (B8ZS),  394 
message,  4 

with  N  zero  substitution  (BNZS) 
signaling,  394 

phase  shift  keying  (BPSK),  619-620 
polar  signaling,  optimum  linear  detector 
for,  605-61 1 
signaling,  611-619 

with  six-zero  substitution  (B6ZS),  394 
symmetric  channel  (BSC),  847,  914 
systems,  615-619 

with  three-zero  substitution  (B3ZS),  394 
threshold  detection,  606-607 
Bipartite  (Tanner)  graph,  961-962 
Bipolar  (pseudotemary)  signaling,  378, 
390-392 

Bit  (binary  digit),  320 


Bit  error  rate  (BER),  605,  614,  652,  928 
of  orthogonal  signaling.  665-666 
Bit-flipping  LDPC  decoding.  962 
Bit  loading.  806-807 
Bit  stuffing.  338-339 
Blind  equalization.  8 1 2-8 1 3 
Block  codes.  907 
linear.  911.913 
Block  interleaver,  944 
Bluetooth.  721-722 
Bose-Chaudhuri-Hocquenghen  (BCH) 
codes,  927 

Bounded-input-bounded-output  (BIBO) 
linear  system.  1 23 
Broad  OFDM  applications,  810 
Burst  error  correction  codes,  4 1 7,  93 1 
Butterworth  filters,  131-132 

Carrier,  11,  114 
Carrier  communications,  179 
OFDM.  793-803 
Carrier  power,  194-195 
Carson's  rule,  263 

Cauchy-Schwartz  inequality,  42n.  608,  980 
Causal  signal,  34 

CCITT  (Comity  Consultatif  International 

Telephonique  et  Telegraphique),  212 
cdmaOne  (IS-95),  737,  744,  745-746 
Cellular  networks,  743-744 
Cellular  systems,  CDMA  in,  744-745 
Central  limit  theorem 

for  sample  mean.  500-502 
for  sum  of  independent  random 
variables,  502 
Channel,  3 
Channel  bank,  339 
Channel  capacity,  10,  847,  853,  855, 

856,  866 

of  band-limited  AWGN  channel,  866-869 
of  continuous  memoryless  channel,  858 
of  discrete  memoryless  channel,  850 
of  infinite  bandwidth.  869 
Channel  diversity,  816 
Channel  equalization.  770 
receiver,  771-777 
Channel  estimation,  789-790 
Channel  matrix,  851 
Channel  shortening,  807 
Chase  algorithms,  947-948 
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Chebyshev’s  inequality,  489-490 
Chrominance  signals,  206,  226 
multiplexing,  226-228 
Cochannel  interference,  206 
Code-division  multiple  access  (CDMA),  723 
in  cellular  phone  networks,  743-747 
of  DSSS,  730-737,  743-749 
in  GPS,  747-749 
power  control  in,  736-737 
Code  efficiency,  844 
Code  generator  polynomial,  919 
Code  rate,  907,  908 
Code  tree,  933 

Coherent  AM  demodulation,  190,  569-570 
Coherent  receivers,  for  digital  carrier 
modulations,  619-624 
Color  burst,  206 

Common  channel  interoffice  signaling 
(CCIS),  335 


Continuous  time  signal,  25-26 
Convolution 
codes,  907,  932 
frequency,  119 
time,  1 19 

Convolution  theorem,  119-121 
Coprime,  788 

Correlation  coefficient,  41-47,  490-493 
Correlation  functions,  46-47 
Correlation  receiver,  61 1 
Costas  loop,  2 1 9-220 
Cross-correlation  coefficients,  46,  660 
Cumulative  distribution  function  (CDF), 
466-467 

Cutoff  frequency,  1 32 

Cyclic  codes,  918-927 

Cyclic  linear  block  code  theorem,  919 

Cyclic  prefix,  795,  807 

Cyclic  Redundancy  Check  (CRC)  codes,  927 


Communication 
baseband,  178-179 
carrier,  179 
systems,  1-4 
Compact  code,  843 
Compact  disc,  321 

Compact  trigonometric  Fourier  series, 
52-55 


Compandor,  326,  327-328 
Compatible  color  television  (CCTV), 


225-230 

luminance  and  chrominance  signals, 
multiplexing,  226-228 
receiver,  228—229,  230 
signals,  demultiplexing,  229 
Complement,  448 
Complete  orthogonal  basis,  48,  49 
Complete  orthogonal  set,  49,  626 
Concatenated  codes,  945-946 
Conditional  densities,  478 


Conditional  entropy,  85 1 
Conditional  probability,  452-4. 
multiplication  rule  for,  458 
of  random  variables,  464-466 
Conference  on  European  Postal  and 
Telegraph  Administration 

(CEPT),  335 

Comm™u'ptoe  frequency  shift  keying 
Continuous^ndunf  variables.  463, 467-470 


D-dimensional  sphere,  871-874 
Decision  feedback  equalizer  (DFE), 
790-793 

error  propagation  in,  792-793 
Decision  feedback  MUD  receiver,  742—743 
Decision  regions  and  error  probability, 
644-650 

Decorrelator  MUD  receiver,  740 
Deductive  logic,  461 
Delta  modulation  (DM),  179,  346-351 
adaptive  (ADM).  351 
comparison  with  PCM,  347-348 
overloading  in,  348-350 
sigma-delta  modulation,  350—35 1 
threshold  of  coding,  348-350 
Demodulation,  11,  13,  178,  182-184.  252 
of  AM  signals,  195-197 
of  DSB-SC  signals,  189-190 
of  FM  signals,  281-284 
Destination,  3 

Detection  error  probability,  416-41 7 
Detection  signal  space  dimensionality, 
637-640 

Deterministic  signals,  28,  447 
Differential  encoding,  405^106, 430,  686 
Differential  GPS,  748 
Differential  phase  shift  keying  (DPSK), 
429-431 

error  probability,  686-688 
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Differential  pulse  code  modulation  (DPCM), 

341-344 

analysis  of,  343-344 
SNR  improvement,  344 
Digital  audio  broadcasting  (DAB),  810-812 
Digital  broadcasting,  810 
Digital  carrier  systems 
analog  and  digital  carrier  modulations, 

connections  between,  427-428 
binary  carrier  modulations,  423-425 
demodulation,  428-43 1 
PSD  of,  425-427 
Digital  communication  systems, 

377-380,  767 
advantages  of,  321-322 
line  coder,  378-379 
multiplexer,  379 
performance  analysis  of,  605 
regenerative  repeater,  379-380 
source,  377 

Digital  data  system  (DDS),  340 
Digital  data  transmission,  principles  of,  377 
Digital  filters,  132-133 
Digital  multiplexing 
asynchronous  channels  and  bit  stuffing, 

338- 339 

plesiochronous  digital  hierarchy, 

339- 341 

signal  format,  336-338 
Digital  signals,  4-8,  26-27 
at  level  0  (DS0),  339 
at  level  1  (DS1),  333,  339 
distortionless  regenerative  repeaters  and 

nodes,  viability  of,  5-6 
noise  immunity  of,  4-5 
pulse  code  modulation,  7-8 
Digit  interleaving,  336 
Diode  bridge  modulator,  1 86 
Direct  FM  generation,  277-279 
Direct  sequence  spread  spectrum  (DSSS) 
against  broadband  jammers,  730 

CDMA  of,  730-737,  743-751 
against  narrowband  jammers,  729-730 
PN  sequence  generation,  725-726 
PSK,  optimum  detection  of,  724-725 
resilient  features  of,  728-730 
single-user,  726-728 
Dirichlet  conditions,  56-57 
strong,  57 
weak,  57 


Discrete  Fourier  transform  (DFT) 

FFT  algorithm  in.  1 60 
numerical  computation  of,  155-160 
points  of  discontinuity,  160 
Discrete  multitone  (DMT)  modulation, 
803-807 

real-life  applications  of,  808-812 
Discrete  random  variables,  462-464 
Discrete  time  signal,  25-26 
Dispersion,  134 
Distortion.  124-125,  133-138 
in  audio  and  video  signals,  127-129 
linear.  134-135 
due  to  multipaths,  137-138 
nonlinear.  135-137,  284-289 
Distortionless  transmission,  125-129 
Distributed  coordinator  function 
(DCF),  751 
Doppler  effect,  8 1 3 
Double  balanced  modulator,  1 85 
Double-sideband,  suppressed-carrier 

(DSB-SC)  modulation.  180,  181, 
565-566 

carrier  acquisition  in,  218-220 
nonlinear,  184-185 
signals,  demodulation  of,  189-190 
switching,  185-189 

Double-sideband  amplitude  modulation, 
180-190 

Downconversion,  189 
Duality 

property,  108-110 
time-frequency,  106-108 
Duobinary  signaling,  402—403 
detection  of,  404-406 
modified,  404 


Elastic  store,  338 
Element,  448 
Energy 

of  modulated  signals,  144-145 
signal,  21,  22,  28,  139-148 
scaler  product  and,  627-628 
of  sum  of  orthogonal  signals,  40-41 
Energy  spectral  density  (ESD),  140-141 
of  input  and  output,  148 
time  autocorrelation  function  and, 
145-148 
Entropy,  14,  839 
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Envelope  detection,  191-192 
AM  signal,  570 
condition  for,  192-194 
Envelope  detector,  196-197 
Equalizers 

decision  feedback  (DFE),  790-793 
feedforward  (FFW),  791 
finite  length  MMSE,  782-783 
fractionally  spaced  (FSE),  785, 
787-788,  789 
linear,  790-791 
time  domain  (TEQ),  807 
zero-forcing,  410—413,  778—780 
Equivalent  optimum  binary 
receivers,  615 

Equivalent  signal  sets,  668-676 
Error  correction  coding,  14-15 
Error-free  communication,  847-850, 
856-857,  870-871 

Error  probability  of  optimum  receivers, 
660-668 

Error  propagation,  in  DFE.  792-793 
Error  vector,  36 

Essential  bandwidth,  141-144,  159,  386 
Event,  448 

Exclusive  OR  (XOR),  910 
Experiment,  447—448 
Exponential  Fourier  series,  61-69 
Fourier  spectra,  63-64 
negative  frequency  mean.  64-68 
Parseval’s  theorem  in,  68-69 
Exponential  Fourier  spectra,  63-64 
Exponential  modulation,  254 
Extended  superframe  (ESF).  33  - 
Eye  diagrams,  417-420 
in  PAM,  423 


'ading  channels,  139 
flat,  815-816 

8.6 

-alse  alarm.  679 
:alse  dismissal,  679 

-araday,  Michael,  16  _8_8q 

'ast  Fourier  transform  ( FFT), 

algorithm  in  DFT  computations,  160 

Feedback  decoding,  942 

Feedforward  (FFW)  equalizers,  791 

Filtering,  164 


Filters 

antialiasing,  3 1 2 
bandpass  matched,  620-621 
best,  608 

Butterworth,  131-132 
digital,  132-133 
first-order-hold,  309 
ideal  vs.  practical,  129-133 
matched,  608-6 1 1 
optimum  receiver,  607-61 1 
reconstruction  filters,  realizability  of, 
309-310 
VSB,  208-209 

Finite  length  MMSE  equalizers,  782-783 

First-order-hold  filter,  309 

Flat  fading  channels,  815-816 

Folding  frequency,  159 

Forward  error  correcting  (FEC)  codes,  907 

Fourier  integral 

aperiodic  signal  representation  by,  91-99 
Fourier  series 

exponential,  61-69 
numerical  computation  of,  78—80 
ParsevaFs  theorem  in,  68-69 
trigonometric,  51-61 
Fourier  spectrum,  56-61 
Fourier  transform 
direct,  94 
discrete 

numerical  computation  of,  155-160 
duality  property,  108-110 
existence  of,  96-97 

frequency-shifting  property,  107,  114-119 
inverse,  94 
linearity  of,  97 

physical  appreciation  of,  97—99 
time  differentiation  property,  121-123 
time-frequency  duality,  106-108 
time  integration  property,  121-123 
time-scaling  property,  110-111 
time-shifting  property,  112-114 
Fractionally  spaced  equalizers  (FSE) 

MMSE  design,  789 
S1MO  model,  785-787 
ZF  design,  787-788 
Frame,  334 
Framing  bit,  334 

Frequency  compression  feedback  (H~b), 
588-589 

Frequency  converter  (mixer),  188-189 
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Frequency  convolution,  1 19 
Frequency  counters,  283 
Frequency  demodulators,  practical,  282-283 
Frequency  division  multiplexing  (FDM),  13, 
117,  178,211-212 
Frequency  hopping  spread  spectrum 
(FHSS),  714-717 
applications  of,  721-724 
asynchronous,  719-720 
performance,  with  multiple  user  access, 
718-719 

Frequency  modulation  (FM),  11, 252,  254, 
877, 577-580 

broadcasting  system,  291-292 
narrowband  (NBFM),  260,  261 
and  phase  modulation,  relationship 
between,  255-256 
signals,  demodulation  of,  281-284 
tone,  264-267 
waves,  generating,  272-281 
wideband  (WBFM),  261-263 
Frequency  multipliers,  275 
Frequency  resolution,  1 60 
Frequency  shift  keying  (FSK),  258,  424, 
621-622,  683-685 
detection,  428^129 
and  FM  modulation,  connection 
between,  427 
Gaussian,  722 

A/ -ary  FSK  and  orthogonal  signaling, 
431^433 

Frequency  spectra,  56 
Frequency-selective  channels,  770, 

880-884 

Frequency-selective  fading  channel,  138, 
814-815 

conversion  to  flat  fading  channel,  816 
Frequency-shifting  property,  107,  114-119 
Full-cosine  roll-off  characteristics,  400 

Gaussian  approximation,  of  nonorthogonal 
MAI,  733-734 

Gaussian  random  process  properties, 

633-635 

Gaussian  random  variable,  470-476 
sum  of,  498-500 
Generalized  angle,  253 
Generalized  Fourier  series,  50 
Generator  matrix,  91 1, 923-924 


Generator  polynomial,  923-924 
code,  919 

Geometric  interpretation,  in  signal  space, 
645-650 

Geometrical  signal  space,  624—626 
Global  Positioning  System  (GPS),  747 
differential,  748 
operation,  747-748 
spread  spectrum  in,  748-749 
Gram-Schmidt  orthogonalization,  629,  630, 
982-984 
Gray  code,  652 
Group  (envelope)  delay,  127n 
GSM  cellular  phones,  776 

Hadamard  inequality,  890,  891 
Hamming  bound,  909 
Hamming  codes,  909,  9 1 7-9 1 8 
Hamming  distance,  848,  912 
Hard-decision  decoding,  946 
Hermitian  symmetry,  95n 
High-definition  television  (HDTV),  220, 
360-361 

High-density  bipolar  (HDB)  signaling,  393 
High-quality  LP  vocoders,  355 
High-speed  packet  access  (HSPA),  747 
Hilbert  transform,  199-200 
Hold-in  (lock)  range,  218 
Homodyne  modulators,  1 89 
Huffman  code,  842-847 

Ideal  vs.  practical  filters,  129-133 
IEEE  802. 1 1 ,  72 1-722,  749-75 1 
Image  stations,  290 
Impulse  noise,  417 

Independence  vs.  uncorrelatedness,  493 
Independent  events,  454 
Independent  random  variables,  479 
variance  of  sum  of,  488-489 
Indirect  FM  generation,  275-277 
Inductive  logic,  461 
Information 

commonsense  measure  of,  836-837 
engineering  measure  of,  837-839 
entropy  of  a  source,  839-84 1 
Input  transducer,  2 
Instantaneous  frequency,  253-254 
Instantaneous  velocity,  253 
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Integrated  services  digital  network 
(ISDN),  422 

Interleaved  code,  944-945 

Interleaving  depth,  944 

Interleaving  effect,  291 

International  Mobile 

Telecommunications-2000  standard 
(IMT-2000),  746 

International  Telecommunications  Union 
known,  2 1 2 

Interpolation,  304,  306-309 
ideal,  305-306 

Intersection,  449 

Intersymbol  interference  (ISI),  134, 
394-395,  770 

Iterative  soft  bit  decoding,  949-950 


Jitter,  415, 416 

Joint  distribution,  476-478 

Joint  event,  449 

Joint  source-channel  coding,  15 

Justification,  338 


Karhunen-Loeve  expansion.  630, 676 


Lamarr,  Hedy,  723 
L- ary  digital  signal,  320 

Levinson-Durbin  algorithm - 

Linear  distortion,  3,  134-135 
Linear  equalizers,  790-79 1 
Linear  mean  square  estimation. 

Linear  prediction  coding  (LPC)  vocoders. 
352-355 

high-quality  LP  vocoders, . .  - 
LPC- 10  vocoder,  354 

voice 'models  and  model-based  vocoders. 

352-353 

Linear  system  u  m  129 

signal  transmission  through.  123  l  £ 
distortion,  in  audio  and  video  signals. 

127-129  i?S-127 

distortionless  transmission, 

signal  distortion ^ time 
Linear  time-invariant  (Ll 

system,  123,  148 

frequency  response  of.  U*, 

Line  coder.  378-379 


Line  coding.  378,  380-394 

bipolar  (pseudotemary  or  AMI)  signaling. 
390-392 

BNZS  signaling,  394 
HDB  signaling,  393 
on-off  signaling,  388-390 
polar  signaling,  385-387 
power  spectral  density.  381-385 
constructing  dc  null  in.  387-388 
properties  of,  380-38 1 
Line  spectral  pairs  (LSP),  354 
Local  carrier  synchronization,  210-2 1 1 
Logarithmic  units,  331-332 
Low-density  parity  check  (LDPC)  codes, 
959-965 

bipartite  (tanner)  graph.  961-962 
bit-flipping  decoding.  962 
decoding,  sum-product  algorithm  lor. 
963-965 

LPC- 10  vocoder,  354 
Luminance  signals.  226 
multiplexing,  226-228 

Manchester  (split-phase)  signaling.  388 
Marginal  densities,  477 
Marginal  probabilities,  465 

A# -ary 

ASK  and  noncoherent  detection.  431 
bandwidth  and  power  trade  offs,  653. 
666-667 

binary  polar  signaling.  650-653 
FSK  and  orthogonal  signaling.  431-433 
message.  4,  5 

PAM  signaling.  420-423. 434-435. 436 
QAM  analysis,  435-436. 653-659 
trading  power  and  bandwidth.  436437 
Matched  filter,  608-61 1 
MATLAB  exercises 

for  AM  modulation  and  demodulation, 
234-237 

basic  signals  and  signal  graphing.  69-71 
coefficients  Dn.  numerical  computation 
of.  76-78 

for  delta  modulation.  368-370 
digital  communication  systems.  816 
performance  analysis  of.  688 
for  DSB-SC  modulation  and 
demodulation.  230-234 
for  error  correcting  code.  966 
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MATLAB  exercises  ( Continued ) 
for  eye  diagrams,  437-438 
for  FM  modulation  and  demodulation. 
292-296 

Fourier  series,  numerical  computation  of, 
78-80 

for  Fourier  transform  computation, 
160-167 

for  information  theory.  893 
lowpass  signals,  sampling  and 
reconstruction  of,  361-364 
for  PCM,  364-368 
periodic  signals  and  signal  power. 

73-74 

for  QAM  modulation  and  demodulation. 
240-244 

signal  correlation.  74-76 
signal  operations,  71-73 
for  spread  spectrum  technologies,  75 1 
for  SSB-SC  modulation  and 
demodulation,  237-240 
Matrix  product  and  properties,  985 
Maximum  a  posterior  (MAP)  detection. 

BCJR  algorithm  for,  951 
Maximum  a  posteriori  probability  (MAP) 
detector,  64 1 

Maximum  capacity  power  loading, 

881-883 

Maximum  information  rate,  in  digital 
communication,  313-314 
Maximum  length  shift  register 
sequences,  725 

Maximum  likelihood  decoding,  914, 
936-939 

Maximum  likelihood  receiver,  678-679, 
739-740 

Maximum  likelihood  sequence  estimation 
(MLSE),  774-777 

complexity  and  practical  implementations, 
776-777 
Mean,  481-482 

of  function  of  random  variable,  482-483 
of  product  of  two  functions,  484 
of  sum,  483 

Mean  square  bandwidth,  581-585 
Measure-zero  set,  49n 
Memoryless  source,  839 
Message  signal,  2 
Minimax  receiver,  679-680 
Minimum  energy  signal  set,  671-674 


Minimum  mean  square  error  (MMSE), 
414,  780 

finite  data  design,  784-785 
finite  length  MMSE  equalizers, 
782-783 
FSE  design.  789 
and  optimum  delay,  782 
receiver.  740-742 
vs.  ZF.  783-784 

Minimum  shift  keying  (MSK),  622-624 
Minimum  weight  vector.  915 
Mobile  telephone  switching  office 
(MTSO),  744 
Modem.  423 

Modem  telecommunications,  historical 
review  of,  15-19 

Modified  duobinary  signaling,  404 
Modulated  communication  systems  under 
noise,  performance  of,  564 
amplitude-modulated  systems,  565-574 
angle-modulated  systems.  574-591 
pulse-modulated  systems,  591 
Modulated  signal.  1 15 
phase  spectrum,  shifting,  115-116 
power  spectral  density  of,  1 55 
Modulating  signal.  114 
Modulation 

amplitude  (AM),  11,  114-115,  178. 

179-180,  876-877 
angle.  179-180,  252,  254 
application  of,  117 
delta  (DM).  179,346-351 
discrete  multitone  (DMT),  803 
double-sideband,  suppressed-carrier 

(DSB-SC),  180,  181 
double-sideband  amplitude.  180-190 
frequency  (FM),  1 1,  252,  254,  255-256, 
260-267,  877 
index,  263 
nonlinear,  252-259 
phase  (PM).  11,  254.  255-256.  260, 
263-264 

pulse  amplitude.  179 
pulse  code  (PCM),  7-8,  179,  279, 
456-457.  878-879 
pulse  position  (PPM),  179,  318,  319 
pulse  width  (PWM),  179,  318,  319 
single-sideband  (SSB).  198-199 
tone,  183 

vestigial  sideband  (VSB),  206-210 
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Modulators 
balanced,  185 
coherent,  189 
diode  bridge,  1 86 
double  balanced,  185 
frequency,  282-283 
homodyne,  189 
multiplier,  184 
nonlinear,  184—185 
switching,  185-189 
synchronous,  190 
Moments  of  random  variables,  484 
central,  484 
Morse  code,  4, 14 

Moving  Picture  Experts  Group  (MPEG), 
352, 355-360 
MPSK  signals,  656-659 
Multiamplitude  signaling,  650-653 
Multicarrier  communication  system,  800 
Multipath  transmission,  137-138 

Multiple-input-multiple-output 

(M1MO),  816 

channel  capacity,  884-887,  898-900 
transmitter  with  channel  knowledge, 
889-893 

transmitter  without  channel  knowledge, 
887-888 


Multiplexer,  379 
Multiplexing,  12-13 
digital,  336-341 

frequency  division  (FDM),  13,  117,  178, 
211-212 
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T1  time  division,  332-333 
Multiplication  rule,  for  conditional 
probabilities,  458 

Multiplier  modulators,  184 

Multitone  signaling  (MFSK),  663 
noncoherent,  685-686 
Multiuser  detecion  (MUD),  737- 
decision  feedback  receiver,  742-/41 
decorrelator  receiver,  740 
MMSE  receiver,  740-742 


imum,  739-740 
power  control,  747 

al  information,  852,  864-866 
mnel  capacity  and,  895-898 

ally  exclusive  (disjoint)  events, 


Narrowband  angle  modulation,  260-261 
Narrowband  exponential  modulation 
(NBEM),  580 

Narrowband  frequency  modulation  (NBFM), 
260,  261 

generation,  272-273 

Narrowband  modulation,  260-261, 580-581 

Narrowband  phase  modulation  (NBPM),  260 

Natural  binary  code  (NBC),  320 

Near-far  problem,  735-736 

Near-far  resistance,  737 

Nodes,  5-6 

Noise,  3,  14 

Noisy  channel  coding  theorem,  907 
Noncoherent  detection,  680—688 
Nonideal  practical  sampling  analysis, 
314-318 

Nonlinear  distortion,  3,  135-137 
Nonlinear  DSB-SC  modulators,  184-185 
Nonlinear  modulation,  252-259 
Non-retum-to-zero  (NRZ)  pulses,  379 
Nonwhite  channel  noise,  676 
NTSC  (National  Television  System 
Committee) 

television  broadcasting  system,  220-230 
bandwidth,  222,  224 
video  spectrum,  224—225 
Null  event,  448 

Nyquist  criteria  for  zero  ISI,  395-400 
Nyquist  interval,  304 
Nyquist  sampling  rate,  304 


Offset  QPSK  (OQPSK),  745 
On-off  keying  (OOK),  424 
On-off  signaling,  378,  388-390,  617-618 
Optimum  delay,  782 
Optimum  linear  precoder,  893 
Optimum  linear  receiver  analysis,  61 1-615 
Optimum  MUD  receiver,  739 
Optimum  power  distribution,  805 
Optimum  power  loading 
in  OFDM/DMT,  883-884 
water-pouring  interpretation  of,  883 
Optimum  preemphasis-deemphasis, 

589-591 

Optimum  receiver 

filter,  607-61 1 

for  white  Gaussian  noise  channels,  635 
Optimum  threshold,  612-614 


Orthogonal  frequency  division  modulation 
(OFDM) 

channel  equalization  and,  770, 

802-803 

channel  noise,  799-801 
cyclic  prefix  redundancy  in,  802 
principles  of,  793-799 
real-life  applications  of,  808-812 
zero-padded,  801-802 
Orthogonality 

complex  signal  space  and,  39^0 
of  exponential  signal  set,  979 
of  trigonometric  signal  set,  978 
Orthogonal  signaling,  618-619, 

661-663,  880 

bandwidth  and  power  trade-offs  of  M  -ary, 
666-667 

energy  of  sum  of,  40-4 1 
Orthogonal  signal  sets,  47-50 
Orthogonal  signal  space,  49-50 
Orthogonal  vectors,  36,  625-626 
Orthogonal  vector  space,  47^18 
Orthonormal  basis  set,  49,  628-629 
Orthonormal  vectors,  626 
Outcomes,  448 
Output  transducer,  3 
Overhead  bits,  336 

Paley- Wiener  criterion,  131,310 
Parity  check  digits,  911 
Parity  check  matrix,  913 
Parse  val’s  theorem,  50,  139-140 
in  Fourier  series,  68-69 
Partial  reflection  coefficients 
(PARCOR),  354 

Partial  response  signaling,  401-402 
Perfect  code,  909 
Periodic  signals,  27 
Phase  coherent  (in  phase)  lock,  218 
Phase  delay,  127n 

Phase-locked  loop  (PLL),  211,  212-220, 
283-284,  589 
basic  operation,  213-214 
first-order  loop  analysis,  216-217 
hold-in  (lock)  range,  218 
phase  coherent  (in  phase) 
lock,  218 

pull-in  (capture)  range,  218 
small-error  analysis,  214-216 


Phase  modulation  (PM),  1 1,  254,  263-264, 
575-577 

and  frequency  modulation,  relationship 
between.  255-256 
narrowband  (NBPM),  260 
Phase  shift  keying  (PSK),  258,  424 
binary  (BPSK),  619-620 
detection,  429 

differential  (DPSK).  429—43 1 
differentially  coherent,  686-688 
and  QAM  modulation,  connection 
between,  427-428 
Phase  shift  method,  202 
Phase  spectrum,  56 
Piconet,  721 

Plain-old-telephone-service  (POTS),  808 
Plesiochronous  digital  hierarchy,  339-341 
Polar  signaling,  385-387,  615-617 
Power 

of  angle  modulated  wave,  256-259 

control,  736-737 

vs.  MUD,  747 

loading,  805 

signal,  28,  148-155 

Power  spectral  density  (PSD),  149-150, 
381-385 

of  amplitude  shift  keying,  425-427 
constructing  dc  null  in,  387-388 
of  digital  carrier  modulation,  425-427 
of  frequency  shift  keying,  426-427 
input  and  output,  154-155 
interpretation  of,  152 
of  modulated  signals,  155 
of  phase  shift  keying,  426-427 
Prediction  coefficients,  343 
Probability,  447-462 
axiomatic  theory  of,  461-462 
Probability  density  function  (PDF),  468 
Product  code,  945 
Progressive  taxation,  325-329 
Pseudonoise  (PN)  sequence  generation, 
725-726 

Public  switched  telephone  network 
(PSTN),  808 

Pull-in  (capture)  range,  218 
Pulse  amplitude  modulation  (PAM),  179, 
318,319,  650-653 
M  -ary  baseband  signaling,  for  higher 
order  rate,  420-423 
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Pulse  code  modulation  (PCM),  7-8,  179, 
279.  318,  319-332, 591-600, 
878-879 

channel  noise,  mean  square  value  of, 
486-488 


companded,  596-600 
differential  (DPCM),  341-344 
encoder,  329 

quantization  error,  mean  square  value  of, 
485-486 

repeater  error  probability,  456-457 
in  T 1  carrier  systems,  332-335 
total  mean  square  error  in,  489 
transmission  bandwidth  and  output  SNR. 


329-332 

>ulse  detection  error.  322 
>ulse  generation,  406 
•*0186  position  modulation  (PPM),  179, 
318,319 

>ulse  shaping,  387-388,  394-406 
controlled  1S1  or  partial  response 
signaling,  401-402 
differential  encoding,  405-406 
duobinary  pulse,  402-403, 404-406 
intersymbol  interferences,  394-395 
Nyquist’s  criteria  for  zero  1SI,  395-400 
in  PAM,  422 

zero-lSl,  duobinary,  and  modified 
duobinary,  pulse  relationship 
between,  403-404 
Pulse  stuffing,  338-339 
Pulse  width  modulation  (PWM),  179, 
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LP  (Qualcomm  code-excited  linear 
prediction)  vocoder.  745 
rature  amplitude  modulation  (QAM), 
198,  204-206 

-ary,  435-436,  653-659 

d  PSK,  connection  between,  4_- 

lrature  multiplexing,  205,  206,  566 

mzation,  7,  320,  322-324 

ror,  322 

>ise,  323 


idiation,  11-12 

lised  cosine  characteristics,  400 
indom  interleaver,  945 


Randomness,  14 
Random  processes,  447 

basic  functions  determination  for, 

629-630 

Gaussian  properties,  633-635 
Random  variables 

conditional  probabilities  of,  464-466 
continuous,  463, 467—470 
discrete,  462—164 
Gaussian,  470-476 
independent,  479, 488—489 
sum  of,  497-500 
Ratio  detector,  283 
Rayleigh  density,  479-48 1 
Receiver,  3 

Rectifier  detector,  195-196 
Recursive  systematic  convolutional  (RSC) 
code,  936,  955—956 
Redundancy,  13,14,  844 
Reed-Solomon  codes,  927 
Regenerative  repeater,  5-6,  379-380, 
409-410 

Relative  frequency,  449-452 
Relative  likelihood,  947 

Resource  exchange,  10-1 1 

Return-to-zero  (RZ)  pulses,  379 
Ring  modulator,  187,  188 
Robbed-bit  signaling,  335 
Roll-off  factor,  399 
Root-raised-cosine  pulse,  775 
Row-column  (RC)  constraint,  962 

Sample  space,  448 
Sampling  theorem,  6,  302-304 
applications  of,  318-319 
Scalar  product  and  signal  energy,  627-628 
Scatter  diagram,  49 1 , 698 
Scrambling,  406-409 
Selective-filtering  method,  202-203 
Sequential  decoding,  939-942 
Series  bridge  diode  modulator,  1 87 
Shannon’s  equation,  10,  875 
Shunt  bridge  diode  modulator,  1 87 
Sideband,  194-195 
Sigma-delta  modulation,  350-35 1 

Signal  distortion,  124-125 

in  audio  and  video  signals,  127- 14V 
Signal  energy,  21, 22,  139-148 
scalar  product  and.  627-628 
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Signal  power,  9-10,  148-149 
time  autocorrelation  of,  150-155 
Signal  reconstruction,  304-309 
aliasing  error  (spectral  folding),  310-312 
antialiasing  filter,  3 1 2 
filters,  realizability  of,  309-310 
ideal,  305-306 
practical,  306-309 
Signals 

bandpass,  117-119 
chrominance,  226-228 
energy,  21,  22 

frequency  domanin  description  of,  56 
0  luminance,  226-228 
operations,  29-32 
power,  21-22 
size  of,  20-25 

time  domain  description  of,  56 
vs.  vectors,  34—41 
Signals,  classification  of,  25 
analog,  26-27 
aperiodic,  27 
continuous  time,  25-26 
deterministic,  28 
digital,  26-27 
discrete  time,  25-26 
energy,  28 
periodic,  27 
power,  28 
probabilistic,  28 
Signals,  correlation  of,  41-47 
closest  relatives,  42-44 
complete  strangers,  42-44 
correlation  functions,  46-47 
opposite  personalities,  42-44 
signal  detection,  application  of,  44-46 
Signal  space 

analysis  of  optimum  detection,  624-629 
and  basis  signals,  626-629 
Signal-squaring  method,  218-219 
Signal-to-noise  ratio  (SNR),  9, 610 
in  DPCM,  344 

exchange  with  bandwidth,  876 
in  PCM,  329-332 

transmitter  power  loading  for  maximizing 
receiver,  804-805 
Simplex  signal  set,  674-676 
Simplified  signal  space  and  decision 
procedure,  640-644 
Sine  function,  100-102 


SINCGARS,  722-723 
Single  balanced  modulator,  185 
Single-input-multiple-output  (SIMO) 
model,  785-786 

Single-sideband,  suppressed-carrier 

(SSB-SC)  modulation,  566-567 
carrier  acquisition  in,  220 
Single-sideband  (SSB)  modulation,  198-199 
signals  with  carrier  (SSB+C),  204 
systems,  202-204 
phase  shift  method,  202 
selective-filtering  method,  202-203 
Weaver's  method,  203 
time  domain  representation  of,  200-202 
Sinusoidal  carrier,  1 79 
Slope  detection,  282,  283 
Slope  overload,  349 
Slotted  frequency  hopping,  719 
Soft  decoding,  946-948 
Soft-output  Viterbi  algorithm  (SOVA), 
949-950 
Source,  2 
Source  coding,  13 
randomness  and,  14 
redundancy  and,  14 
Source  encoding,  841-847 
Spectral  density,  99 
energy  (ESD),  140-141,  145-148 
power  (PSD),  149-150,  152,  154-155, 
387-388,  381-385, 425^127 
Spectrum,  99 
amplitude,  56,  59 

direct  sequence  spread  (DSSS),  724-737, 
743-751 
Fourier,  56-61 

frequency  hopping  spread  (FHSS), 
714-724 

phase,  56,  115-116 

spread  spectrum,  in  GPS,  714, 748-749 
vestigial,  399 
video,  224-225 
Standard  deviation,  484 
State  transition  diagram,  934—935 
Stationary  white  noise,  630 
Successive  interference  cancellation  (SIC) 
receiver,  742 

Sum-product  algorithm,  for  LDPC  decoding, 
963-965 
Superframe,  335 
extended  (ESF),  335 
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Superheterodyne  receiver,  289-290 
Superposition  theorem,  97 
Switching  modulators,  for  DSB-SC, 

185-189 
Symmetry,  61 
Synchronization,  334—335 
Synchronous  detection,  1 82 
Synchronous  modulators,  190 
Syndrome,  914, 942 
Systematic  code,  9 1 1 
Systemic  cyclic  codes,  921-923 
Systems,  20 

amplitude-modulated,  565-574 
BIBO  linear,  123 
binary,  615-619 
cellular,  744-745 
communication,  1-4 
digital  carrier,  423-43 1 
digital  communication,  377-380,  605, 
767,816 

FM  broadcasting,  291-292 
global  positioning  (GPS),  747-749 
linear,  123-129 

modulated  communication,  564-574,  59 1 
multicarrier  communication,  800 
T1  carrier,  332-335 

3G  cellular  services,  746-747 
Threshold  detection,  474-476 
binary,  606-607 
Threshold  detector,  45 
Threshold  effect,  572 

in  angle-modulated  systems,  585-589 
Time  autocorrelation  function,  145-148 
of  power  signals,  1 50-154 
Time  convolution,  119 
Time  differentiation  property,  121-123 
Time  division  multiple-access  (TDM A) 
systems,  338 

Time  division  multiplexing  (TDM),  13, 
318,319 

Time  domain  equalizer  (TEQ),  807 
Time-frequency  duality,  106-108 
Time  integration  property,  121-1- 
Time  inversion,  31-32 

Time  scaling,  29-31 

signal  duration,  reciprocity  ot, 

110-111 

significance  of,  HO 


Time  shifting,  29,  1 12-1 14 

linear  phase,  physical  explanation  of, 
112-113 

Time-varying  channels,  813-816 
Timing  extraction,  4 1 4-4 1 5 
Timing  jitter,  415,  416 
Tint  control,  229 
Toeplitz  matrix,  412 
T1  carrier  systems,  332-335 

synchronizing  and  signaling,  334-335 
time  division  multiplexing,  332-333 
Tone  frequency  modulation 
spectral  analysis  of,  264-267 
Tone  modulation,  183,  585 
T1  multiplexer,  339 

T1  time  division  multiplexing,  332-333 
Total  probability  theorem,  458-460 
Transmission,  11-12,  123-129 
digital  data,  377 
distortionless,  125-129 
multipath,  137-138 
Transmitter,  2,  804-805 
M1MO,  887-893 
Trellis  diagram,  935,  942-943 
Trigonometric  Fourier  series,  51-61 
compact,  52-55 
Fourier  spectrum,  56-61 
periodicity  of,  55-56 
symmetry,  effect  of,  6 1 
Trigonometric  set,  5 1 
T-spaced  equalization  (TSE),  772, 
777-785 

based  on  MMSE,  780 
zero-forcing  equalizer,  778-780 
Turbo  codes,  951-959 

Uncorrelated  variables,  492 
mean  square  of  sum  of,  493 
Undetermined  multipliers,  861 
Union,  448-449 
Unit  impulse  function,  32 

as  generalized  function,  33-34 
multiplication  of,  33 
sampling  property  of,  33 
Unit  impulse  signal,  32-34 
Unit  rectangular  function,  99-100 
Unit  step  function,  34 
Unit  triangular  function,  100 
Upconversion,  189 
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Variance,  484 

of  sum  of  independent  random  variables, 
488-489 

Vector  decomposition  of  white  noise  random 
processes,  629-635 

Vectors 

vs.  signals,  34-41 

complex  signal  space  and  orthogonality, 
39-40 

component  of  vector  along  another  vector, 
35-36 

decomposition  of  signal  and  signal 
components,  36-39 
Vector  space,  35 

Vestigial  sideband  (VSB)  modulation, 

~  206-210 

in  broadcast  television,  209-210 

filter,  208-209 

signals  with  carrier  (VSB+C),  208-209 
Vestigial  spectrum,  399 
Video  compression,  351,  355-360 
Viterbi  algorithm,  936-939 
Vocoders,  35 1 

linear  prediction  coding,  352-355 
Voltage-controlled  oscillator  (VCO), 
213-214 


Weaver's  method  of  SSB 
generation,  203 
White  channel  noise,  630-631 
White  Gaussian  noise,  614-615, 

632-633,  643 
White  noise,  630-63 1 

additive  white  Gaussian  noise 
(AWGN),  635 

Gaussian,  614-615,  632-633, 

635,  643 

geometrical  representation  of,  630-632 
stationary,  630 

vector  decomposition  of  random 
processes,  629-635 

Wideband  frequency  modulation  (WBFM), 
261-263 

Wireless  multipath  channels,  linear 
distortions  of,  767-770 
Word  interleaving,  336 

Zero-crossing  detectors,  283 
Zero-forcing  (ZF)  equalizer,  410-413, 
778-780 

FSE  design,  787-788 
Zero  padding,  157,  801 
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